ECCV 2026 Workshop

Event-Based Multimodal Vision: Imaging, Perception, and Understanding

Imaging, Perception, and Understanding with Event-Based Multimodal Vision

ECCV 2026 Malmo, Sweden Sept 8-13, 2026

Call for Paper Submission EBMV @ ECCV 2026 Submit
ECCV 2026 workshop skyline visual

Important Dates

Challenge & Website Launch

May 10, 2026

Training / Validation Set Release

May 12, 2026

Test Set & Evaluation Server Online

June 20, 2026

Submission Start

July 01, 2026

Challenge Results Announcement

July 5, 2026

Abstract Registration

July 15, 2026

Submission Deadline

July 20, 2026

Notification to Authors

August 5, 2026

Camera-ready Deadline

August 12, 2026

ECCV 2026 Workshop

Sept 8–13, 2026

About

Our workshop focuses on research at the intersection of event-based sensing and multimodal vision, spanning the full pipeline from sensing systems and low-level imaging to perception and high-level understanding.

Sensing Systems

  • Event-based and neuromorphic vision with multimodal visual sensing.
  • Hybrid systems with event cameras, RGB, LiDAR, IMU, and language.
  • Datasets, simulators, benchmarks, and annotation pipelines.
  • Spatial-temporal calibration, synchronization, and alignment.

Low-Level Imaging

  • Event-guided illumination modeling and brightness adjustment.
  • Enhancement under low-light, HDR, blur, noise, and reconstruction scenarios.
  • Image or intensity reconstruction and physics-aware video synthesis.
  • Illumination-aware representations and sensor-aware modeling.

Mid-Level Perception

  • Optical flow, depth, visual odometry, SLAM, and 3D reconstruction.
  • Segmentation, detection, feature tracking, and visual tracking.
  • Multimodal fusion for challenging dynamic environments.
  • Model-based, embedded, and learning-based event-driven perception.

High-Level Understanding

  • Visual grounding, action understanding, and spatiotemporal reasoning.
  • Event-based MLLMs and event-stream vision-language learning.
  • Autonomous driving, robotics, and embodied intelligence.
  • Cross-modal reasoning with asynchronous sparse event representations.

Computing and Hardware

  • Novel event cameras, neuromorphic processors, and event-based systems.
  • Efficient architectures, including state-space and lightweight designs.
  • Sensor-aware learning grounded in event generation mechanisms.

Applications

  • Computational imaging and industrial inspection.
  • Autonomous driving, robotics, AR/VR, and edge intelligence.
  • Deployment-oriented solutions for dynamic real-world scenarios.
  • Imaging
  • Perception
  • Understanding
  • Event + RGB + Language

Challenges

SEE Challenge 2026 visual

Low-Level Imaging Challenge: Event-Guided Brightness Adjustment with SEE 600K

A benchmark challenge for event-guided brightness adjustment under broad lighting conditions, including low light, over-exposure, mixed illumination, and high-contrast scenes. Participants use RGB frames and event streams to restore well-exposed and structurally faithful images.

Dataset: SEE-600K

EventAid Challenge 2026 visual

Low-Level Imaging Challenge: Event-Guided High-frame-rate Video Reconstruction on EventAid and ERF-X170FPS

A benchmark challenge for event-guided high-frame-rate video reconstruction. Participants use low-frame-rate RGB videos and event streams to recover high-frame-rate videos with accurate restoration of fast motion.

Dataset: EventAid, ERF-X170FPS

SDE Depth Challenge 2026 visual

Low-Level Imaging Challenge: Event-Guided Low-Light Video Depth Estimation on SDE

A benchmark challenge for event-guided depth estimation in low-light video scenes. Participants use RGB frames and event streams to recover accurate depth maps under challenging illumination.

Dataset: SDE

CoSEC Depth Challenge 2026 visual

Mid-Level Perception Challenge: Event-Guided Monocular Depth Estimation with CoSEC

A benchmark challenge for event-guided monocular depth estimation in real-world scenes. Participants use RGB frames and event streams to predict accurate depth maps.

Dataset: CoSEC

CoSEC Segmentation Challenge 2026 visual

Mid-Level Perception Challenge: Event-Guided Semantic Segmentation with CoSEC

A benchmark challenge for event-guided semantic segmentation in real-world scenes. Participants use RGB frames and event streams to predict pixel-level semantic labels.

Dataset: CoSEC

High-level understanding challenge placeholder visual

High-Level Understanding Challenge: Event-Guided Spatial Reasoning with EventBench

This challenge benchmarks MLLMs' spatial reasoning on event-based vision, focusing on object counting, absolute distance estimation, and spatial relationship reasoning.

Dataset: EventBench

High-level understanding challenge placeholder visual

High-Level Understanding Challenge: Event Stream Understanding with EventBench

This challenge aims to systematically evaluate and push the boundaries of MLLMs in understanding and reasoning over event-based vision data. Our goal is to foster the development of robust, parameter-efficient multi-modal models capable of deep semantic comprehension, spatial-temporal reasoning, and fine-grained recognition within complex event streams.

Dataset: EventBench

Schedule

Time Session Speaker / Notes
08:30 - 09:00 Registration TODO
09:00 - 09:15 Opening and Welcome TODO
09:15 - 10:45 Invited Talks Session I TODO
11:00 - 12:30 Challenge Session TODO
14:00 - 15:30 Invited Talks Session II TODO
15:45 - 16:45 Poster and Discussion TODO
16:45 - 17:15 Awards and Closing TODO

Invited Speakers

Kostas Daniilidis portrait

Kostas Daniilidis

University of Pennsylvania, USA

Multi-view geometry, 3D scene reconstruction, and geometric vision theory.

Homepage

Guillermo Gallego portrait

Guillermo Gallego

Technical University of Berlin, Germany

Event-based vision, continuous-time motion estimation, and spatiotemporal modeling.

Homepage

Priyadarshini Panda portrait

Priyadarshini Panda

University of Southern California, USA

Neuromorphic computing, spiking neural networks, and event-driven learning systems.

Homepage

Shintaro Shiba portrait

Shintaro Shiba

The University of Tokyo, Japan

Event-based perception for autonomous driving and large-scale real-world deployment.

Homepage

Federico Becattini portrait

Federico Becattini

University of Siena, Italy

Computer vision, Autonomous driving, Trajectory prediction, Fashion recommendation, Event camera

Homepage

Liyuan Pan portrait

Liyuan Pan

Beijing Institute of Technology, China

Event cameras, spiking vision models, and high-speed visual perception.

Homepage

Daniel Gehrig portrait

Daniel Gehrig

University of Pennsylvania, USA

Event-based optical flow, learning-based motion estimation, and high-speed vision.

Homepage

Bo Mu portrait placeholder

Bo Mu

OmniVision Technologies, Inc., USA

Event-based vision applications, neuromorphic sensors, and real-world deployment.

Homepage

Min Liu portrait placeholder

Min Liu

DVSense Technology Co., Ltd., China

Event camera sensors, imaging hardware design, and industrial vision systems.

Homepage

Jian Deng portrait placeholder

Jian Deng

AlpsenTek, Switzerland

Neuromorphic vision sensors, edge imaging devices, and low-power perception hardware.

Homepage