Area of interest

Anticipating human motion is a key skill for intelligent systems that share a space or interact with humans. Accurate long-term predictions of human movement trajectories, body poses, actions or activities may significantly improve the ability of robots to plan ahead, anticipate the effects of their actions or to foresee hazardous situations. The topic has received increasing attention in recent years across several scientific communities with a growing spectrum of applications in service robots, self-driving cars, collaborative manipulators or tracking and surveillance. This workshop is the eight in a series of ICRA 2019-2026 events. The aim of this workshop is to bring together researchers and practitioners from different communities and to discuss recent developments in this field, promising approaches, their limitations, benchmarking techniques and open challenges.

Social and Predictive Navigation

Exploring the intersection of human behavior and robot navigation, join us to discuss the latest advancements in algorithms and applications that enhance the safety and efficiency of robots in dynamic, human-inhabited environments.

Automated driving

Focusing on the critical role of motion prediction in autonomous vehicles, we address the challenges of real-time decision-making and safety in self-driving technologies, fostering collaboration among researchers and practitioners to shape the future of automated driving.

Collaborative and production robots

Our workshop brings together experts in robotics and AI to explore how motion prediction can improve human-robot collaboration in industrial and service settings. Discover the latest research on anticipatory algorithms that enable robots to work alongside humans more safely and effectively.

Multi-modal Foundation Model Integration

Investigating the integration of multi-modal data in motion prediction, this workshop showcases cutting-edge research on combining visual, sensory, and contextual information to enhance robotic systems' predictive capabilities. Learn about the tools and techniques driving innovation in collaborative and autonomous robotics.

Call for Papers

We welcome researchers in the field to submit papers to be presented in pitch-talks and as posters. Submitted manuscripts can be at most 4 pages (excluding references), formatted according to ICRA standards using the Paper Template downloadable on the IEEE ICRA 2026 website (two-column format). We encourage authors to additionally submit a video clip to complement their manuscript. Submissions will be reviewed and selected based on their originality, relevance to the workshop topics, contributions, technical clarity, and presentation.

Important Dates:

  • Submission Deadline: April 15, 2026 Extended to April 30, 2026
  • Author Notification: April 30, 2026 Final Extension to May 15, 2026
  • Workshop: Friday June 5th, 2026

Upload your paper

We will accept submissions through CMT, as soon as the call for papers is opened.

We look forward to receiving your submissions!

Topics of Interest

We invite researchers to submit original contributions related to long-term human motion prediction. The papers will be presented in pitch-talks and as posters. Topics of interest include, but are not limited to:

  • Motion trajectory prediction in 2D and 3D
  • Predicting articulated human motion
  • Early action and activity recognition
  • Motion and Task Planning in dynamic environments considering motion predictions
  • Uncertainties related to prediction inputs/outputs and their propagation
  • Anticipation of group and crowd motion
  • Human motion prediction and safety
  • Human-robot Interaction considering predictions
  • Evaluation of prediction algorithms: datasets, metrics, and benchmarks
  • Predictive planning and control
  • Applications of motion prediction techniques
  • Visual scene prediction

The Microsoft CMT service was used for managing the peer-reviewing process for this conference. This service was provided for free by Microsoft and they bore all expenses, including costs for Azure cloud services as well as for software development and support.

Speakers

Katie Driggs-Campbell

Katherine Driggs-Campbell, University of Illinois

Zhi Yan

Zhi Yan, ENSTA

Jean Oh

Jean Oh, Carnegie Mellon University

Lukas Schmid

Lukas Schmid, Technische Universität Nürnberg (UTN)

Tim Pfeifer

Tim Pfeifer, Siemens AG

Dave Woollard

David Woollard, Standard AI

Program

Time Speaker Title Abstract
9:00 - 9:15 Organizers Welcome -
9:15 - 9:45 Katherine Driggs-Campbell On the Limitations of Prediction for Interaction and Collaborations (and some solutions)
View Abstract

Robots are becoming prevalent in our everyday lives and are changing the foundations of our way of life. However, the desirable impacts of robots are only achievable if the underlying algorithms can understand and predict human behaviors and trajectories. This research area has been growing in popularity, but how much of it is actually useful for robot planning and collaboration? In this talk, we will discuss some of the limitations (or unnecessary) aspects of prediction algorithms (e.g., partial observations, distribution assumptions). We'll present some solutions using graph representations for long-term motion prediction, and present work on multi-level prediction that is useful for long-term interaction between humans and robots. We'll look at the impact in crowd navigation, collaborative assembly, and repeated interactions in driving. Through our experiments, we found that focusing human models to the task results in effective human-robot interaction that is safe, interactive, and efficient.

9:45 - 10:15 Zhi Yan From Perception to Navigation: The Sandbox of Long-Term Human Motion Prediction
View Abstract

Long-term human motion prediction (LHMP) is one of the key technologies for robots to operate in dynamic, human-centered environments. However, its success largely depends on the robustness of upstream perception and the efficiency of downstream navigation. This talk aims to provide a perspective on LHMP by connecting the different stages of the mobile robot autonomy process. We will first explore the challenges of upstream perception, which uses 3D LiDAR data for human detection and tracking. Then, we will discuss downstream deployment, namely how to enable robots to navigate in a way that is both in line with human habits and respects social norms. By anchoring LHMP between perception and navigation, we will elaborate the systemic and integration requirements for long-term robot autonomy in shared spaces with humans.

10:15 - 10:45 Jean Oh Predicting under the curse of rarity: self-driving to aviation
View Abstract

TBD

10:45 - 11:00 Coffee Break - -
11:00 - 12:00 Poster Session - -
12:00 - 13:30 Lunch break - -
13:30 - 14:00 Lukas Schmid Understanding past, current, and future human motion
View Abstract

Understanding human motion is essential for the next generation of embodied AI and human-centric robotics. However, in robotics, this understanding is characterized through several different phenomena. A robot needs to be able to capture both short-term dynamics, such as currently moving people, and long-term dynamics, such as changes in the scene imparted by human actions outside the view of the robot, in order to reconstruct past and predict future human actions and motion. This talk will highlight recent advances in the detection of both short-term and long-term dynamics, their unification for wholistic 4D robot perception, and the prediction of future actions and motions based on these observations. The presented methods are evaluated on-board mobile robots and available as open-source software.

14:00 - 14:30 Kashyap Chitta World Models: The Next Frontier of Motion Prediction
View Abstract

Predicting how the multi-agent physical world evolves in response to our actions is a fundamental bottleneck in modern Physical AI, particularly in the context of simulation for training autonomous driving agents. While current systems have made strides by using advanced reconstruction techniques like 3D Gaussian Splatting, they remain constrained by limited spatial and temporal prediction horizons. This talk explores the paradigm shift toward generative world modeling for motion prediction, simulation, and control. We will trace the evolution from abstract behavioral simulations to today's state-of-the-art, pixel-level generative simulators, highlighting recent architectures that use diffusion transformers to synthesize long-horizon, action-conditioned sensor data. We then outline the next frontier: moving beyond using world models as "frozen" simulators. We discuss the emerging potential of latent world models that allow reinforcement learning to actively shape the world model during training, enabling applications beyond simulation such as predictive control.

14:30 - 15:00 Johannes Betz Overcoming Blind Spots: Occlusion Consideration for Improved Autonomous Driving Safety
View Abstract

TBD

15:00 - 15:15 Coffee break - -
15:15 - 15:45 Dave Woollard From Prediction to Priors: Stable Structure in Human Motion for Robot Navigation
View Abstract

Much of robot navigation in shared human environments is framed through trajectory prediction: forecasting where people will move and planning accordingly. In this talk, I present a complementary perspective: while individual human trajectories are noisy and difficult to predict exactly, large-scale motion data reveals stable population-level structure that can also guide robot behavior. Drawing on millions of real-world trajectories from operational retail environments, we show how local motion patterns can serve as coordination priors for planning, biasing robots toward behaviors that are more compatible with surrounding human flow. We also discuss implications for evaluation through EvenFlow, a benchmark built from real human navigation scenarios. More broadly, this suggests that human motion data has value not only for prediction, but also for planning and evaluation through the stable behavioral structure it reveals.

15:45 - 16:15 Alessandro Corbetta Understanding Pedestrian Physics: Large-Scale Measurements, Physics-Based Modeling, and Generative AI
View Abstract

Achieving a predictive understanding of pedestrian crowd dynamics is a major scientific challenge, with close connections to a wide range of fields, from statistical physics to social robotics. Over the past fifteen years, advances in automated vision have enabled increasingly precise measurements of crowd behavior across progressively larger spatial scales. Real-world data acquisition in public spaces, operating on a 24/7/365 basis, has led to datasets comprising millions of individual trajectories, capturing both typical patterns and rare events. These developments have opened the door to large-scale, data-driven modeling approaches aimed at quantitatively predicting the statistical features of pedestrian dynamics, well beyond average behaviors. In this talk, I will first review our work on large-scale crowd measurements. I will then address the challenge of developing models that achieve quantitative statistical accuracy across regimes, from dilute to dense crowds. In particular, I will discuss recent and ongoing approaches based on Langevin equations and variational principles, which successfully reproduce experimental ensemble statistics. I will conclude with an application of autoregressive AI methods to predict complex n-body dynamics and infer effective n-body interactions directly from large-scale trajectory data.

16:15 - 16:45 Organizers Closing and Best Poster Award -

Program Committee

  • Allan Wang, Miraikan
  • Andrew Stratton, University of Michigan
  • Boris Ivanovic, NVIDIA Research
  • Davide De Lazzari, University of Padua
  • Drazen Brscic, Kyoto University
  • Hao Xing, TUM
  • Janik Kaden, Chemnitz University of Technology
  • Jon Skerlj, Technical University of Munich
  • Junyi Shi, Aalto University
  • Linlin Cheng, Vrije Universiteit Amsterdam
  • Luis Figueredo, Technical University of Munich
  • Lukas Schmid, MIT-SPARK Lab
  • Mariam Hassan, EPFL
  • Mattia Piccinini, Technical University of Munich
  • Nemanja Djuric, Aurora Innovation
  • Nils Mandischer, University of Augsburg
  • Nisarga Nilavadi, University of Technology Nuremberg
  • Till Hielscher, University of Stuttgart
  • Vincent Pfaefflin, KIT
  • Yang Gao, EPFL
  • Yufei Zhu, Örebro University
  • Zhi Yan, ENSTA

Organizers

Luigi Palmieri

Luigi Palmieri, Robert Bosch GmbH

Allan Wang

Allan Wang, Miraikan - the National Museum of Emerging Science and Innovation

Dražen Brščić

Dražen Brščić, Kyoto University

Luis Figueredo

Luis Figueredo, University of Nottingham

Adriana Tapus

Adriana Tapus, ENSTA, Institut Polytechnique de Paris

Proceedings

The table below lists all the accepted papers to the workshop.

Title (paper PDF) Authors Poster PDF
Plausible and Feasible Long-Term Human Trajectory Prediction via Motion Field-Regularized Flow Matching Yufei Zhu Poster
An Enhanced Human Navigation Simulator for benchmarking Human-Aware Robot Navigation Noé Pérez-Higueras, Miguel Escudero, Fernando Caballero, Luis Merino Poster
Mapping Spatial Motion Patterns in Non-Stationary Environments Junyi Shi, Zekun Wang, Ting Fu, Yufei Quan, Tomasz Kucner Poster
Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty Prakash Aryan, Kaushik Raghupathruni, Timo Kehrer, Sebastiano Panichella Poster
Vision-Based Safe Human-Robot Collaboration with Uncertainty Guarantees Jakob Thumm, Marian Frei, Tianle Ni, Matthias Althoff, Marco Pavone Poster
Real-Time Robotic Emotional Expression from Mixed-Reality Demonstrations via Flow Matching Chao Wang, Michael Gienger, Fan Zhang Poster
Efficient and Reliable Teleoperation through Real-to-Sim-to-Real Shared Autonomy Shuo Sha, Yixuan Wang, Binghao Huang, Antonio Loquercio, Yunzhu Li Poster
Navigating the Crowd: Non-linear MPC with Social Forces Dynamics for Human-Aware Robot Navigation Stefano Trepella, Andrea Ostuni, Mauro Martini, Pablo Pueyo, Noé Pérez-Higueras, Marcello Chiaberge, Fernando Caballero, Luis Merino Poster
Multi-Agent Pickup and Delivery in Human-Populated Environments Benedetta Flammini, Leo D'Amato, Francesco Amigoni Poster
Modeling Kinematic Signatures of Aging for Human Motion Prediction in Assistive Robotics Yiran Liu, Eugene Zhao, Karim Habashy, Chang Shu, Pengcheng Xi Poster
Perceived Safety of Workers in Encounters with Large Industrial AGVs Ansgar Howey, Tim Schreiter, Andrey Rudenko, Achim J. Lilienthal Poster
The Pedestrian-Robot Interaction Dataset (PeRoI) for Learning Distinct Social Navigation Forces Subham Agrawal, Nico Ostermann-Myrau, Nils Dengler, Maren Bennewitz Poster
EgoTraj-Bench: Towards Robust Trajectory Prediction under Ego-view Noisy Observations Jiayi Liu, Jiaming Zhou, Ke Ye, Kun-Yu Lin, Allan Wang, Junwei Liang Poster not available

In collaboration with

TUM
ORU
Bosch
RIG
Nvidia
Miraikan
Kyoto University
University of Nottingham
ENSTA Paris
Retenua