Register for the event
Virtual
Americas
CV Meetups
Workshops
Designing Data Infrastructures for Multimodal Mobility Datasets - January 14, 2026
Jan 14, 2026
9:00-10:00 AM Pacific
Online. Register for the Zoom!
About this event
This technical workshop focuses on the data infrastructure required to build and maintain production-grade mobility datasets at fleet scale. We will examine how to structure storage, metadata, access patterns, and quality controls so that mobility teams can treat perception datasets as first-class, versioned “infrastructure” assets. The session will walk through how to design a mobility data stack that connects object storage, labeling systems, simulation environments, and experiment tracking into a coherent, auditable pipeline.
Host

What you’ll learn:

  • Model the mobility data plane: Define schemas for camera, LiDAR, radar, and HD, and represent temporal windows, ego poses, and scenario groupings in a way that is queryable and stable under schema evolution.
  • Build a versioned dataset catalog with FiftyOne: Use FiftyOne customized workspaces and views to represent canonical datasets, and integrate with your raw data sources. All while preserving lineage between raw logs, the curated data, and simulation inputs.
  • Implement governance and access control on mobility data: Configure role-based access and auditable pipelines to enforce data residency constraints while encouraging multi-team collaboration across research, perception, and safety functions.
  • Operationalize curation and scenario mining workflows: Use FiftyOne’s embeddings and labeling capabilities to surface rare events such as adverse weather and sensor anomalies. Assign review tasks, and codify “critical scenario” definitions as reproducible dataset views.
  • Close the loop with evaluation and feedback signals: Connect FiftyOne to training and evaluation pipelines so that model failures feed back into dataset updates
By the end of the workshop, attendees will have a concrete mental model and reference architecture for treating mobility datasets as a governed, queryable, and continuously evolving layer in their stack.

Who should attend:

Data scientists, AV/ADAS engineers, robotics researchers, computer vision practitioners, and data infrastructure architects, looking to standardize and scale real-world mobility datasets for model development and simulation.