Designing Data Infrastructures for Multimodal Mobility Datasets - January 13, 2026
This event has ended, but you can still catch up! Watch the on-demand recordings and register for our future events.
Jan 13, 2026
9:00-10:00 AM Pacific
Online. Register for the Zoom!
About this event
This technical workshop focuses on the data infrastructure required to build and maintain production-grade mobility datasets at fleet scale. We will examine how to structure storage, metadata, access patterns, and quality controls so that mobility teams can treat perception datasets as first-class, versioned “infrastructure” assets. The session will walk through how to design a mobility data stack that connects object storage, labeling systems, simulation environments, and experiment tracking into a coherent, auditable pipeline.
Host
What you’ll learn:
Model the mobility data plane: Define schemas for camera, LiDAR, radar, and HD, and represent temporal windows, ego poses, and scenario groupings in a way that is queryable and stable under schema evolution.
Build a versioned dataset catalog with FiftyOne: Use FiftyOne customized workspaces and views to represent canonical datasets, and integrate with your raw data sources. All while preserving lineage between raw logs, the curated data, and simulation inputs.
Implement governance and access control on mobility data: Configure role-based access and auditable pipelines to enforce data residency constraints while encouraging multi-team collaboration across research, perception, and safety functions.
Operationalize curation and scenario mining workflows: Use FiftyOne’s embeddings and labeling capabilities to surface rare events such as adverse weather and sensor anomalies. Assign review tasks, and codify “critical scenario” definitions as reproducible dataset views.
Close the loop with evaluation and feedback signals: Connect FiftyOne to training and evaluation pipelines so that model failures feed back into dataset updates
By the end of the workshop, attendees will have a concrete mental model and reference architecture for treating mobility datasets as a governed, queryable, and continuously evolving layer in their stack.
Who should attend:
Data scientists, AV/ADAS engineers, robotics researchers, computer vision practitioners, and data infrastructure architects, looking to standardize and scale real-world mobility datasets for model development and simulation.