Liang Yu

Finding the unknown unknowns: intelligent data collection for autonomous driving development

What if you could cut autonomous driving data collection by 99.9%? Learn how an in-car AI identifies and uploads only the most valuable data for retraining.

Finding the unknown unknowns: intelligent data collection for autonomous driving development
#1about 1 minute

Finding the unknown unknowns in autonomous driving

The primary challenge in autonomous driving is identifying and collecting data on rare, anomalous scenarios that models are not trained to handle.

#2about 1 minute

Introducing Cariad and its unified software platform

Cariad, a Volkswagen subsidiary, is building a unified software and tech stack to accelerate innovation for all Volkswagen group brands.

#3about 2 minutes

The Big Loop system for intelligent data collection

The Big Loop system solves the high cost of traditional data collection by intelligently aggregating only useful information using dedicated hardware.

#4about 3 minutes

Understanding the long-tail problem in driving scenarios

The long-tail problem refers to rare but critical events, like noisy sensor data or unknown objects, that can be identified using methods like uncertainty estimation.

#5about 3 minutes

How INSTINCT software identifies valuable data

The INSTINCT software uses deep neural networks to analyze sensor data in real-time and calculate an uncertainty score to flag challenging scenarios for collection.

#6about 3 minutes

The complete data-driven development cycle in action

The Big Loop enables a continuous cycle of driving, uploading valuable data, labeling, retraining models, and deploying them back to vehicles via over-the-air updates.

#7about 1 minute

Scaling data collection with the pioneering fleet

The Big Loop technology is being deployed in a retrofitted "pioneering fleet" to scale data collection before its full rollout in millions of future vehicles.

#8about 5 minutes

Q&A on ethics, model deployment, and regional data

The discussion covers ethical dilemmas, local vs cloud model execution, setting dynamic uncertainty thresholds, and the goal of creating globally applicable models.

Related jobs
Jobs that call for the skills explored in this talk.

Featured Partners

Related Articles

View all articles
DC
Daniel Cranney
How software is steering vehicle technology
The automotive industry is entering a transformative era, and developers have a unique opportunity to be part of it. Cars are no longer just mechanical machines; they’re sophisticated tech platforms with software at their core. This shift, defined by...
How software is steering vehicle technology
CH
Chris Heilmann
With AIs wide open - WeAreDevelopers at All Things Open 2025
Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...
With AIs wide open - WeAreDevelopers at All Things Open 2025

From learning to earning

Jobs that call for the skills explored in this talk.

Director of Data & AI

Director of Data & AI

Concept LTD
Charing Cross, United Kingdom

Remote
£120-160K
PyTorch
TensorFlow
Data analysis
+2