Álvaro Martín Lozano
Implementing continuous delivery in a data processing pipeline
#1 about 4 minutes
From research concepts to production-ready data products
The Volkswagen Data Lab shifted its focus from demonstrating proof-of-concepts to building and deploying real-world data solutions for its clients.
#2 about 7 minutes
Core concepts of continuous delivery for data
Continuous delivery for data pipelines adapts standard CI/CD principles to a setting where the data itself is the deliverable, moving each change through version control, integration, and deployment stages.
#3 about 11 minutes
Implementing a pipeline with immutable, versioned data
The five-step pipeline relies on treating data as immutable, creating a new versioned output for each run to enable simple rollbacks and reproducibility.
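The idea of immutable, versioned outputs can be illustrated with a minimal sketch. It is not the implementation shown in the talk: the directory layout, the `CURRENT` pointer file, and the function names `publish_run`, `set_current`, and `read_current` are all assumptions made for illustration. Each run writes to a fresh version directory, and a rollback only moves the pointer.

```python
import json
import tempfile
from pathlib import Path


def publish_run(root: Path, records: list[dict]) -> str:
    """Write records into a brand-new version directory and return its id.

    Hypothetical layout: <root>/v0001/data.json, <root>/v0002/data.json, ...
    """
    existing = sorted(p.name for p in root.glob("v*") if p.is_dir())
    next_number = int(existing[-1][1:]) + 1 if existing else 1
    version = f"v{next_number:04d}"
    target = root / version
    # exist_ok=False guarantees an earlier version is never overwritten.
    target.mkdir(parents=True, exist_ok=False)
    (target / "data.json").write_text(json.dumps(records))
    return version


def set_current(root: Path, version: str) -> None:
    """Point consumers at a version; rolling back is the same operation."""
    if not (root / version).is_dir():
        raise ValueError(f"unknown version: {version}")
    (root / "CURRENT").write_text(version)


def read_current(root: Path) -> list[dict]:
    """Load the data of whichever version the pointer names."""
    version = (root / "CURRENT").read_text().strip()
    return json.loads((root / version / "data.json").read_text())


if __name__ == "__main__":
    root = Path(tempfile.mkdtemp())
    first = publish_run(root, [{"id": 1}])
    second = publish_run(root, [{"id": 1}, {"id": 2}])
    set_current(root, second)   # deploy the newest run
    set_current(root, first)    # roll back without touching any data
    print(read_current(root))
```

Because no version is ever modified in place, reproducing an old result means reading an old directory, at the price of keeping every version on disk.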
#4 about 6 minutes
The challenge of orchestrating chained data jobs
Managing dependencies between jobs becomes complex when each job consumes versioned, immutable data inputs from upstream processes.
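One way to picture the dependency problem is a job that records exactly which upstream versions it consumed. The sketch below is an assumption-laden illustration, not the orchestration used in the talk: `RunManifest`, `plan_run`, `is_stale`, and the job names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RunManifest:
    """Record of one job run and the upstream versions it was built from."""
    job: str
    version: str
    inputs: dict  # upstream job name -> upstream version consumed


def plan_run(job: str, upstream: list, latest: dict, next_version: str) -> RunManifest:
    """Pin the current version of every upstream job before the run starts."""
    missing = [name for name in upstream if name not in latest]
    if missing:
        raise LookupError(f"no published version for: {missing}")
    pinned = {name: latest[name] for name in upstream}
    return RunManifest(job=job, version=next_version, inputs=pinned)


def is_stale(manifest: RunManifest, latest: dict) -> bool:
    """A run is stale once any upstream job has published a different version."""
    return any(latest.get(name) != ver for name, ver in manifest.inputs.items())


if __name__ == "__main__":
    latest = {"ingest": "v0003", "clean": "v0002"}
    run = plan_run("features", ["ingest", "clean"], latest, "v0001")
    latest["clean"] = "v0003"     # an upstream job publishes a new version
    print(is_stale(run, latest))  # the downstream job now needs a rerun
```

The bookkeeping grows with every additional job in the chain, which is where the orchestration complexity comes from.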
#5 about 5 minutes
Pros and cons of the immutable data approach
While this method offers powerful benefits like reproducibility and instant rollbacks, it introduces challenges in orchestration complexity and increased storage costs.