Lina Weichbrodt
Is my AI alive but brain-dead? How monitoring can tell you if your machine learning stack is still performing
#1 about 2 minutes
Why defining the business problem is crucial for monitoring
Machine learning projects often have vague requirements, making it essential to define success KPIs before implementing monitoring.
#2 about 3 minutes
A real-world use case for loan rejection prediction
A machine learning model is used to predict loan application rejections upfront, saving significant monthly costs from credit agency queries.
#3 about 3 minutes
Using precision and recall for model training
Precision and recall are chosen as the key metrics to balance the model's accuracy in predicting rejections against the volume of applications it can identify.
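As a rough illustration of the two metrics for the rejection class, here is a minimal sketch in plain Python. The function name, the label encoding (1 = rejected, 0 = accepted), and the sample data are assumptions made for this example and do not come from the talk.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for the class labelled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)
    fp = sum(1 for t, p in pairs if p == positive and t != positive)
    fn = sum(1 for t, p in pairs if p != positive and t == positive)
    # Guard against empty denominators (no predicted or no actual positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Illustrative data only: 1 = application rejected, 0 = accepted.
actual = [1, 1, 1, 0, 0, 1, 0, 0]
predicted = [1, 1, 0, 0, 1, 0, 0, 0]
precision, recall = precision_recall(actual, predicted)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Precision here answers "of the applications the model flagged as rejections, how many really were rejected", while recall answers "of all real rejections, how many did the model catch".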
#4 about 2 minutes
Choosing gradient boosted trees for tabular data
Gradient boosted trees are selected over deep learning for this tabular data problem because they offer comparable performance with much faster training times.
#5 about 2 minutes
Using existing tools like Grafana for ML monitoring
You can leverage your existing software monitoring stack like Grafana and Prometheus for machine learning, which is often sufficient and avoids adopting immature tools.
#6 about 6 minutes
Monitoring model outcomes with a holdout set
When the true outcome is unknown due to model intervention, a holdout set of live traffic is used to calculate production metrics like precision and recall.
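One possible way to set up such a holdout is sketched below. It assumes that a small, fixed share of applications is processed as usual even when the model predicts a rejection, so the real outcome is still observed for that slice. The 5% share, the hashing scheme, and the record fields (`application_id`, `predicted_reject`, `actually_rejected`) are hypothetical choices for this example, not details from the talk.

```python
import hashlib


def in_holdout(application_id, fraction=0.05):
    """Deterministically assign an application to the holdout slice.

    Hashing the ID keeps the assignment stable across services and retries.
    """
    digest = hashlib.sha256(application_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big")  # integer in [0, 2**64)
    return bucket < fraction * 2**64


def holdout_precision_recall(records, fraction=0.05):
    """Compute precision and recall of 'reject' predictions on holdout traffic only."""
    tp = fp = fn = 0
    for r in records:
        if not in_holdout(r["application_id"], fraction):
            continue  # outcome was influenced by the model, so skip it
        predicted, actual = r["predicted_reject"], r["actually_rejected"]
        if predicted and actual:
            tp += 1
        elif predicted:
            fp += 1
        elif actual:
            fn += 1
    # None signals that the metric is undefined for this window.
    precision = tp / (tp + fp) if (tp + fp) else None
    recall = tp / (tp + fn) if (tp + fn) else None
    return precision, recall
```

The holdout share is a trade-off: a larger slice gives more stable metrics but gives up more of the savings the model was built to deliver.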
#7 about 3 minutes
Translating stakeholder fears into monitoring signals
Address stakeholder concerns by identifying their worst-case scenarios and creating specific metrics to monitor and alert on those potential issues.
#8 about 4 minutes
Monitoring the model's response distribution for drift
Track the distribution of model outputs over time using statistical distance metrics like the D1 distance to detect shifts that indicate a problem.
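The sketch below shows the general idea of comparing a reference window of model scores with a current window. It uses a simple L1 distance between binned histograms as a stand-in. This is not necessarily the distance metric used in the talk, and the bin count, the score range of 0 to 1, and the function names are assumptions for illustration.

```python
def histogram(scores, bins=10):
    """Normalised histogram of scores assumed to lie in [0, 1]."""
    if not scores:
        raise ValueError("need at least one score")
    counts = [0] * bins
    for s in scores:
        # Clamp so that out-of-range scores fall into the first or last bin.
        idx = min(max(int(s * bins), 0), bins - 1)
        counts[idx] += 1
    return [c / len(scores) for c in counts]


def l1_distance(p, q):
    """Sum of absolute differences: 0 for identical, 2 for disjoint histograms."""
    return sum(abs(a - b) for a, b in zip(p, q))


# Illustrative windows: last week's scores versus today's scores.
reference_scores = [0.10, 0.15, 0.20, 0.80, 0.85]
current_scores = [0.70, 0.75, 0.80, 0.85, 0.90]
drift = l1_distance(histogram(reference_scores), histogram(current_scores))
print(f"distance between windows: {drift:.2f}")
```

In practice the resulting number would be exported as a time series and alerted on when it stays above a threshold chosen from historical values.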
#9 about 2 minutes
Creating quality heuristics as sanity checks
Develop simple, human-understandable heuristics, such as the average rank of a user's favorite item, to serve as an intuitive quality indicator.
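A heuristic like the average rank of a user's favorite item can be computed in a few lines. The sketch below assumes a ranking per user and a known favorite per user. Both data structures and the function name are hypothetical and only serve to make the idea concrete.

```python
def average_favorite_rank(rankings, favorites):
    """Mean 1-based position of each user's favorite item in their ranking.

    Users whose favorite is unknown or not in the ranked list are skipped.
    Returns None when no user can be evaluated.
    """
    ranks = []
    for user, ranked_items in rankings.items():
        favorite = favorites.get(user)
        if favorite in ranked_items:
            ranks.append(ranked_items.index(favorite) + 1)
    return sum(ranks) / len(ranks) if ranks else None


# Illustrative data only.
rankings = {
    "u1": ["shoes", "jacket", "hat"],
    "u2": ["hat", "scarf", "shoes"],
}
favorites = {"u1": "shoes", "u2": "shoes"}
print(average_favorite_rank(rankings, favorites))
```

A sudden rise in this number suggests the system has stopped surfacing items users are known to like, which is easy to explain to non-technical stakeholders.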
#10 about 2 minutes
Monitoring input data to detect training-serving skew
Compare the distribution of input features between the training environment and live production to identify and debug training-serving skew.
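A very simple version of such a comparison looks at per-feature summary statistics in both environments. The sketch below compares the mean and the missing-value rate of numeric features and flags large differences. The tolerances, feature names, and function names are assumptions for this example, and a real setup would likely compare full distributions rather than means alone.

```python
def feature_summary(rows, feature):
    """Mean and missing-value rate of one numeric feature across rows (dicts)."""
    if not rows:
        raise ValueError("need at least one row")
    values = [row.get(feature) for row in rows]
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present) if present else None
    return {"mean": mean, "missing_rate": 1 - len(present) / len(values)}


def skewed_features(training_rows, serving_rows, features,
                    mean_tol=0.1, missing_tol=0.05):
    """Return the features whose serving statistics differ from training."""
    flagged = []
    for feature in features:
        train = feature_summary(training_rows, feature)
        serve = feature_summary(serving_rows, feature)
        missing_shift = abs(train["missing_rate"] - serve["missing_rate"])
        if train["mean"] is None or serve["mean"] is None:
            # A feature that is entirely missing on one side only is skewed.
            mean_shift = 0.0 if train["mean"] == serve["mean"] else float("inf")
        else:
            # Relative shift of the mean, guarded against a zero training mean.
            mean_shift = abs(serve["mean"] - train["mean"]) / max(abs(train["mean"]), 1e-12)
        if mean_shift > mean_tol or missing_shift > missing_tol:
            flagged.append(feature)
    return flagged


# Illustrative data: serving sends income in a different unit than training.
training = [{"income": 3000.0, "age": 30}, {"income": 5000.0, "age": 50}]
serving = [{"income": 300000.0, "age": 31}, {"income": 500000.0, "age": 49}]
print(skewed_features(training, serving, ["income", "age"]))
```

Unit mismatches, changed defaults, and features that silently become null in production are typical causes that a check of this kind can surface.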
#11 about 4 minutes
Key takeaways for practical machine learning monitoring
Monitoring in production focuses on detecting problems with indicator KPIs, not measuring absolute quality, and can be done by working backwards from business impact.
#12 about 15 minutes
Q&A on career paths and delayed outcomes
The Q&A session covers topics such as career entry points into machine learning, handling delayed outcomes in business processes, and stakeholder communication.