Ankit Patel
How AI Models Get Smarter
#1about 2 minutes
How AI models are surpassing human experts
AI models are now exceeding human expert performance on comprehensive benchmarks like MMLU, which measures intelligence across various subjects.
#2about 5 minutes
The shift from labeled to unlabeled data training
The transformer architecture enabled a major shift from training on limited, human-labeled data to pre-training on vast amounts of unlabeled internet text using next-token prediction.
#3about 8 minutes
Refining models with post-training techniques
Pre-trained models are made useful for specific tasks like chatbots through post-training methods such as supervised fine-tuning and reinforcement learning from human feedback (RLHF).
#4about 3 minutes
Improving answer quality with reasoning models
Reasoning models improve accuracy by using test-time scaling, a process where the model prompts itself to double-check facts and logic before providing a final answer.
#5about 5 minutes
A practical workflow for AI application developers
Developers can build AI applications by starting with an API, using structured prompt engineering, and evaluating models in context rather than relying solely on benchmarks.
#6about 3 minutes
Implementing guardrails to secure your application
Protect your AI application from manipulation and misuse by implementing guardrails, detailed system prompts, and specialized guard models to enforce desired behaviors.
#7about 3 minutes
Building modular agentic applications with tools
Agentic applications use a modular architecture where each agent can use specific tools, often defined with natural language prompts, to perform complex tasks.
#8about 4 minutes
Q&A on model behavior and synthetic data
This Q&A covers why LLM responses are non-deterministic, how synthetic data is used for model distillation, and strategies for preventing hallucinations.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
WALTER GROUP
Wiener Neudorf, Austria
Intermediate
Senior
Python
Data Vizualization
+1
Matching moments
03:07 MIN
Final advice for developers adapting to AI
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
02:20 MIN
The evolving role of the machine learning engineer
AI in the Open and in Browsers - Tarek Ziadé
04:05 MIN
How AI code generators have become more reliable
AI in the Open and in Browsers - Tarek Ziadé
04:28 MIN
Building an open source community around AI models
AI in the Open and in Browsers - Tarek Ziadé
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
00:48 MIN
The shift to on-device AI models in smartphones
Fake or News: Coding on a Phone, Emotional Support Toasters, ChatGPT Weddings and more - Anselm Hannemann
03:31 MIN
Using AI to make work more human, not replace humans
Turning People Strategy into a Transformation Engine
Featured Partners
Related Videos
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
Bringing the power of AI to your application.
Krzysztof Cieślak
You are not an AI developer
Zan Markan
AI & Ethics
PJ Hagerty
Open Source AI, To Foundation Models and Beyond
Ankit Patel, Matt White, Philipp Schmid, Lucie-Aimée Kaffee & Andreas Blattmann
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
The shadows of reasoning – new design paradigms for a gen AI world
Jonas Andrulis
The State of GenAI & Machine Learning in 2025
Alejandro Saucedo
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

Amazon.com Inc.
Senior
R
API
Unix
Perl
Ruby
+7

Agenda GmbH
Remote
Intermediate
API
Azure
Python
Docker
+10

FDTech GmbH
Boldecker Land, Germany
R
GIT
Python
A/B testing
Machine Learning
+1


score4more GmbH
Berlin, Germany
Remote
Intermediate
API
Scrum
React
DevOps
+8

Imec
Azure
Python
PyTorch
TensorFlow
Computer Vision
+1

Agenda GmbH
Raubling, Germany
Remote
Intermediate
API
Azure
Python
Docker
+10

Plain Concepts
Remote
Azure
Python
Computer Vision
Machine Learning
+2