Stanislas Girard
Chatbots are going to destroy infrastructures and your cloud bills
#1 about 3 minutes
Comparing web developers and data scientists before GenAI
Before generative AI, web developers focused on CPU-bound tasks and horizontal scaling while data scientists worked with GPU-bound tasks and vast resources.
#2 about 3 minutes
The new AI engineer role and the RAG pipeline
The emergence of the AI engineer role combines web development and data science skills, often applied to building RAG pipelines for data ingestion and querying.
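To make the two halves of a RAG pipeline concrete, here is a minimal sketch of ingestion and querying. It is not the speaker's implementation: the word-count "embedding", the `ToyRagIndex` class, and `build_prompt` are hypothetical stand-ins for a real embedding model, a vector store, and the prompt sent to an LLM.

```python
from __future__ import annotations

import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (assumption: replaces a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyRagIndex:
    """Hypothetical in-memory index standing in for a vector database."""

    def __init__(self) -> None:
        self.chunks: list[tuple[str, Counter]] = []

    def ingest(self, document: str, chunk_size: int = 20) -> None:
        """Ingestion side: split a document into word chunks and store each with its vector."""
        words = document.split()
        for start in range(0, len(words), chunk_size):
            chunk = " ".join(words[start:start + chunk_size])
            self.chunks.append((chunk, embed(chunk)))

    def query(self, question: str, top_k: int = 2) -> list[str]:
        """Query side: return the top_k chunks most similar to the question."""
        q = embed(question)
        ranked = sorted(self.chunks, key=lambda c: cosine(q, c[1]), reverse=True)
        return [text for text, _ in ranked[:top_k]]


def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Assemble the text that would be sent to an LLM (the model call itself is omitted)."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

In a real system the ingestion step is typically the heavier one (parsing, chunking, embedding), which is why it tends to be the first candidate for moving out of the request path.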
#3 about 2 minutes
Key architectural challenges in building GenAI apps
Generative AI applications face unique architectural problems, including long response times, sequential bottlenecks, and the difficulty of mixing CPU and GPU-bound processes.
#4 about 3 minutes
How a simple chatbot evolves into a large monolith
Adding features like document ingestion and web scraping to a simple chatbot can rapidly increase its resource consumption and Docker image size, creating a complex monolith.
#5 about 4 minutes
Refactoring a monolithic AI app into a service architecture
To manage complexity and cost, a monolithic AI application should be refactored by separating user-facing logic from heavy background tasks into distinct, independently scalable services.
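The separation described here can be sketched with a job queue between the user-facing code and the heavy work. This is an illustration under stated assumptions, not the architecture from the talk: an in-process `queue.Queue` and threads stand in for a message broker and separately deployed worker services, and `submit_document`, `heavy_ingestion`, and `start_workers` are hypothetical names.

```python
import queue
import threading
import uuid

# Assumption: an in-process queue stands in for a real message broker.
jobs: "queue.Queue[tuple[str, str]]" = queue.Queue()
status: dict = {}


def heavy_ingestion(document: str) -> int:
    """Placeholder for parsing and embedding; here it only counts words."""
    return len(document.split())


def submit_document(document: str) -> str:
    """User-facing side: enqueue the work and return a job id immediately."""
    job_id = uuid.uuid4().hex
    status[job_id] = {"state": "queued", "result": None}
    jobs.put((job_id, document))
    return job_id


def worker_loop() -> None:
    """Background side: pull jobs off the queue and record the outcome."""
    while True:
        job_id, document = jobs.get()
        try:
            status[job_id] = {"state": "done", "result": heavy_ingestion(document)}
        except Exception as exc:  # record the failure instead of killing the worker
            status[job_id] = {"state": "failed", "result": str(exc)}
        finally:
            jobs.task_done()


def start_workers(count: int = 2) -> None:
    """Scale the background side independently by choosing the worker count."""
    for _ in range(count):
        threading.Thread(target=worker_loop, daemon=True).start()
```

A caller would run `start_workers(2)`, call `submit_document(...)` from the request handler, and poll `status[job_id]` later. The point of the design is that the request returns as soon as the job is queued, and the number of workers can change without touching the user-facing code.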
#6 about 3 minutes
Choosing the right architecture for your application's workload
A monolithic architecture is suitable for low or continuous workloads, while a service-based approach is necessary for applications with high or spiky traffic to manage costs and scale effectively.
#7 about 2 minutes
Overlooked challenges of running AI applications in production
Beyond core architecture, running AI in production involves complex challenges like managing GPUs on Kubernetes, model versioning, data compliance, and testing non-deterministic outputs.
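One common way to approach testing non-deterministic outputs is to assert properties of a response rather than its exact wording, and to track a pass rate over repeated runs. The sketch below illustrates that idea only. `fake_model` is a hypothetical stand-in for an LLM call, and the JSON shape with `answer` and `sources` keys is an assumption made for this example.

```python
import json
import random


def fake_model(question: str, rng: random.Random) -> str:
    """Hypothetical stand-in for an LLM call: the wording varies between runs."""
    opener = rng.choice(["Sure.", "Certainly.", "Here you go."])
    payload = {"answer": f"{opener} You asked: {question}", "sources": ["doc-1"]}
    return json.dumps(payload)


def check_response(raw: str) -> list:
    """Return the list of violated properties (an empty list means the response passes)."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return ["not valid JSON"]
    if not isinstance(data, dict):
        return ["not a JSON object"]
    problems = []
    answer = data.get("answer")
    if not isinstance(answer, str) or not answer.strip():
        problems.append("missing answer")
    if not data.get("sources"):
        problems.append("no sources cited")
    return problems


def pass_rate(question: str, runs: int = 20, seed: int = 0) -> float:
    """Call the model repeatedly and report the fraction of responses that pass."""
    rng = random.Random(seed)
    passed = sum(not check_response(fake_model(question, rng)) for _ in range(runs))
    return passed / runs
```

With a real model, a test would typically assert that `pass_rate(...)` stays above a chosen threshold instead of requiring every run to succeed.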
#8 about 2 minutes
Using creative evaluations and starting with small models
A creative evaluation using a game like Street Fighter reveals that smaller, faster LLMs can outperform larger ones for many use cases, making them a better starting point.