Roberto Carratalá
One AI API to Power Them All
#1about 5 minutes
The challenge of building production-ready AI applications
The current AI landscape is fragmented with many tools, making it complex to build, scale, and maintain applications with features like RAG and agents.
#2about 3 minutes
Introducing Llama Stack for a unified AI API
Llama Stack, an open-source project from Meta, provides a standardized, modular framework to simplify AI development with a single API for various components.
#3about 3 minutes
Standardizing model inference and safety guardrails
Llama Stack abstracts away differences between local and remote LLMs and integrates safety shields to filter harmful inputs and outputs.
#4about 2 minutes
Simplifying retrieval-augmented generation (RAG) pipelines
Llama Stack organizes the complex RAG process into three distinct, swappable layers for vector embeddings, retrieval, and agentic workflows.
#5about 4 minutes
Building AI agents using the Model Context Protocol
Llama Stack simplifies agent creation by integrating tools, orchestration, and reasoning models through the standardized Model Context Protocol (MCP).
#6about 3 minutes
Gaining application observability with built-in telemetry
Llama Stack provides out-of-the-box telemetry using OpenTelemetry, enabling developers to trace multi-step agent workflows with tools like Jaeger.
#7about 4 minutes
A local demo of inference, safety, and agents
This live demo showcases running Llama Stack locally to perform inference, block unsafe prompts, use an agent to check the weather, and inspect traces in Jaeger.
#8about 1 minute
Transitioning AI applications from local to production
Llama Stack enables a seamless transition from a local development setup to a scalable production environment on Kubernetes by maintaining a consistent API.
#9about 5 minutes
A production demo of a multi-agent business workflow
A complex agent interacts with multiple MCP servers to query a CRM, analyze customer data, send Slack notifications, and generate a PDF report.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
Matching moments
06:28 MIN
Using AI agents to modernize legacy COBOL systems
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
05:03 MIN
Building and iterating on an LLM-powered product
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
02:20 MIN
The evolving role of the machine learning engineer
AI in the Open and in Browsers - Tarek Ziadé
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
04:04 MIN
Shifting HR from standard products to AI-powered platforms
Turning People Strategy into a Transformation Engine
07:39 MIN
Prompt injection as an unsolved AI security problem
AI in the Open and in Browsers - Tarek Ziadé
03:45 MIN
Preventing exposed API keys in AI-assisted development
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
Featured Partners
Related Videos
Self-Hosted LLMs: From Zero to Inference
Roberto Carratalá & Cedric Clyburn
DevOps for AI: running LLMs in production with Kubernetes and KubeFlow
Aarno Aukia
The State of GenAI & Machine Learning in 2025
Alejandro Saucedo
Agentic AI Systems for Critical Workloads
Mario Fusco
Enterprise Integration Is Dead! Long Live AI-Driven Integration with Apache Camel
Bruno Meseguer & Markus Eisele
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
Azure AI Foundry for Developers: Open Tools, Scalable Agents, Real Impact
Oliver Will
Java Meets AI: Empowering Spring Developers to Build Intelligent Apps
Timo Salm
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

Starion Group
Municipality of Madrid, Spain
API
CSS
Python
Docker
Machine Learning
+1


Pasiona Consulting Sl
Municipality of Madrid, Spain
Remote
React
Python
Agile Methodologies


theHRchapter
Calp, Spain
Remote
DevOps
Agile Methodologies
Continuous Integration

Xablu
Hengelo, Netherlands
Intermediate
.NET
Python
PyTorch
Blockchain
TensorFlow
+3

INTENT HQ
Barcelona, Spain
TypeScript
Amazon Web Services (AWS)
