Roberto Carratalá
One AI API to Power Them All
#1about 5 minutes
The challenge of building production-ready AI applications
The current AI landscape is fragmented with many tools, making it complex to build, scale, and maintain applications with features like RAG and agents.
#2about 3 minutes
Introducing Llama Stack for a unified AI API
Llama Stack, an open-source project from Meta, provides a standardized, modular framework to simplify AI development with a single API for various components.
#3about 3 minutes
Standardizing model inference and safety guardrails
Llama Stack abstracts away differences between local and remote LLMs and integrates safety shields to filter harmful inputs and outputs.
#4about 2 minutes
Simplifying retrieval-augmented generation (RAG) pipelines
Llama Stack organizes the complex RAG process into three distinct, swappable layers for vector embeddings, retrieval, and agentic workflows.
#5about 4 minutes
Building AI agents using the Model Context Protocol
Llama Stack simplifies agent creation by integrating tools, orchestration, and reasoning models through the standardized Model Context Protocol (MCP).
#6about 3 minutes
Gaining application observability with built-in telemetry
Llama Stack provides out-of-the-box telemetry using OpenTelemetry, enabling developers to trace multi-step agent workflows with tools like Jaeger.
#7about 4 minutes
A local demo of inference, safety, and agents
This live demo showcases running Llama Stack locally to perform inference, block unsafe prompts, use an agent to check the weather, and inspect traces in Jaeger.
#8about 1 minute
Transitioning AI applications from local to production
Llama Stack enables a seamless transition from a local development setup to a scalable production environment on Kubernetes by maintaining a consistent API.
#9about 5 minutes
A production demo of a multi-agent business workflow
A complex agent interacts with multiple MCP servers to query a CRM, analyze customer data, send Slack notifications, and generate a PDF report.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
22:29 MIN
Testing Spring AI applications with local LLMs
What's (new) with Spring Boot and Containers?
06:08 MIN
Understanding the modern LLM application stack
Building AI Applications with LangChain and Node.js
05:08 MIN
The opaque and complex stack of modern LLM services
You are not my model anymore - understanding LLM model behavior
04:20 MIN
Comparing open source tools for serving LLMs
Self-Hosted LLMs: From Zero to Inference
31:19 MIN
Exploring APIs and frameworks for Java developers
Enter the Brave New World of GenAI with Vector Search
13:32 MIN
Introducing RAGStack as an opinionated development framework
Accelerating GenAI Development: Harnessing Astra DB Vector Store and Langflow for LLM-Powered Apps
34:19 MIN
A final summary of Stack Overflow's AI journey
The Data Phoenix: The future of the Internet and the Open Web
23:48 MIN
Integrating decentralized tech and AI into your stack
End-to-End TypeScript: Completing the Modern Development Stack
Featured Partners
Related Videos
Self-Hosted LLMs: From Zero to Inference
Roberto Carratalá & Cedric Clyburn
DevOps for AI: running LLMs in production with Kubernetes and KubeFlow
Aarno Aukia
The State of GenAI & Machine Learning in 2025
Alejandro Saucedo
Agentic AI Systems for Critical Workloads
Mario Fusco
Enterprise Integration Is Dead! Long Live AI-Driven Integration with Apache Camel
Bruno Meseguer & Markus Eisele
Azure AI Foundry for Developers: Open Tools, Scalable Agents, Real Impact
Oliver Will
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
Java Meets AI: Empowering Spring Developers to Build Intelligent Apps
Timo Salm
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.


Lead Engineer - Agentic AI Platform (AWS, Bedrock, Multi-Tenant Control Plane)
CloudiQS
Remote
£70-106K
Senior
React
Python
Node.js
+5

AI Agent Builder & Experimenter (Fullstack)
autonomous-teaming
München, Germany
Remote
API
React
Python
TypeScript

MLOps / DevOps Engineer (AI/ML & GenAI) Ubicación: España
Talent Connect
Municipality of Madrid, Spain
Bash
Azure
DevOps
Python
Docker
+9
![Full Stack Engineer (AI and FrontendFocused)"}}]},{"@context":"https://schema.org/","@type":"JobPosting","@id":"#jobPosting","title":"Senior Full Stack Engineer](https://wearedevelopers.imgix.net/public/default-job-listing-cover.png?w=400&ar=3.55&fit=crop&crop=entropy&auto=compress,format)
Full Stack Engineer (AI and FrontendFocused)"}}]},{"@context":"https://schema.org/","@type":"JobPosting","@id":"#jobPosting","title":"Senior Full Stack Engineer
Luzmo
Senior
CSS
RxJS
Node.js
Angular
JavaScript
+4

Senior / Lead AI Developer - LLMs & Agentic Workflows
KEMIO Consulting
Municipality of Madrid, Spain
Remote
Senior
API
Python

AI & Embedded ML Engineer (Real-Time Edge Optimization)
autonomous-teaming
Canton of Toulouse-5, France
Remote
C++
GIT
Linux
Python
+1

AI & Embedded ML Engineer (Real-Time Edge Optimization)
autonomous-teaming
München, Germany
Remote
C++
GIT
Linux
Python
+1
