Anshul Jindal & Martin Piercy

Aug 20, 2025 • World Congress 2025

Your Next AI Needs 10,000 GPUs. Now What?

Training large language models is a networking problem, not a compute problem. Learn how to keep thousands of GPUs from sitting idle.

#1 • about 2 minutes

Introduction to large-scale AI infrastructure challenges

An overview of the topics to be covered, from the progress of generative AI to the compute requirements for training and inference.

#2 • about 4 minutes

Understanding the fundamental shift to generative AI

Generative AI creates novel content, moving beyond prediction to unlock new use cases in coding, content creation, and customer experience.

#3 • about 6 minutes

Using NVIDIA NIMs and blueprints to deploy models

NVIDIA Inference Microservices (NIMs) and blueprints provide pre-packaged, optimized containers to quickly deploy models for tasks like retrieval-augmented generation (RAG).

#4 • about 4 minutes

An overview of the AI model development lifecycle

Building a production-ready model involves a multi-stage process including data curation, distributed training, alignment, optimized inference, and implementing guardrails.

#5 • about 6 minutes

Understanding parallelism techniques for distributed AI training

Training massive models requires splitting them across thousands of GPUs using tensor, pipeline, and data parallelism to manage compute and communication.
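
As a rough illustration of how the three forms of parallelism combine, the sketch below factors a job's GPUs into a tensor × pipeline × data grid and maps a flat GPU rank to its position in that grid. The `ParallelLayout` class, the `rank_to_coords` helper, and the 8 × 4 × 32 split are hypothetical choices made for this example, not figures from the talk. The tensor index is assumed to vary fastest so that tensor-parallel peers, which communicate most often, land on neighbouring ranks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParallelLayout:
    """Hypothetical description of how a training job's GPUs are divided."""
    tensor: int    # GPUs that split each layer's weight matrices
    pipeline: int  # stages, each holding a consecutive slice of layers
    data: int      # model replicas, each training on a different batch shard

    @property
    def world_size(self) -> int:
        # Every GPU belongs to exactly one (data, pipeline, tensor) cell.
        return self.tensor * self.pipeline * self.data


def rank_to_coords(rank: int, layout: ParallelLayout) -> tuple[int, int, int]:
    """Map a flat GPU rank to (data, pipeline, tensor) indices.

    Assumes the tensor index varies fastest, then pipeline, then data.
    """
    if not 0 <= rank < layout.world_size:
        raise ValueError(f"rank {rank} is outside 0..{layout.world_size - 1}")
    tensor_idx = rank % layout.tensor
    pipeline_idx = (rank // layout.tensor) % layout.pipeline
    data_idx = rank // (layout.tensor * layout.pipeline)
    return data_idx, pipeline_idx, tensor_idx


# Illustrative split: 8-way tensor, 4-way pipeline, 32-way data parallelism.
layout = ParallelLayout(tensor=8, pipeline=4, data=32)
print(layout.world_size)           # 1024
print(rank_to_coords(37, layout))  # (1, 0, 5)
```

With 8-way tensor parallelism and ranks numbered this way, each tensor-parallel group occupies eight consecutive ranks, which would fit inside a single 8-GPU server if ranks are assigned node by node.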

#6 • about 2 minutes

The scale of GPU compute for training and inference

Training large models like Llama requires millions of GPU hours, while inference for a single large model can demand a full multi-GPU server.
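
To get a feel for where "millions of GPU hours" comes from, here is a back-of-the-envelope sketch. It uses the common approximation that training a dense transformer costs about 6 floating-point operations per parameter per token. The model size, token count, peak throughput, utilization, and GPU memory below are illustrative assumptions, not numbers from the talk, and the inference estimate counts weights only (no KV cache or activations).

```python
import math


def training_gpu_hours(params: float, tokens: float,
                       peak_flops_per_gpu: float, utilization: float) -> float:
    """Estimate GPU hours from the ~6 * params * tokens FLOPs rule of thumb."""
    total_flops = 6 * params * tokens
    sustained_flops_per_gpu = peak_flops_per_gpu * utilization
    return total_flops / sustained_flops_per_gpu / 3600


def min_gpus_for_weights(params: float, bytes_per_param: float,
                         gpu_memory_bytes: float) -> int:
    """Lower bound on GPUs needed just to hold the weights for inference."""
    return math.ceil(params * bytes_per_param / gpu_memory_bytes)


# Assumed values: 70B parameters, 15T tokens, 1 PFLOP/s peak per GPU,
# 40% sustained utilization.
gpu_hours = training_gpu_hours(70e9, 15e12, 1e15, 0.4)
days_on_10k_gpus = gpu_hours / 10_000 / 24
print(f"{gpu_hours / 1e6:.1f} million GPU hours")
print(f"{days_on_10k_gpus:.1f} days on 10,000 GPUs")

# Assumed values: 405B parameters at 1 byte each, 80 GB of memory per GPU.
print(min_gpus_for_weights(405e9, 1, 80e9), "GPUs for the weights alone")
```

Under these assumptions the utilization term matters as much as the GPU count: halving sustained utilization doubles the GPU hours, which is why idle GPUs are so costly at this scale.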

#7 • about 3 minutes

Key hardware and network design for AI infrastructure

Effective multi-node training depends on high-speed interconnects like NVLink and network architectures designed to minimize communication latency between GPUs.
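
A simple cost model shows why interconnect speed matters so much. The sketch below estimates the time for a ring all-reduce, a standard way to sum gradients across GPUs, in which each GPU sends and receives 2 × (N − 1) chunks of size S / N. The function name, the 10 GB message size, and both bandwidth figures are assumptions chosen for illustration; they are not measurements of NVLink or of any specific network.

```python
def ring_allreduce_seconds(message_bytes: float, num_gpus: int,
                           bandwidth_bytes_per_s: float,
                           latency_s: float = 0.0) -> float:
    """Estimate ring all-reduce time with a latency + bandwidth cost model."""
    if num_gpus < 2:
        return 0.0  # nothing to exchange with a single GPU
    steps = 2 * (num_gpus - 1)  # reduce-scatter phase plus all-gather phase
    bytes_per_step = message_bytes / num_gpus
    return steps * (latency_s + bytes_per_step / bandwidth_bytes_per_s)


gradient_bytes = 10e9  # assumed 10 GB of gradients to synchronize
fast = ring_allreduce_seconds(gradient_bytes, 8, 400e9)  # assumed fast link
slow = ring_allreduce_seconds(gradient_bytes, 8, 50e9)   # assumed slower link
print(f"fast link: {fast * 1000:.1f} ms per all-reduce")
print(f"slow link: {slow * 1000:.1f} ms per all-reduce")
```

In this model the transfer time scales inversely with bandwidth, so an eightfold slower link makes every synchronization step eight times longer. Unless that time is overlapped with computation, the GPUs wait for it on every training step.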

#8 • about 3 minutes

Accessing global GPU capacity with DGX Cloud Lepton

NVIDIA's DGX Cloud Lepton is a marketplace connecting developers to a global network of cloud partners for scalable, on-demand GPU compute.
