Robert Lehmann
Planet-Scale Dashboards
#1about 3 minutes
The challenge of creating monitoring dashboards from scratch
Monitoring is often an afterthought, leading to painful incident response without the necessary dashboards for troubleshooting.
#2about 3 minutes
Understanding Google's unique observability scaling challenges
Google's massive scale, global distribution, and monorepo architecture created a unique need for a scalable, reusable monitoring solution.
#3about 5 minutes
Building reusable dashboards with templated dimensions
Replace hardcoded values in queries with template variables, called dimensions, to create a single dashboard that can be reused for any service.
#4about 6 minutes
Solving dashboard discovery with scopes and traits
Address the problem of too many dashboards by having users select a "scope" (e.g., a service), which then uses discovered "traits" to show only relevant dashboards.
#5about 2 minutes
Modeling different entities with scope types
Introduce "scope types" to create namespaces for different kinds of monitorable entities, such as servers, databases, or machine learning models.
#6about 4 minutes
Why infrastructure as code is not the right solution
Static provisioning with infrastructure-as-code or dashboards-as-code is insufficient because it lacks dynamic runtime information and creates a stale second source of truth.
#7about 3 minutes
Improving performance at scale with query variants
Use pre-aggregated metrics and define multiple query "variants" within a graph, allowing the system to automatically select the most performant query based on the user's drill-down level.
#8about 1 minute
Visualizing dependencies with a service graph
Leverage the scope and dependency information to build a service graph that helps engineers quickly navigate between related systems during an incident.
#9about 1 minute
Key takeaways for building planet-scale dashboards
A summary of the core principles: use dimensions for reusability, traits for discovery, scope types for genericity, and variants for performance.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Hubert Burda Media
München, Germany
€80-95K
Intermediate
Senior
JavaScript
Node.js
+1
Matching moments
01:32 MIN
Organizing a developer conference for 15,000 attendees
Cat Herding with Lions and Tigers - Christian Heilmann
03:34 MIN
The business case for sustainable high performance
Sustainable High Performance: Build It or Pay the Price
03:16 MIN
Improving the developer feedback loop with specialized tools
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
01:06 MIN
Malware campaigns, cloud latency, and government IT theft
Fake or News: Self-Driving Cars on Subscription, Crypto Attacks Rising and Working While You Sleep - Théodore Lefèvre
03:39 MIN
Breaking down silos between HR, tech, and business
What 2025 Taught Us: A Year-End Special with Hung Lee
05:28 MIN
The origin story of the Polypane developer browser
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
03:38 MIN
Balancing the trade-off between efficiency and resilience
What 2025 Taught Us: A Year-End Special with Hung Lee
02:39 MIN
Establishing a single source of truth for all data
Cat Herding with Lions and Tigers - Christian Heilmann
Featured Partners
Related Videos
Monitoring as Code - Managing your dashboards at scale
Gabriel Labachelerie
Single Server, Global Reach: Running a Worldwide Marketplace on Bare Metal in a Cloud-Dominated World
Jens Happe
Modularity: Let's dig deeper
Pratishtha Pandey
The Rise of Reactive Microservices
David Leitner
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
Building Systems that Last
Werner Vogels
Handling incidents collaboratively is like solving a rubix cube
Nele Uhlemann
Blueprints for Success: Steering a Global Data & AI Architecture
Dominik Schneider
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning


Google
Charing Cross, United Kingdom
£67K
Senior
Google Cloud Platform

apeirum
Municipality of Santander, Spain
Remote
Senior
CSS
HTML
React
TypeScript



Neweasy
Charing Cross, United Kingdom
Remote
£65-91K
Data analysis

