Site Reliability Engineer (SRE) Manager
Role details
Job location
Tech stack
Job description
We're looking for an SRE Engineering Manager to join our dream team and help us build and maintain highly reliable, scalable, and secure infrastructure for our innovative, industry-leading products., * Oversee the architecture, scalability, and maintenance of our cloud infrastructure, ensuring system reliability, security, and efficiency.
- Act as a player-coach, contributing hands-on to day-to-day engineering and operational tasks.
- Define and implement SRE best practices (such as CI/CD pipelines, Infrastructure as Code, automated failovers, chaos engineering, and blameless post-mortems) across all projects under your responsibility.
- Define and track crucial reliability metrics (SLIs, SLOs, SLAs, error budgets, MTTR, MTTD) to evaluate the health of our platforms and monitor performance and stability.
- Collaborate closely with the Product and Engineering teams to align the infrastructure roadmap with new products and iterations, balancing feature velocity with system reliability.
- Assess the technical feasibility of new projects and features, providing high-level estimates, capacity planning, and architectural recommendations.
- Enhance team unity, ensuring that everyone feels part of the project and the company, and championing a culture of reliability across the broader engineering organization.
Requirements
Do you have experience in Unity?, If you're a highly motivated person with a real interest in complex systems, a passion for automation, and a drive to ensure maximum uptime for cutting-edge products, we've got the perfect job for you!, * You have previous experience in Site Reliability Engineering, systems engineering, or software engineering with an infrastructure focus.
- You have experience supporting multiple products with suites of cloud computing services such as Google Cloud and AWS.
- You know how to implement SRE best practices (Infrastructure as Code, observability, automation, incident management).
- You have experience with AI (e.g., utilizing AI-driven operations/AIOps or supporting AI-based infrastructure).
- You have proven leadership of SRE, DevOps, Platform, or Infrastructure teams, ensuring that they are aligned with engineering and work towards a common goal of reliability and scalability.
- You have previous experience managing people, delivery, system quality, and operational processes.
- You are an expert in promoting teamwork so that SREs collaborate seamlessly with software development teams, creating synergies for continuous improvement and shared ownership.
- You like to work in agile and dynamic environments.
- You have excellent communication and leadership skills.
- You are fluent in English. Spanish is a nice to have!
Benefits & conditions
Growth and career development
- At Leadtech, we prioritize your growth. Enjoy a flexible career path with personalized internal training and an annual budget for external learning opportunities.
Work-Life balance
- Benefit from a flexible schedule with flextime and the option of working full remote or from our Barcelona office. Enjoy free Friday afternoons with a 7-hour workday, plus a 35-hour workweek in July and August so you can savor summer!
Comprehensive benefits
- Competitive salary, full-time permanent contract, and top-tier private health insurance (including dental and psychological services).
- 25 days of vacation plus your birthday off, with flexible vacation options-no blackout days!
Unique Perks
- If you wish to come, in our office in Barcelona you'll find it coplete with free coffee, fresh fruit, snacks, a game room, and a rooftop terrace with stunning Mediterranean views.
- Additional benefits include ticket restaurant and nursery vouchers, paid directly from your gross salary.
Join us in an environment where you're free to innovate, learn, and grow alongside passionate professionals. At Leadtech, you'll tackle exciting challenges and be part of a vibrant team dedicated to delivering exceptional user experiences