Software Engineering manager
Role details
Job location
Tech stack
Job description
We're building the next-generation foundational platform - with developer experience at the heart of every decision. Leveraging modern technologies and a true platform-as-a-product mindset, we're reshaping how the entire organisation builds, ships, and operates software. As Software Engineering manager, you will play a pivotal role in defining our golden paths, shaping our developer portal, and enabling a large engineering organisation to move faster, safer, and with far more autonomy. This is your chance to build the rails that power high-velocity product engineering at scale., Team leadership: Line manage, mentor and grow a team of platform engineers and SREs. Build a high-performing, psychologically safe culture that values learning, accountability and continuous improvement. Technical strategy & delivery: Define and execute the platform roadmap including compute, networking, storage, CI/CD and Kubernetes-based services. Balance technical debt, reliability work and new feature delivery. Reliability & SLAs: Own SLIs/SLOs for platform services, drive reliability programmes, and ensure teams meet agreed availability and performance targets. Observability & incident management: Champion end-to-end observability (metrics, tracing, logging), lead major incident responses, and run blameless post-incident reviews with actionable RCA and follow-through. Automation & infrastructure as code: Promote automation-first approaches for provisioning, deployments and runbook automation using IaC and CI/CD best practice. Security & compliance: Work closely with InfoSec to embed secure configurations, secrets management and compliance controls into the platform lifecycle. Key outcomes: Reliable, observable platform: Platform services meet agreed SLAs/SLOs and have clear dashboards, playbooks and automated remediation where appropriate. Reduced incident impact: Noticeable reductions in MTTD/MTTR with documented RCA actions completed and shared learnings adopted broadly. High-performing team: A motivated engineering team that continually improves delivery, reliability practices and operational readiness. Cost-efficient operations: Demonstrable cloud cost and capacity improvements without compromising service reliability.
Requirements
Proven engineering management experience: 3 years + leading platform, SRE or infrastructure teams in a cloud-native environment (typically 3+ years in management plus strong hands-on experience). Strong SRE/DevOps background: Hands-on experience operating cloud services at scale with a solid understanding of SLIs/SLOs, error budgets and incident lifecycle. Public cloud expertise: Deep experience with AWS services (compute, networking, storage and managed services) and cloud operational patterns. Infrastructure as Code: Practical experience with Terraform, CloudFormation or equivalent. Containerisation & orchestration: Strong knowledge of Kubernetes, Helm and related ecosystem tools. Observability tooling: Experience implementing and operating metrics, tracing and logging stacks such as Prometheus, Grafana, ELK/OpenSearch or Jaeger. CI/CD & automation: Familiarity with pipeline tooling (GitHub Actions, GitLab CI, Jenkins) and automation frameworks to enable reliable delivery. Programming & scripting: Comfortable writing and reviewing code or scripts (Python, Go, Bash) to automate tasks and build platform tooling.