DevOps Engineer
Role details
Job location
Tech stack
Job description
As a DevOps Engineer, you'll play a critical role in ensuring the reliability, security, and operational excellence of our cloud-native platform. This role has a strong operational focus, providing UK afternoon to evening coverage aligned with US customer activity, and taking ownership of incident response, release readiness, and day-to-day platform stability.
You'll work closely with engineering, operations, and customer-facing teams to ensure issues are triaged and resolved effectively, alerts are actionable, and runbooks and processes are consistently followed. A key part of the role is reducing operational toil through targeted automation, CI/CD improvements, and the creation of repeatable operational workflows.
You'll also help shape our platform foundations, ensuring operational patterns align with our "golden path" standards and support future self-service capabilities. This is a hands-on role with real ownership, including mentoring junior engineers and directly influencing how we operate and scale our production environment., * Acting as the primary engineer for UK afternoon-evening support coverage, handling triage, investigation, resolution, and escalation of incidents
- Leading operational execution by ensuring alerts are actionable, runbooks are followed, and incidents are managed consistently and professionally
- Mentoring and pairing with a junior engineer, providing coaching on troubleshooting, cloud operations, and safe change practices
- Owning and managing the operational backlog during Phase 1, identifying repeat toil, prioritising improvements, and driving fixes to completion
- Improving release readiness and coordination, working closely with engineering and release management practices where in place
- Contributing to platform foundations by ensuring operational patterns align with "golden path" standards and support future self-service capabilities
- Delivering targeted automation and standardisation to reduce ticket volume, including: CI/CD pipeline improvements, Repeatable operational workflows (e.g. service restarts, deployment checks, environment validation)
Requirements
-
3-5 years' experience in DevOps, SRE, Platform Engineering, or Production Operations roles
-
Strong hands-on AWS experience (or equivalent cloud provider), including:
- Compute/runtime environments (containers and/or serverless)
- Practical networking fundamentals (VPCs, security groups, routing)
- IAM and access management with least-privilege principles
- Logging, metrics, alerting, and incident triage
- Solid experience with Infrastructure as Code, ideally Terraform, and safe change/release practices
- Strong CI/CD experience, including designing or improving pipelines, build/deploy patterns, and quality gates
- Confident scripting and automation skills using Python and/or Bash, with the ability to build reliable small tools
- Excellent documentation and communication skills, including writing clear runbooks and post-incident reviews
- Comfortable working cross-functionally with engineering, operations/support, and customer-facing teams during incidents
Nice to have
- Experience operating container platforms such as ECS, EKS, or Kubernetes, and/or service mesh patterns
- Observability and reliability experience, including dashboards, SLO/SLA thinking, and improving alert quality
- Security- and compliance-aware delivery experience (e.g. secrets management, vulnerability management processes)
- Experience building internal developer tooling or self-service workflows
Benefits & conditions
Health Insurance - Provided through AXA, covering medical, dental, optical, mental health, and therapies. Employees also have free access to Spill, offering confidential mental health support and therapy.
Life Insurance - Covers four times your basic salary, along with Income Protection for up to 36 months at 75% of salary, including rehabilitation support.
Pension Scheme - A salary sacrifice pension scheme through Royal London. Send contributes 8%, with a minimum employee contribution of 4%.
Time Off - 25 days of annual leave, plus public holidays. We also offer volunteering time and a dedicated wellness day.
Enhanced Parental Leave - Includes 12 weeks of fully paid leave for all new parents, along with additional support for birth-giving parents.
Learning and Development - An annual budget via Learnerbly, providing access to books, courses, conferences, and other resources to support your growth.