Site Reliability Engineer

Robert Walters
Glasgow, United Kingdom
2 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Glasgow, United Kingdom

Tech stack

Agile Methodologies
Artificial Intelligence
ARM
Relational Databases
Python
PostgreSQL
Microsoft SQL Server
MySQL
Networking Basics
Oracle Applications
Reliability Engineering
Power BI
Tableau
Scripting (Bash/Python/Go/Ruby)
Large Language Models
Snowflake
Data Layers
Data Management
Data Pipelines

Job description

  • Ensure reliability, availability, and performance of large-scale data and analytics platforms across DEV, QA, and PROD.
  • Apply SRE principles to design resilient services, define SLIs/SLOs, and drive continuous reliability improvements.
  • Automate workflows and tooling (primarily Python) to reduce operational toil and improve repeatability.
  • Build and manage CI/CD pipelines for data pipelines, cloud platforms, semantic models, and services.
  • Support release and change management, ensuring safe deployments, validation, and rollback readiness.
  • Serve as a senior escalation point for incidents, performing root cause analysis and preventive remediation.
  • Design and maintain monitoring, alerting, and observability for platform components and data workloads.
  • Operate and optimize cloud-based data platforms (eg, Snowflake) for stability, scalability, and cost efficiency.
  • Support deployment and reliability of AI-enabled services, including monitoring, failure handling, and runbook creation.
  • Collaborate with engineering and product teams to ensure AI/analytics features are production-ready and safely integrated.
  • Contribute to operational documentation, runbooks, and platform operating model improvements.

Requirements

  • 5+ years in SRE, production, or platform engineering with hands-on experience operating large-scale production systems.
  • Strong Python automation and Scripting, CI/CD pipeline development, and modern software delivery practices.
  • Experience with cloud data platforms (Snowflake preferred), relational databases (PostgreSQL, MySQL, Oracle, SQL Server), and troubleshooting across application, platform, and data layers.
  • Knowledge of change and incident management in enterprise environments.
  • Excellent communication and collaboration across teams.

Technical competencies

  • Monitoring, alerting, observability, infrastructure-as-code, Unix/Linux systems, basic networking.

Desired/Nice-to-Have:

  • Semantic data modelling, analytics platforms, and BI tools (Tableau, Power BI).
  • Exposure to Snowflake Cortex or GenAI/LLM tools in production.
  • Agile delivery experience (sprints, backlog refinement) and knowledge of ITSM processes.
  • Experience supporting AI-enabled or analytics-driven platforms at scale in regulated enterprises.

About the company

Robert Walters is the world's most trusted talent solutions business. Across the globe, we deliver recruitment, outsourcing, and talent advisory services for businesses of all sizes, opening doors for people with diverse skills, ambitions, and backgrounds. Who You Will Work With Our client is global financial services firm that manages wealth, navigates complex markets, and design strategic financial objectives. The firm provides risk management solutions across a variety of sectors, emphasizing long-term relationships, and innovative approaches to financial challenges.

Apply for this position