Site Reliability Engineer
Robert Walters
Glasgow, United Kingdom
2 days ago
Role details
Contract type: Temporary contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Glasgow, United Kingdom
Tech stack
Agile Methodologies
Artificial Intelligence
ARM
Relational Databases
Python
PostgreSQL
Microsoft SQL Server
MySQL
Networking Basics
Oracle Applications
Reliability Engineering
Power BI
Tableau
Scripting (Bash/Python/Go/Ruby)
Large Language Models
Snowflake
Data Layers
Data Management
Data Pipelines
Job description
- Ensure reliability, availability, and performance of large-scale data and analytics platforms across DEV, QA, and PROD.
- Apply SRE principles to design resilient services, define SLIs/SLOs, and drive continuous reliability improvements.
- Automate workflows and tooling (primarily Python) to reduce operational toil and improve repeatability.
- Build and manage CI/CD pipelines for data pipelines, cloud platforms, semantic models, and services.
- Support release and change management, ensuring safe deployments, validation, and rollback readiness.
- Serve as a senior escalation point for incidents, performing root cause analysis and preventive remediation.
- Design and maintain monitoring, alerting, and observability for platform components and data workloads.
- Operate and optimize cloud-based data platforms (e.g., Snowflake) for stability, scalability, and cost efficiency.
- Support deployment and reliability of AI-enabled services, including monitoring, failure handling, and runbook creation.
- Collaborate with engineering and product teams to ensure AI/analytics features are production-ready and safely integrated.
- Contribute to operational documentation, runbooks, and platform operating model improvements.
Requirements
- 5+ years in SRE, production, or platform engineering with hands-on experience operating large-scale production systems.
- Strong skills in Python automation and scripting, CI/CD pipeline development, and modern software delivery practices.
- Experience with cloud data platforms (Snowflake preferred), relational databases (PostgreSQL, MySQL, Oracle, SQL Server), and troubleshooting across application, platform, and data layers.
- Knowledge of change and incident management in enterprise environments.
- Excellent communication and collaboration across teams.
Technical competencies
- Monitoring, alerting, observability, infrastructure-as-code, Unix/Linux systems, basic networking.
Desired/Nice-to-have
- Semantic data modelling, analytics platforms, and BI tools (Tableau, Power BI).
- Exposure to Snowflake Cortex or GenAI/LLM tools in production.
- Agile delivery experience (sprints, backlog refinement) and knowledge of ITSM processes.
- Experience supporting AI-enabled or analytics-driven platforms at scale in regulated enterprises.
About the company
Robert Walters is the world's most trusted talent solutions business. Across the globe, we deliver recruitment, outsourcing, and talent advisory services for businesses of all sizes, opening doors for people with diverse skills, ambitions, and backgrounds.
Who You Will Work With
Our client is a global financial services firm that manages wealth, navigates complex markets, and designs strategic financial objectives. The firm provides risk management solutions across a variety of sectors, emphasizing long-term relationships and innovative approaches to financial challenges.