Senior Site Reliability Engineer - Infrastructure
Role details
Job location
Tech stack
Job description
We are seeking a Senior Reliability Engineer to join the Platform Engineering Domain in the Scalability Team.
The mission of Platform Engineering is to provide trusted performant self-service platforms that empower product teams to build the bank the world loves to use. Scalability is part of this mission to develop solutions for container orchestration.
As one of the first banks completely hosted in the cloud - our security resilience and productivity standards require not only the use of a modern technology stack but the building of teams in line with our principles and in the service of our product teams, the company and our customers. In this role you will :
Design, develop, implement and own products and solutions to improve the security, reliability and scalability of N26 cloud infrastructure and systems in the workload orchestration domain.
Work with product customers and stakeholders to understand their needs and build the right products and solutions.
Take an active part in the strategy and roadmap definition and prioritisation.
Mentor and support other team members and also learn from them.
Requirements
Hands-on production expertise in the design, implementation and maintenance of Kubernetes clusters.
Extensive knowledge and hands-on experience in AWS Cloud infrastructure and services.
Solid experience in Go.
Previous experience with Linux, Terraform and CI/CD solutions (Argo or similar).
Networking experience with mesh solutions (Istio or similar).
Monitoring, troubleshooting and incident management experience. Nice to have :
Relevant backend development experience. Traits :
Good communication skills and a sense of ownership with a systematic problem-solving approach.
Proactiveness, collaboration and eagerness to learn. Whats in it for you, A high degree of autonomy and access to cutting edge technologies - all while working with a friendly team of peers of diverse nationalities, life experiences and family statuses., * Kubernetes
- FMEA
- Continuous Improvement
- Elasticsearch
- Go
- Root cause Analysis
- Maximo
- CMMS
- Maintenance
- Mechanical Engineering
- Manufacturing
- Troubleshooting