Site Reliability Engineer
Senior Site Reliability Engineer (SRE) – Azure & Terraform | London
Location: London, UK (Hybrid – 3 Days Onsite per Week)
Contract: Initial 6 Months
About the Rol
eWe are looking for an experienced Senior Site Reliability Engineer (SRE) with strong expertise in Azure Cloud, Terraform, and Reliability Engineering practices to join a large-scale enterprise transformation programme
.
This is a senior technical leadership position requiring extensive experience in designing, building, and maintaining highly available, scalable, and resilient cloud platforms while driving SRE best practices across engineering team
s.
Key Responsibilit
- iesLead Site Reliability Engineering initiatives across enterprise applications and platfor
- ms.Design and implement scalable, resilient, and highly available cloud solutions on Microsoft Azu
- re.Develop and maintain Infrastructure as Code (IaC) using Terrafo
- rm.Define and manage SLI, SLO, SLA, and Error Budget framewor
- ks.Drive platform reliability, operational excellence, and continuous improvement initiativ
- es.Lead technical design reviews, architecture discussions, and engineering governance activiti
- es.Partner with Product Owners, Scrum Teams, Architects, and Stakeholders to deliver high-quality solutio
- ns.Improve software delivery processes, automation, observability, and operational efficien
- cy.Support incident management, root cause analysis, capacity planning, and performance optimizati
- on.Mentor engineering teams and promote SRE best practices across the organizati
on.
Required Skills & Experi
- ence15+ years of overall IT experie
- nce.Minimum 5 years of dedicated Site Reliability Engineering (SRE) experie
- nce.Strong hands-on experience with Microsoft Az
- ure.Strong Terraform experience in enterprise-scale environme
- nts.Excellent understanding
- of:SRE Princi
- plesSLI / SLO /
- SLAError Bud
- getsObservabi
- lityIncident Manage
- mentProduction Reliabi
- lityCapacity Plan
- ningExperience leading technical teams and architecture discussi
- ons.Strong stakeholder management and customer-facing communication ski
lls.
Nice to
- HaveKuber
- netesD
- ockerAzure D
- evOpsProme
- theusGr
- afanaS
- plunkDyna
- traceAzure Mo
- nitorApplication Ins
- ightsPython / PowerShell Scri