About the role
The SRE Manager will lead SRE teams and design and scale reliable, secure, high-performance infrastructure across cloud and hybrid environments. This hands-on leadership role involves establishing reliability patterns, driving large-scale systems design, and building automation frameworks for production systems.
Technology | Full-time
Key Responsibilities
- Leading SRE teams
- Designing and scaling reliable, secure, and high-performance infrastructure across cloud and hybrid environments
- Establishing reliability patterns
- Driving large-scale systems design
- Building automation frameworks to support production systems at scale
Requirements
- 10 or more years of experience in SRE, DevOps, or Infrastructure Engineering roles
- 2 or more years in a managerial capacity
- Deep expertise in cloud infrastructure (AWS, GCP, or AliCloud)
- Deep expertise in container orchestration (Kubernetes, EKS)
- Proven experience with Infrastructure as Code tools such as Terraform and CloudFormation
- Strong understanding of distributed systems, networking, and systems design at scale
- Proficiency in at least one programming or scripting language, such as Python, Go, or Bash
- Solid background in CI/CD tools and modern deployment strategies, for example Spinnaker and GitOps
- Familiarity with security best practices in cloud and containerized environments
- Experience with HSMs and cryptographic operations at scale is a plus