About the role
AI summarisedThe SRE Manager will lead engineering teams in designing and scaling high-performance cloud and hybrid infrastructure for the JMET SRE Team. This hands-on leadership role involves architectural ownership of reliability patterns, automation frameworks, and security across distributed systems.
TechnologyOnsite
Key Responsibilities
- Lead SRE teams in designing and scaling reliable, secure, and high-performance infrastructure across cloud and hybrid environments.
- Establish reliability patterns and drive large-scale systems design.
- Build automation frameworks to support production systems at scale.
- Provide architectural ownership and strategic influence across multiple domains including application and infrastructure security.
- Manage incident response engineering and resilience automation.
Requirements
- 10 or more years of experience in SRE, DevOps, or Infrastructure Engineering roles.
- 2 or more years of experience in a managerial capacity.
- Deep expertise in cloud infrastructure such as AWS, GCP, or AliCloud.
- Experience with container orchestration including Kubernetes and EKS.
- Proven experience with Infrastructure as Code tools such as Terraform and CloudFormation.
- Strong understanding of distributed systems, networking, and systems design at scale.
- Proficiency in at least one programming or scripting language, such as Python, Go, or Bash.
- Solid background in CI/CD tools and modern deployment strategies such as Spinnaker and GitOps.
- Familiarity with security best practices in cloud and containerized environments.
- Experience with HSMs and cryptographic operations at scale is preferred.