About the role

AI summarised

This Senior Site Reliability Engineer role at Apple Information Security focuses on building autonomous platforms to operate core information security systems at hyper-scale. The engineer will design and implement software-driven automation to ensure scalability, availability, and security of critical infrastructure, collaborating closely with Security Engineering, Threat Detection, and Incident Response teams. The role emphasizes pushing the boundaries of reliability engineering through advanced tools, automation, and proactive system management.

TechnologyOnsiteSoftware and Services

Key Responsibilities

Design and implement software that solves operational challenges for hyper-scale information security systems
Develop automation that anticipates, reacts to, and stays ahead of system scale
Work closely with Security Engineering, Threat Detection, Incident Response and other internal functions
Ensure scalability, availability and security of tools and infrastructure supporting Apple’s cybersecurity mission
Collaborate with partner teams to deliver optimal engineering solutions efficiently
Break new ground using advanced tools and approaches in reliability engineering
Build self-managing systems for core information security platforms
Contribute to a highly collaborative team culture that values good ideas and rewards innovation

Requirements

Proven experience in Site Reliability Engineering or a related field
Strong programming skills in Python, Go or Swift
Experience working with cloud compute environments like AWS, GCP or Azure
Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation (e.g., Terraform, Pulumi, CloudFormation, Ansible, Chef, Puppet, Jenkins)
Cloud deployment and CI/CD problem diagnosis and troubleshooting
Experience or experimentation building systems that leverage Agentic AI principles, tools, platforms and frameworks
Strong understanding and experience in implementing monitoring and observability tools like Splunk, Grafana, Prometheus
Building and operating container orchestrating systems (Docker, Kubernetes, Vagrant and micro-services)
Experience administering and troubleshooting Linux systems including the usage of standard Linux utilities
Experience in shell scripting (e.g., bash/zsh) and system administration
Experience with measuring, analyzing, and optimizing system performance
Passion for high-quality code, tests, documentation and production services
Participation in an on-call rotation
Bachelor’s degree in Computer Science, or a related field, or equivalent practical experience

Senior Site Reliability Engineer - Apple Information Security

About the role

Key Responsibilities

Requirements