Apple

Senior Site Reliability Engineer - Apple Information Security

Apple
TechnologySingaporeOnsitePosted 6 months ago

About the role

AI summarised

This Senior Site Reliability Engineer role at Apple Information Security focuses on building autonomous platforms to operate core information security systems at hyper-scale. The engineer will design and implement software-driven automation to ensure scalability, availability, and security of critical infrastructure, collaborating closely with Security Engineering, Threat Detection, and Incident Response teams. The role emphasizes pushing the boundaries of reliability engineering through advanced tools, automation, and proactive system management.

TechnologyOnsiteSoftware and Services

Key Responsibilities

  • Design and implement software that solves operational challenges for hyper-scale information security systems
  • Develop automation that anticipates, reacts to, and stays ahead of system scale
  • Work closely with Security Engineering, Threat Detection, Incident Response and other internal functions
  • Ensure scalability, availability and security of tools and infrastructure supporting Apple’s cybersecurity mission
  • Collaborate with partner teams to deliver optimal engineering solutions efficiently
  • Break new ground using advanced tools and approaches in reliability engineering
  • Build self-managing systems for core information security platforms
  • Contribute to a highly collaborative team culture that values good ideas and rewards innovation

Requirements

  • Proven experience in Site Reliability Engineering or a related field
  • Strong programming skills in Python, Go or Swift
  • Experience working with cloud compute environments like AWS, GCP or Azure
  • Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation (e.g., Terraform, Pulumi, CloudFormation, Ansible, Chef, Puppet, Jenkins)
  • Cloud deployment and CI/CD problem diagnosis and troubleshooting
  • Experience or experimentation building systems that leverage Agentic AI principles, tools, platforms and frameworks
  • Strong understanding and experience in implementing monitoring and observability tools like Splunk, Grafana, Prometheus
  • Building and operating container orchestrating systems (Docker, Kubernetes, Vagrant and micro-services)
  • Experience administering and troubleshooting Linux systems including the usage of standard Linux utilities
  • Experience in shell scripting (e.g., bash/zsh) and system administration
  • Experience with measuring, analyzing, and optimizing system performance
  • Passion for high-quality code, tests, documentation and production services
  • Participation in an on-call rotation
  • Bachelor’s degree in Computer Science, or a related field, or equivalent practical experience