Apple

Site Reliability Engineer

Apple
TechnologySingaporeOnsitePosted 12 months ago

About the role

AI summarised

The Site Reliability Engineer at Apple will be responsible for ensuring the reliability, performance, and availability of critical software infrastructure supporting iOS, macOS, and manufacturing systems. This role involves deploying, monitoring, troubleshooting, and developing tools for multi-tiered systems, working closely with cross-functional teams including systems, network, database, and security engineers. The position requires a blend of strategic engineering and hands-on technical work in DevOps, automation, and system administration to maintain high availability and application security across Apple's product lifecycle.

TechnologyOnsiteCorporate Functions

Key Responsibilities

  • Deploying, monitoring, troubleshooting and developing tools for all team's solutions
  • Configure, tune, and tackle multi-tiered systems to achieve optimal application performance, stability and availability
  • Review hardware, software infrastructure and application functionality for optimization
  • Responsible for the full system lifecycle including configuration, code deployment in user acceptance test and production environments
  • Monitor infrastructure and application services and drive incident management
  • Collaborate with Apple's production support team, application engineers, project managers, systems engineers, network engineers, database administrators and QA team to effectively ensure availability and reliability of various solutions

Requirements

  • Unix or Linux administration and performance tuning skills
  • 0 ~ 5 years of leading services in a large scale *nix environment
  • Java and JVM technologies runtime configurations and troubleshooting
  • Proficient in Python/Go/other scripting language
  • Experience with DevOps tools, processes, and culture
  • Validated experience with Automation skills using Ansible, Chef, Jenkins, Puppet
  • Oracle DB knowledge and troubleshooting skills
  • Infrastructure knowledge of Networks, load balancers, Firewalls and WAF
  • SDLC and release engineering including source code repository and build tools including SVN and GIT
  • Network, System and Application Security knowledge
  • Application design, development, API programming and improvement using Java, Javascript, HTML, CSS, spring, hibernate, object oriented analysis and design
  • Experience with Kafka or other message queueing technology