About the role
AI summarisedSenior Linux Platform Engineer at AMD, responsible for installing, configuring, and supporting Linux OS and engineering infrastructure across on-prem, lab, datacenter, and hybrid cloud environments. The role involves OS lifecycle management, platform reliability, infrastructure automation using Python and Ansible, and global team collaboration. Requires 8+ years of Unix/Linux system administration experience and strong programming skills.
FablessFull-time
Key Responsibilities
- Install/Configure/Support Linux OS and engineering infrastructure across on-prem, lab/datacenter, and hybrid cloud environments.
- OS lifecycle management: Image creation, patching, upgrades, hardening, baseline configuration, and standardization
- Maintain platform reliability through disciplined operation: Incident Response, Root Cause Analysis, and continuous improvement.
- Design and implement infrastructure automation using Python and modern automation frameworks (e.g. Ansible) to support Infrastructure as Code and self-service capabilities
- Work with global team to provide support and complete IT projects.
- Assist with hardware / software installation and troubleshooting in lab / datacenter spaces (rack/stack coordination, validation, break-fix as required)
Requirements
- 8+ years of experience of Unix/Linux systems administration in physical, virtual / cloud environment.
- Demonstrate passion to learn, ownership and getting things done.
- Strong programming experience to deploy automation using Python & Ansible for Infrastructure as Code & self-service capabilities.
- Working knowledge of HPC environments including cloud providers like Azure, GCP, AWS
- Basic understanding of network administration: TCP/IP, DNS, routing, firewall and load balancing
- Experience with VDI technologies like Citrix / ETX
- Experience with Virtualization such as: KVM / Xen / VMware / Kubernetes
- Experience with Server hardware and deployment in Datacenter
- Experience with NAS (NFS & CIFS) storage like NetApp, Isilon & Nasuni
- Experience with LSF (Load Sharing Facility)
- Experience working within secured, firewall-controlled enterprise network
- Large cluster management tools such as: Puppet, CFengine
- Experience with monitoring tools such as Nagios, ELK stack, Kibana/Prometheus
- Bachelor's Degree in Science or Engineering