About the role
AI summarisedJoin our team to operate and maintain critical, multi-cloud infrastructure services across AWS, Azure, and GCP. This role involves hands-on operational support for production environments, focusing on infrastructure lifecycle management, security compliance, application deployment, and adherence to ITIL processes within a 24/7 rotational support model.
ElectronicsOnsiteInformation Technology
Key Responsibilities
- Operate and maintain cloud-native services in production across AWS, Microsoft Azure, and Google Cloud Platform.
- Monitor and troubleshoot infrastructure performance, uptime, and scalability across all multi-cloud platforms.
- Support production and staging environments with 24/7 reliability objectives, participating in shift rotations.
- Maintain infrastructure deployment pipelines using IaC tools like Terraform or Ansible, and troubleshoot environment drift.
- Lead OS patching operations for RHEL (v8-v10) and Windows Server (2016-2025) using specialized management tools.
- Execute security remediations based on CIS Benchmarks and government security baselines across cloud platforms.
- Manage and resolve ITSM tickets via ServiceNow or Jira, driving escalation when necessary.
- Deploy and troubleshoot applications across Windows and Linux operating systems, providing OS-level diagnostics.
- Implement and maintain application monitoring and alerting frameworks for proactive issue resolution.
Requirements
- Hands-on experience with cloud services including AWS, Azure, and GCP.
- Proven ability to support production environments requiring 24/7 operational readiness.
- Working knowledge of Infrastructure as Code (IaC) using Terraform, Ansible, or ARM templates.
- Experience leading OS patching cycles for enterprise environments (RHEL/Windows Server).
- Strong adherence to ITIL processes including Incident, Problem, and Change Management.
- Ability to perform security hardening based on industry benchmarks (e.g., CIS).
- Familiarity with container technologies like Docker and Kubernetes is required.
- Experience managing service level agreements (SLAs) and operational level agreements (OLAs).