OCBC

SRE, Automation Engineer (Private/Public Cloud, BigFix)

Business | OCBC Singapore | Full-time | 1 month ago

About the role

This Site Reliability Engineering (SRE) and Automation Engineer role covers private and public cloud infrastructure and BigFix automation. The engineer is responsible for the reliability, scalability, and automation of cloud environments, for monitoring and incident response, and for operating configuration management tools such as BigFix and Ansible.

Business | Full-time | General

Key Responsibilities

  • Design, implement, and maintain automation solutions for private and public cloud environments using tools like Ansible, Terraform, and BigFix.
  • Manage and optimize cloud infrastructure on AWS, Azure, and GCP to ensure high availability and performance.
  • Develop and maintain CI/CD pipelines using Jenkins and Git for automated deployments.
  • Implement monitoring and alerting solutions using Prometheus, Grafana, ELK, and Splunk.
  • Respond to incidents, perform root cause analysis, and implement preventive measures.
  • Collaborate with development teams to improve system reliability and scalability.
  • Automate routine operational tasks using Python, Bash, and PowerShell scripts.
  • Manage containerized workloads with Docker and container orchestration platforms such as Kubernetes.
  • Ensure security best practices are applied across cloud and on-premises infrastructure.
  • Document system configurations, procedures, and runbooks.

Requirements

  • 3-7 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
  • Strong experience with configuration management tools such as BigFix and Ansible.
  • Proficiency in scripting languages: Python, Bash, and PowerShell.
  • Hands-on experience with cloud platforms: AWS, Azure, or GCP.
  • Experience with containerization and orchestration: Docker and Kubernetes.
  • Solid understanding of CI/CD pipelines and tools like Jenkins and Git.
  • Experience with monitoring and logging tools: Prometheus, Grafana, ELK, Splunk.
  • Knowledge of networking concepts and security best practices.
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Preferred certifications: AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, or Certified Kubernetes Administrator.