OCBC

Observability Engineer, AVP/ Manager

OCBC
BusinessOCBC SingaporeFull-time1 months ago

About the role

AI summarised

The Observability Engineer (AVP/Manager) at a leading Singapore bank is responsible for designing and implementing end-to-end observability solutions for infrastructure, applications, and microservices. The role involves developing dashboards, automating monitoring processes, and collaborating with stakeholders to ensure system health and performance. Candidates need 5+ years of experience with tools like Broadcom DX UIM, Cisco AppDynamics, and Grafana, along with strong scripting and communication skills.

BusinessFull-timeGeneral

Key Responsibilities

  • Design and implement end-to-end observability solutions for infrastructure, applications, and microservices.
  • Develop and maintain dashboards for real-time visibility into system health and performance.
  • Collaborate with stakeholders to understand monitoring requirements and provide clear, actionable insights.
  • Assess and track observability maturity across the organization.
  • Implement and manage solutions for metrics collection and alerting.
  • Automate monitoring processes using scripting languages (Shell, Python, PowerShell).
  • Integrate and optimize monitoring tools for performance and scalability.
  • Troubleshoot and resolve observability-related issues across environments.

Requirements

  • Bachelor's degree in computer science, Information Technology, or related field.
  • 5+ years of hands-on experience in managing observability solutions (Design, Build, Run & Maintain).
  • Experience with enterprise logging platforms (e.g., Elasticsearch, Splunk).
  • Proficiency in automation scripting (Ansible, Terraform, PowerShell, etc.).
  • Strong understanding of both application and infrastructure domains.
  • Experience with container orchestration platforms like OpenShift or Kubernetes.
  • Solid knowledge of hardware, operating systems, and system services performance tuning.
  • Excellent analytical and problem-solving skills with a creative mindset.
  • Strong communication skills in written and spoken English.
  • Proven ability to work collaboratively across teams and influence stakeholders.
  • Ability to perform under pressure and meet tight deadlines with minimal supervision.
  • Deep understanding of Infrastructure Monitoring (Host Systems, Networks and Open systems), Application Performance Monitoring (APM) of monolithic and containerized applications, Microservices Monitoring, Dashboard creation and visualization.
  • Proficiency in scripting: Shell, Python, PowerShell.
  • Preferred Experience: Infrastructure Monitoring: Broadcom DX UIM; APM: Cisco AppDynamics, Elastic APM; NPM: Solarwinds; Dashboards: Grafana; Experience in financial institutions or regulated environments; Knowledge of Elastic Stack (Elasticsearch, Logstash, Kibana) is a plus.