About the role
AI summarisedThe Observability Engineer (AVP/Manager) at a leading Singapore bank is responsible for designing and implementing end-to-end observability solutions for infrastructure, applications, and microservices. The role involves developing dashboards, automating monitoring processes, and collaborating with stakeholders to ensure system health and performance. Candidates need 5+ years of experience with tools like Broadcom DX UIM, Cisco AppDynamics, and Grafana, along with strong scripting and communication skills.
BusinessFull-timeGeneral
Key Responsibilities
- Design and implement end-to-end observability solutions for infrastructure, applications, and microservices.
- Develop and maintain dashboards for real-time visibility into system health and performance.
- Collaborate with stakeholders to understand monitoring requirements and provide clear, actionable insights.
- Assess and track observability maturity across the organization.
- Implement and manage solutions for metrics collection and alerting.
- Automate monitoring processes using scripting languages (Shell, Python, PowerShell).
- Integrate and optimize monitoring tools for performance and scalability.
- Troubleshoot and resolve observability-related issues across environments.
Requirements
- Bachelor's degree in computer science, Information Technology, or related field.
- 5+ years of hands-on experience in managing observability solutions (Design, Build, Run & Maintain).
- Experience with enterprise logging platforms (e.g., Elasticsearch, Splunk).
- Proficiency in automation scripting (Ansible, Terraform, PowerShell, etc.).
- Strong understanding of both application and infrastructure domains.
- Experience with container orchestration platforms like OpenShift or Kubernetes.
- Solid knowledge of hardware, operating systems, and system services performance tuning.
- Excellent analytical and problem-solving skills with a creative mindset.
- Strong communication skills in written and spoken English.
- Proven ability to work collaboratively across teams and influence stakeholders.
- Ability to perform under pressure and meet tight deadlines with minimal supervision.
- Deep understanding of Infrastructure Monitoring (Host Systems, Networks and Open systems), Application Performance Monitoring (APM) of monolithic and containerized applications, Microservices Monitoring, Dashboard creation and visualization.
- Proficiency in scripting: Shell, Python, PowerShell.
- Preferred Experience: Infrastructure Monitoring: Broadcom DX UIM; APM: Cisco AppDynamics, Elastic APM; NPM: Solarwinds; Dashboards: Grafana; Experience in financial institutions or regulated environments; Knowledge of Elastic Stack (Elasticsearch, Logstash, Kibana) is a plus.