DBS Bank

ED/SVP, Site Reliability Engineering Operation Lead, SRE & Governance, Group Technology

DBS Bank
BusinessSingapore - CentralFull-time1 weeks ago

About the role

AI summarised

This is a senior leadership role (ED/SVP) in Site Reliability Engineering at a major bank. The position oversees SRE operations, governance, and strategy, ensuring reliability, scalability, and compliance of technology systems. The role involves managing teams, driving automation, and collaborating with stakeholders to maintain high service availability.

BusinessFull-timeGeneral

Key Responsibilities

  • Lead and manage the SRE operations team to ensure high availability and reliability of critical banking systems.
  • Define and implement SRE governance frameworks, policies, and best practices across the organization.
  • Drive automation initiatives to improve operational efficiency and reduce manual toil.
  • Oversee incident management, root cause analysis, and post-mortem processes to prevent recurrence.
  • Establish and monitor service level objectives (SLOs) and service level indicators (SLIs) for all critical services.
  • Collaborate with development and infrastructure teams to design resilient and scalable systems.
  • Manage capacity planning and performance tuning to meet business growth demands.
  • Ensure compliance with regulatory requirements and internal risk management standards.
  • Provide strategic direction for SRE tooling, monitoring, and observability platforms.
  • Mentor and develop team members, fostering a culture of reliability and continuous improvement.

Requirements

  • Minimum 10 years of experience in Site Reliability Engineering, DevOps, or related fields.
  • Proven experience in leading SRE or infrastructure operations teams in a large-scale environment.
  • Deep understanding of SRE principles, including SLIs, SLOs, error budgets, and toil reduction.
  • Strong knowledge of cloud platforms (AWS, GCP, or Azure) and containerization technologies (Kubernetes, Docker).
  • Experience with automation tools (e.g., Terraform, Ansible, Jenkins) and CI/CD pipelines.
  • Expertise in monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk).
  • Solid understanding of networking, security, and database concepts.
  • Excellent leadership, communication, and stakeholder management skills.
  • Ability to drive cultural change and promote reliability best practices across teams.
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer) are preferred.
  • Experience in the banking or financial services industry is a plus.