DBS Bank

SVP/VP, Site Reliability Engineering Domain Lead, SRE & Governance, Group Technology

DBS Bank
BusinessSingapore - CentralFull-time1 weeks ago

About the role

AI summarised

The role is for a senior SRE Domain Lead at a bank, responsible for leading SRE practices, governance, and reliability engineering across the organization.

BusinessFull-timeGeneral

Key Responsibilities

  • Business Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners through our multiple banking delivery channels. Roles & Responsibilities - Manage a large team of Production Support Personnel across 3 geographical locations - Ensure SLAs on Alerts and Incidents are proactively managed and reduce in Mean Time To Recover (MTTR) by 20% - Ensure strict adherence to Standard Operating Procedure for recovery - Deliver a playbook for onboarding on new tasks / activities to Production support - Identify opportunities to automate Production support activities and reduction in manual acti - Application improvements ranging from performance and operational improvements, identification and remediation of system and automate Toils. - Automation of manual activities/ processes and System Health checks for Production teams. (Automation experience required) and ensuring SLIs/ SLOs are met. - Follow Production Support Processes and giving input to strengthen time to time - Providing status to leads, stakeholders and working with vendors to review the design/fix/enabling for production deployment - Coordinate recurring issues and ensure long-term resolution through proper Incident and Problem Management - Working with various teams like Infrastructure, development team to resolve, analysis of root cause for complex issues and outages - Strong stakeholder management skills with focus on continuous service improvement, consistent delivery and stability of production. - Drives Root Cause Analysis with technology partners, post incident resolution and facilitates RCA reviews. - Work with Risk team to respond timely to Audit & Risk RFIs. Manage Audit walkthroughs Requirements - 10 - 12 years of strong experience in the Banking industry with minimum 5+ years in Run-the-Bank (RTB) lead role with a proven track record of working in Banking environment - Implement Site Reliability Engineering principles with regards to performance, reliability, monitoring, alerting and maintenance in Production environment. Pro-active Capacity monitoring & Observability of production Infrastructure, automated alerting, performance monitoring and reporting tools - Automation of manual tasks in a Production Support - Build and maintain Production monitoring and automation solutions - Build and implement Service improvements. Identify, measure and report performance trends – SLIs/ SLOs/ SLAs periodically and improve systems performance and associated performance KPIs - Sound understanding of RDBMS / Unix / Cloud/ Large banking applications - Strong team player, effective at communicating internationally and used to working closely with remote teams - Good knowledge of infrastructure technologies used, with focus on AIX/Oracle/Java/ Openshift - Solid understanding of BAU support, incident, problem management processes as well as escalation management across a diversified environment - Understanding of Risk Management, Disaster Recovery, Business Continuity, IT Security Architecture, and IT Regulatory Compliance. - Present facts and recommendations effectively in oral and written form - Pro-active, independent, resourceful, and able to work in a team -en Location: DBS Asia Central Job: Technology Schedule: Regular Employee Status: Full time

Requirements

Requirements were not listed in the extracted data for this post.