Thales

Site Reliability Engineer

Thales
Aerospace & DefenseSingaporeFull-time1 months ago

About the role

AI summarised

Site Reliability Engineer at Thales, working in the Digital Identity & Security business line on On-Demand Connectivity products. Responsible for deploying, operating, and maintaining cloud-based platforms in GCP using SRE practices, with a focus on automation, monitoring, and 24/7 on-call support.

Aerospace & DefenseFull-timeGeneral

Key Responsibilities

  • Work in a DevOps team to deploy, operate, maintain and improve ODC products in GCP Cloud, following the SRE approach.
  • Responsible for deployment of Thales products in cloud.
  • Perform on-boarding test, communicate technical risk concerns and help prepare mitigation plans.
  • Responsible for System monitoring with real-time monitoring tools.
  • Extend and acknowledge completion of handover milestones to Tiers I, II to comply with contractual SLAs.
  • Responsible for support operations tasks to shape the product roadmap and establish strong operational readiness across teams.
  • Provide technical guidance for new or evolution of services and for consolidated technical analyses.
  • Participate at the preparation and review of technical product & customer specific documentation.
  • Provide technical direction when CAB requires Tier II input, expertise or changes with high-risk impacts on customer SLAs.
  • Ensure the integrity of the solution functional baseline and architecture.
  • Develop and maintain IAC code and automation tools.
  • Perform regular performance tuning, technological watch and updates on service platform.

Requirements

  • Degree in Computer Science or any related discipline.
  • 4+ years of experience in relevant field.
  • Hands-on in deployment with Kubernetes and GCP (preferred)/ AWS/ Azure administration and support in production grade environment.
  • Hands-on experience in Continuous Integration and Continuous Delivery (CI/CD) tools like Gitlab, Terraform, Ansible, Helm, Hashicorp (any).
  • Strong knowledge of System Integration, Operation, Maintenance and proven experience with automation tools including Gitlab.
  • Strong working experience on one of the scripting language – SHELL/Python is required.
  • Knowledge of Agile methodology and Service Delivery best practices.
  • Knowledge on Cloud service provider i.e. GCP/AWS/Azure, monitoring tools, networking, infrastructure and Linux.
  • Experience in Telecom domain will be highly preferred.