Thales

Data Architect

Aerospace & Defense | Singapore | Onsite | Posted 4 weeks ago

About the role

We are seeking a seasoned Data Architect to lead the design, adaptation, and implementation of a robust Data Warehouse platform. This role is critical to architecting scalable, unified data solutions that combine the strengths of data lakes and data warehouses to enable advanced analytics, AI/ML applications, and secure enterprise data access.

Key Responsibilities

  • Own the Data Warehouse architecture, keeping architecture diagrams and the technology stack current.
  • Define data modeling standards for raw, curated, and serving layers.
  • Develop strategies for data ingestion, storage, cataloging, governance, and secure data access (covering data at rest and data in transit).
  • Lead the design and implementation of ingestion pipelines for structured and unstructured data.
  • Oversee deployment pipelines using CI/CD best practices for data-oriented infrastructure.
  • Ensure high availability, scalability, and performance of the technical implementation.
  • Implement robust data security frameworks including encryption, data masking, and fine-grained access controls.
  • Define and enforce data governance policies covering data lineage, quality, and regulatory compliance.

Requirements

  • Bachelor's degree in Computer Science or Information Technology (Master's preferred).
  • Strong experience designing highly available data platforms using MinIO as the data storage infrastructure.
  • Expertise in SQL, data modeling (SCD Type 1, 2, 6), ETL/ELT design, and query performance tuning.
  • Expertise in containerization and orchestration (Docker, Kaniko, Kubernetes).
  • Strong expertise in designing ETL and ELT data pipelines for structured and unstructured data, including Change Data Capture.
  • Expertise in distributed data processing technologies (Apache Spark 3.0, Apache Flink 2.0, Apache Iceberg, Trino, Apache Kafka).
  • Hands-on experience with scalable data storage platforms (e.g., Azure Data Lake Storage, MinIO S3-compatible object storage).
  • Deep understanding of data security, encryption, IAM, and compliance standards.
  • Proficiency in Java 8+ or Kotlin 2.x.
  • Proficiency with Git-based platforms (GitLab, Gitea) for Continuous Integration.
  • Proficiency with the Linux command line.