About the role
The Senior HPC Engineer is responsible for optimizing and evaluating high-performance computing systems across CPU, memory, storage, and networking layers. This role supports new product introduction and ongoing instrument operations by diagnosing performance bottlenecks, validating hardware and software workflows, and developing test procedures and work instructions. The engineer collaborates with cross-functional teams in Singapore and the US to drive product enhancements, ensure operational excellence, and maintain high-performance standards throughout the product lifecycle.
Biotech, Onsite
Key Responsibilities
- Engage early with the development team to understand HPC system-level design and its interactions with functional components such as EE, Software/Firmware, Operating System, and FPGA
- Diagnose low-level HPC performance bottlenecks across CPU, RAM, storage, networking, and peripheral drivers
- Collaborate with software, hardware, and systems engineering teams to qualify new software, hardware, and operating workflows for the HPC system
- Evaluate hardware components including CPUs, memory, storage, networking, and peripheral devices for platform continuity and develop sustainable solutions
- Establish work instructions for HPC integration workflows, test procedures, and diagnostic workflows for engineering and manufacturing use
- Conduct comprehensive root-cause investigations for complex production issues using structured problem-solving methodologies (FMEA, DOE, and statistical analysis)
- Complete Non-conformance Records based on investigation outcomes and drive the release of instruments back into production
- Plan and execute engineering studies to support experiments and data collection efforts
- Participate in continuous improvement initiatives and operational excellence programs
- Write and update work instructions as needed to improve test procedures
Requirements
- Typically requires a minimum of 5 years of related experience with a Bachelor’s degree in Computing; or 3 years and a Master’s degree; or a PhD without experience; or equivalent work experience
- Experience in Linux High Performance Computing systems (RHEL/Oracle/CentOS) in enterprise environments
- Experience in analysing HPC workloads and scheduling systems, e.g. Slurm, RunAI, and profiling tools
- Experience in troubleshooting low-level performance bottlenecks across compute, storage, and networking layers
- Deep understanding of compute system tuning for memory, CPU, and network performance
- Good understanding of CPU architectures across Intel, AMD, and ARM, and of multi-threaded operating systems
- Knowledge of TCP, UDP, RDMA, and server/network interfaces
- Knowledge of modern memory technologies such as DDR4/DDR5, DIMM, LPDDR
- Experience with scripting languages such as Python, Perl, or Bash to extract debug information from embedded systems and high-performance compute systems
- Excellent team player with strong interpersonal and communication skills
- Willing to travel to the US for extended periods to learn the technology and collaborate with US design teams
- Solid working knowledge of GDP, ISO & cGMP practices
- Experience with FDA regulated medical device product development is preferred
- Experience in life science–related projects or internships is advantageous; exposure to medical device environments is preferred