About the role
AI summarisedSenior Systems Engineer responsible for ensuring 24/7 server availability for a global pathogen data sharing platform at A*STAR BII. The role involves managing virtual machines, network configuration, security, and collaborating with scientific teams to support continuous global access to genomic data.
ResearchFull-timeBioinformatics Institute
Key Responsibilities
- Server management (Apache, Nginx, etc), network configuration, implementing security measures (firewalls, SSL), performance monitoring, and DNS management.
- Set up, configure, and deploy virtual machines in response to varying data submission and processing loads.
- Manage relationships with network providers and ensure service-level agreements (SLAs) are met.
- Troubleshoot systems/network issues, identify root causes, and implement corrective actions to restore services promptly.
- Conduct regular performance assessments and capacity planning to anticipate infrastructure scaling needs.
- Support disaster recovery and failover mechanisms to ensure uninterrupted platform operation.
- Collaborate with compute and application teams for systems integration.
- Maintain infrastructure documentation, performance logs, and provider agreements.
- Stay up-to-date with best practices in networking, virtualization, and infrastructure automation.
Requirements
- Bachelor degree in Information Technology, Computer and Systems Engineering, or a related field.
- Possess web hosting experience in managing, maintaining, and troubleshooting server infrastructure to ensure website uptime, security, and performance.
- In-depth knowledge of Linux/Unix environments and system administration.
- Understanding of cybersecurity principles and network security implementation.
- Experience with virtualization platforms (e.g. Proxmox VE, VMware, KVM, Hyper-V, or cloud equivalents).
- Knowledge of shell scripting, common programming languages like Perl, Python, Java, C/C++.
- Strong troubleshooting and problem-solving skills, with ability to respond effectively under pressure.
- Familiarity with server load balancing, redundancy, and failover solutions.
- Hands-on experience with network monitoring tools and diagnostic utilities.
- Excellent communication skills and ability to work in a multidisciplinary, global team.