Responsibilities:
- Oversee the setup and configuration of scalable AWS infrastructure, focusing on compute and storage solutions.
- Contribute to the creation and analysis of a clinical “big data” database.
- Work with interdisciplinary teams to create seamless data acquisition interfaces from sequencing facilities, public data sources, and clinical samples.
- Transition existing pipelines while ensuring version control and addressing challenges such as large data volumes and environment dependencies.
- Build and maintain both production and development environments, utilizing critical tools like Nextflow, Conda, and Matlab.
- Automate workflows, containerize applications, and enhance interfaces to create a dynamic and efficient computational ecosystem that supports research teams.
Profile:
- PhD degree in Computer Science, Bioinformatics, or related field
- Minimum of 3 years of experience with HPCs, cloud infrastructure management and databases
- Strong knowledge of Amazon Web Services (AWS) experience, including AWS Parallel Server, VPC networking, S3, EC2, RDS, and EFS
- Strong knowledge of AWS Cloud resources management and optimization
- Strong knowledge of Terraform and containerization technologies (Docker/Singularity)
- Experience with SLURM and workflow management systems (Nextflow)
- Proficient in scripting languages (Python, SQL)
- Experience with version control systems (Github)
- Excellent communication and collaboration skills
- Excellent problem-solving and analytical skills
Sprachanforderungen:
- A Letter of reference is required to be considered for this position
- All documents must be submitted in English