EPFL, the Swiss Federal Institute of Technology in Lausanne, is one of the most dynamic university campuses in Europe and ranks among the top 20 universities worldwide. EPFL employs more than 6,500 people supporting the three main missions of the institution: education, research and innovation. The EPFL campus offers an exceptional working environment at the heart of a community of more than 18,500 people, including over 14,000 students and 4,000 researchers from more than 120 different countries.

Biomedical Data Integration Specialist

About the Swiss Data Science Center (SDSC)

The Swiss Data Science Center (SDSC) is a national research infrastructure in data science and artificial intelligence (AI). Its mission, to enable data-driven science and innovation for societal impact, drives its initiatives in research projects, knowledge and technology transfer, and education. With a large multidisciplinary team of professionals in Lausanne, Zurich and Villigen, the SDSC provides expertise and services to various domains, such as health and biomedical sciences, energy and sustainability, climate and environment, and large-scale scientific infrastructures. The SDSC also offers initial and executive education programs through EPFL and ETH Zurich.

Your mission

You will play a critical role in bridging the gap between our engineering teams and biomedical researchers. In this interdisciplinary position, you will develop data pipelines, curate and prepare biomedical datasets for analysis and long-term storage, and serve as the primary point of contact for data onboarding and support. You will contribute to high-impact research infrastructure and projects by ensuring that high-quality, well-structured, and standards-compliant data is readily available to drive discovery and innovation.

Main duties and responsibilities include

Data Pipeline Development & Automation

• Design, develop, and maintain workflows and scripts to ingest, transform, validate, and harmonize biomedical data.

• Build tools to support data curation, annotation, and traceability.

• Integrate diverse data types (e.g., clinical records, omics, imaging) into internal or external repositories.

Data Curation & Management

• Ensure data consistency, completeness, and adherence to biomedical data standards (e.g., HL7/FHIR, CDISC, OMOP, DICOM).

• Develop and maintain metadata schemas and documentation to facilitate data reuse.

• Perform quality control and validation checks before data publication or analysis.

User Engagement & Support

• Onboard new research teams and users into data management workflows.

• Provide training, documentation, and ongoing support for data submission and retrieval processes.

• Act as the liaison between researchers, engineers, and IT teams to define and refine data requirements.

Collaboration & Alignment

• Participate in project planning meetings to represent data integration perspectives.

• Work closely with internal engineering teams responsible for system architecture and platform development.

• Contribute to continuous improvement of data management practices and infrastructure.

Your profile

Education & Experience

• Degree in Biomedical Informatics, Bioinformatics, Computational Biology, Data Science, or a related field.

• Experience in biomedical data science, including data analysis, management, curation, and integration.

• Prior work in research or clinical data environments is a strong plus.

Technical Skills

• Proficiency in scripting languages such as Python, R, or similar.

• Familiarity with workflow management tools (e.g., Airflow, Nextflow, Snakemake) is advantageous.

• Understanding of relational databases and query languages (e.g., SQL).

• Knowledge of biomedical data formats, ontologies, and standards.

Soft Skills

• Strong communication skills to translate technical concepts to diverse stakeholders.

• Detail-oriented with a commitment to data quality and reproducibility.

• Collaborative mindset and willingness to support researchers and team members.

We offer

• A dynamic, interdisciplinary environment at the forefront of data-driven innovation in health and biomedical research.

• Opportunities to engage with leading academic, clinical, and industry stakeholders across Switzerland and beyond.

• A collaborative team culture within the SDSC, an institute jointly hosted by EPFL and ETH Zurich.

• Attractive employment conditions in line with EPFL/ETH domain policies.

Information

Contract Start Date: to be agreed

Activity Rate Min: 80%

Activity Rate Max: 100%

Contract Type: Fixed-term (CDD)

Duration: 1 year renewable 

Reference: 1690 

For further information, please contact: hrdatascience@datascience.ch