Skip to main content

Bioinformatics Data Engineer

At Genmab, we’re committed to building extra[not]ordinary futures together, by developing antibody products and pioneering, knock-your-socks-off therapies that change the lives of patients and the future of cancer treatment and serious diseases. From our people who are caring, candid, and impact-driven to our business, which is innovative and rooted in science, we believe that being proudly unique, determined to be our best, and authentic is essential to fulfilling our purpose.

The role

As Bioinformatics Data Engineer you will contribute to the mission of the global data engineering function and be responsible for many aspects of data including architecture, access, classification, standards, integration, pipelines and visualization. Although your role will involve a diverse set of data-related responsibilities, your expertise will be on automated processing of mostly biological research data for the Discovery Department and, particularly, for the Discovery Data Scientists.  You will leverage your expertise in pipeline development with scientific data objects to model and catalog large amounts of data with corresponding metadata layers.  

You will work closely with data scientists to determine what metadata will be required to retrieve data and how to capture the information in an automated way. Your ultimate goal will be to place data at the fingertips of stakeholders and enable science to go faster. You will join an enthusiastic, fast-paced and explorative global data engineering team.

The Data Products team supports Genmab's mission by helping researchers use data at its full potential! Particularly, the Utrecht team supports the Discovery department with the ingestion, flow, and processing of biological and operational data. We work closely with researchers, managers, IT staff and data scientists to find solutions together that fit Genmab’s data needs.

The Data Products team is spread between Princeton (USA), Copenhagen (Denmark) and Utrecht (The Netherlands). This position would be joining the six data engineers currently working in Utrecht (2-3 days onsite expected).

Responsibilities

  • Design, develop and deploy reproducible data pipelines using cloud-native tools. All our pipelines use infrastructure as code, have automated tests and are as re-usable and reproducible as possible.

  • Connect with collaborators (scientists, project managers, etc.) to translate their needs and questions into technical requirements. We then use the requirements to build data pipelines and visualizations that are meaningful, comprehensible, and practical for them.

  • Every data engineer has projects to lead and others in which there are only smaller contributions.

  • Generate comprehensive documentation of the data products developed, both for technical and non-technical users.

  • Promote good (coding/data) practices and lead by example.

Requirements

  • MS/PhD or equivalent experience in Computer Science, Bioinformatics, or related field.

  • 3+ years of demonstrated working experience as a data engineer.

  • Experience with data pipeline design and creation is a must. The pipelines should use good coding practices and the right tool for the job. Experience with ETL jobs (e.g. AWS Glue, Databricks jobs, AWS Lambda) and orchestrators (e.g. AWS StepFunctions) is desirable.

  • Solid experience in database design (partitions, schemas, choosing database type, etc.) and querying languages (SQL, pyspark or similar) is a requirement. Experience with delta lake (delta tables) is a plus.

  • Strong experience writing Python code (including OOP, automated testing, etc.). Experience using R is a plus.

  • Basic knowledge of FAIR principles and GXP rules for data handling is also advantageous but not rigorously required.

  • Although understanding biological data (experimental and clinical data) is not a strong requirement, it could make the candidate more efficient in the job.

  • Experience using version control system (git) in collaborative projects is required. Knowledge in CI/CD pipelines is an advantage.

  • Needs good communication skills in the English language, which is the primary language spoken at Genmab.

About You

  • You are passionate about our purpose and genuinely care about our mission to transform the lives of patients through innovative cancer treatment
  • You bring rigor and excellence to all that you do. You are a fierce believer in our rooted-in-science approach to problem-solving
  • You are a generous collaborator who can work in teams with diverse backgrounds
  • You are determined to do and be your best and take pride in enabling the best work of others on the team
  • You are not afraid to grapple with the unknown and be innovative
  • You have experience working in a fast-growing, dynamic company (or a strong desire to)
  • You work hard and are not afraid to have a little fun while you do so

Locations

Genmab leverages the effectiveness of an agile working environment, when possible, for the betterment of employee work-life balance. Our offices are designed as open, community-based spaces that work to connect employees while being immersed in our state-of-the-art laboratories. Whether you’re in one of our collaboratively designed office spaces or working remotely, we thrive on connecting with each other to innovate.

Anderen bekeken ook

Bioinformatics Data Engineer

Bedrijf:
Genmab
Gemeente:
Zuid-Holland
Contracttype: 
Vast contract, Voltijds
Categorieën: 
Data Engineer, Biostatistician
Opleidingsniveau: 
PhD
Master
Gepubliceerd:
08.02.2024
Deel nu: