MSD is seeking a data engineer to support integration of laboratory instrument data using the industry standards from the Allotrope Foundation. In this role, the data engineer will enable frictionless data flow by using automation, data standards, and by following FAIR principles. The resulting analysis-ready data will be used by MSD’s analytical chemists and data scientists advancing pharmaceutical research.
Key responsibilities:
• Building ETL pipelines to integrate data from laboratory instruments
• Creating database queries to support effective use of the laboratory instrument data
• Responding to Allotrope Foundation‘s releases and changes in MSD's DevOps and cloud infrastructure
• Developing MSD's extensions of Allotrope Simple Models in JSON Schema
Required experience and skills:
• Data engineering (e.g., SQL, ETL jobs)
• Using NoSQL databases (e.g., document-oriented databases, graph databases)
• Programming in Python
• Experience with DevSecOps practices like continuous integration (CI) and continuous delivery (CD), source code version control (e.g., Git), infrastructure-as-code (e.g., CloudFormation, Terraform), containers (e.g., Docker).
Desired experience and skills:
• Semantic web technology stack (e.g., RDF data format, SPARQL query language)
• Amazon Web Services, particularly the serverless services (e.g., Lambda, Glue, Step Functions)
• Experience with Atlassian stack of tools for agile software development (e.g., Jira, Confluence).
• Good communication and collaboration skills, and ability to collaborate with other teams to develop solutions.
• Demonstrates growth mindset and ability to work with enterprise teams.
Education: B.A. or B.S. degree, preferably in IT; or relevant skills and experience.
Language: English, professional proficiency.
Location: Fully remote, EMEA region preferred.
.png)

