We are looking for Software Engineers (SE) who can break down and solve complex problems with a strong motivation to get things done with a boots-on-the-ground, pragmatic mindset! Our engineers own their products end to end and influence the way our products and technology are deployed to facilitate most aspects of drug discovery, impacting hundreds of thousands of patients around the world. We are looking for engineers who can creatively handle complex dependencies and ambiguous requirements, compete with business priorities, while producing fit-for-purpose, optimal solutions.
We are hoping that you are passionate about collaborating across the interface between hard-core software development and research and discovery data analysis.
• Design and implement engineering tools, applications and solutions that facilitate research processes and scientific discovery in several areas of our drug discovery process.
• Help drive the design and architecture of adopted engineering solutions with a detail-oriented mindset
• Promote and help with the adoption of development, design, architecture, and DevOps best practices, with a particular focus on agile deliver mindset
• Lead and mentor smaller team of developers (squads) to ensure timely and quality delivery of multiple product iterations
• Drive product discovery and requirements clarification for ambiguous and/or undefined problems framed with uncertainty.
• Manage technical and business dependencies and bottlenecks; balance technical constraints with business requirements; and deliver maximum business impact with solid customer experience
• Help stakeholders with go/no-go decisions on software and infrastructure by assessing gaps in existing software solutions (internal/external), by vetting technologies/platforms and vendor products
• Strong collaboration, organization skills in cross-functional teams; ability to effectively communicate with technical and non-technical audiences; work closely with scientists, peers, and business leaders in different geographical locations to define and deliver complex engineering features.
MUST
- Proficient (3+ years hands-on experience)w/ at least one language: Java (preferred), Python or C#
- Proficient with Nextflow pipelines, execution, maintenance, debugging
- Bioengineering is large advantage, but not a must.
- with building Cl/CD workflows with Jenkins or equivalent
- with using laC frameworks (Cloud Formation, Ansible, Terraform)
- to build microservice-architecture solutions with a focus on scientific data analysis and management
- to integrate AWS services (EC2/RDS/S3/Batch/KMS/ECS etc .. ) into production workflows
SHOULD HAVE
(2+ years hands-on experience):
- Previous experience with Epam-Cora deployment/support
- with these scripting languages: Python, Bash
- to build production workflows using Java/Spring (preferred), Python or C#/ .Net
- with Linux OS command line
- to drive API (REST, GraphQL, etc.) driven, modular development of production workflows and integration with 3rd party vendor platforms
- to create relational data models, ETL processing pipelines using PostgreSQL (preferred), Oracle, SQL Server, MySQL
SOFT-SKILS MUST
- Strong collaborator, communicator,
- Strong problem-solving skills
- Experienced with making technical partnerships with research and business teams
NICE TO HAVE
(1 + years of experience):
- with building resource intensive HPC analysis modules and/or data processing tasks
- with developing and deploying containerized applications (i.e., Docker, Singularity)
- with containerization platform: Kubernetes/Helm
- with end-to-end testing framework: Robot/Selenium
- Prior experience with
- with non-relational database vendors (Elastic Search, etc .. )
- with building, troubleshooting C, C++ libraries and dependencies on Linux (maven, make, etc .. )
- with frontend development (any JS framework)
Our Engineering team builds core components used by Lab's data analytics, visualization, and management workflows. The analysis tools and pipelines built for data processing by our team in partnership with our scientists aim to accelerate research and the discovery of new therapies for our patients.
We collect, annotate, analyze petabytes of scientific data (multi-omics, chemistry, imaging, safety) used in biomarker research, drug safety/efficacy, drug target discovery, and compendium diagnostic development. We help our scientists to process, analyze scientific data at scale by developing highly parallelized analytical workflows ran on HPC infrastructure (on-prem & cloud); to manage, explore and visualize various scientific data modalities by developing bespoke data models, bioinformatics ETL processes, data retrieval and visualization services using distributed micro-service architecture, FAIR data principles, SPA type dashboards, industry specific regulatory compliant data integrity, auditing, and security access controls.
We are a creative and disciplined software engineering team, using agile practices and established technology stacks to design and develop large-scale data analytics, visualization, and management software solutions for local and on-cloud hosted HPC datacenters, as well as to integrate 3rd party analytical platforms with internal data workflows to address pressing engineering and data science challenges of life-science.