Senior Scientific Data Engineer, Data Platform
Recursion
Location
London or Milton Park office (hybrid, 50% office time)
Type
Full-time
Posted
Dec 15, 2025
Compensation
USD 84000 – 84000
Mission
What you will drive
- Build, scale, and operate a data platform that allows users to discover and query across Recursion's diverse datasets including billions of compounds, petabytes of cellular microscopy images, and millions of assay results
- Build relatability and query-ability into heterogeneous datasets by working with biologists, chemists, and data scientists to enable future scientific discovery
- Act as a mentor, coach, and sponsor by sharing technical knowledge and experiences to deliver impact, learning, and growth across teams
Impact
The difference you'll make
This role enables the discovery of new medicines by building data platforms that make diverse biological and chemical datasets discoverable and queryable, ultimately advancing the future of medicine and improving patient lives.
Profile
What makes you a great fit
- Degree in drug-discovery related science (e.g., Chemistry, Biology) to make informed choices on scientific data
- 5+ years of deep experience in modern, cloud-based data engineering tools to build platforms for large datasets
- Expertise in data platform architectures and technologies (data lake, data warehouse) with knowledge of Python, dbt, Prefect, BigQuery, GCP, Kubernetes, and related tools
- Experience working collaboratively on projects with significant ambiguity and technical complexity spanning multiple systems and technologies
Benefits
What's in it for you
Estimated annual base range: £75,900 - £101,900, plus eligibility for annual bonus and equity compensation, comprehensive benefits package, hybrid work environment (50% office time), and company values emphasizing collaboration, learning, and urgency for patients.
About
Inside Recursion
Recursion is a clinical-stage TechBio company that decodes biology to industrialize drug discovery using its Recursion Operating System, which leverages massive biological, chemical, and patient-centric datasets and machine learning to advance medicine.