Pangaea Data
Data Engineer (Healthcare Data)
Salary
Competitive salary
Work type
Hybrid
Level
mid
Category
Data Engineering
About the role
As Data Engineer you will join Pangaea’s team to design and develop integrated applications for its PALLUX platform
About Pangaea Data
Pangaea Data (Pangaea) is a South San Francisco and London based business founded by Dr Vibhor Gupta and Prof Yike Guo (Director Data Science Institute at Imperial College London; Provost, Hong Kong University of Science and Technology). They have worked in medicine and computing for over 20 years and have raised over $300 million through their academic research, including a $110 million grant focused on development work on large language models in medicine. Pangaea’s AI platform, PALLUX, is configured on clinical guidelines to find more untreated (undiagnosed, miscoded, at-risk) and under-treated patients with hard-to-diagnose conditions for screening and treatment at the point of care. Pangaea’s advisors include industry veterans from healthcare and the life sciences, including Lord David Prior (former chairman, NHS England) and Mr. Andy Palmer (former CIO, Novartis).
The Role
As Data Engineer (Healthcare Data), you will join Pangaea’s team to lead and support the development of reliable, scalable, and secure data solutions. The ideal candidate will be experienced with healthcare data standards (e.g. FHIR, OMOP), possess a strong understanding of data privacy regulations (e.g., HIPAA, GDPR), and have technical expertise to design and implement data pipelines, storage systems, and integrations.
This role will continue to evolve as the business grows, but in the short term it will also involve development of the software product and collaboration with the clinical and scientific team. A strong software engineering background and knowledge in AI, especially Machine Learning and Natural Language Processing, is essential. For the right candidate, this is a senior technical position with scope to grow into a leadership role.
Key technical responsibilities will include:
- Design, implement, and maintain ETL pipelines to collect, clean, and transform healthcare data from various sources such as EHR systems, APIs, and databases
- Ensure data quality and integrity through robust testing and validation processes
- Optimize storage solutions for structured and unstructured healthcare data using databases (e.g., MongoDB) and cloud-based data warehouses (e.g., Azure Cosmos, Azure Fabric)
- Collect and maintain gold standard datasets for evaluation and benchmarking with clear instructions, version control, and API documentations.
- Maintain strict compliance with data privacy regulations such as HIPAA, GDPR, and other local healthcare policies
- Work closely with the clinical team to understand data requirements and translate them into technical solutions
- Collaborate with the AI team to provide clean, well-structured datasets for research, and AI/ML models
- Stay up-to-date with the latest data engineering technologies and best practices
Mandatory Requirements
Technical skills:
- Experience working with Electronic Health Records (EHR) systems (e.g. Epic, Cerner)
- A university qualification (Bachelors, Masters, Doctorate) with at least two years of university study in Computer Science, Informatics, Data Science, Engineering, or related
- Experience in data engineering, with a focus on healthcare data preferred
- Familiarity with NoSQL databases (e.g., MongoDB) and relational databases (e.g., PostgreSQL, MySQL)
- 5+ years in Python and SQL work
- Knowledge of ETL tools (e.g., Apache Airflow) and cloud platforms (e.g., AWS, Azure, GCP).
- Understand data modelling concepts and best practices. Experience with healthcare data standards (e.g., HL7, FHIR, ICD, SNOMED, DICOM) preferred
- Excellent problem-solving and communication skills
Personal traits:
- Ability to communicate complex ideas effectively, both verbally and written
- Ability to engage all levels of the company and the customers’ organizations
- Ability to work collaboratively in a team environment
Nice to Have
- 3-5 years experience of managing teams
- Experience working on large-scale, commercial software development projects is a plus
- Experience with research communities and/or efforts, including having published papers (being listed as author) at AI/ML/NLP/CV conferences (e.g. Bio-IT, NeuraIPS, ICML, ICLR, ACL, CVPR and KDD) and journals
- Experience and knowledge of deploying AI and Data solutions for healthcare and pharmaceuticals at scale is desirable
Perks and Benefits
- Flexible working hours
- Salary dependent on experience
- Package of attractive benefits including private medical insurance and monthly travel card
- You will join a dedicated highly renowned team offering you the opportunity to grow and develop your professional skills and profile
- You will have the opportunity to learn about building a startup business from experienced professionals and serial entrepreneurs
Application Contact Information
Your application should include a CV and cover letter highlighting your relevant experiences and motivations. Please send this to careers@pangaeadata.ai
Tech stack

UK Biobank
Data Engineer
£ – £40k
Manchester · 21h ago

BBC
Senior Principal Data Analyst
£85k – £95k
London · 21h ago

BBC
Technology & Data Analyst
£56k – £64k
London · 21h ago

Blue Light Card
Engineering Manager
Competitive salary
London · 22h ago

Blue Light Card
Platform Engineer
Competitive salary
London · 22h ago

Sportscotland
Business Analyst
£ – £48k
Glasgow · 1d ago

Evri
Customer Experience MI Analyst
£ – £26k
Morley · 1d ago

ATG Entertainment
Data Analyst
£35k – £45k
London · 1d ago

Humanoid
Data & Integrations Architect
Competitive salary
London · 2d ago

FMG
Business Analyst
£31k – £41k
Huddersfield · 2d ago

National Highways
Network Data & Intelligence Analyst
£36k – £40k
Bedford · 2d ago

Nottingham Building Society
Senior BI Developer
£ – £50k
Nottingham · 2d ago

National Trust
Business Analyst
£ – £44k
Swindon · 2d ago

Leidos
Data Engineer (Postgres)
£47k – £61k
Farnborough · 3d ago

Sermo
Reporting Analyst - Insights & Analytics
Competitive salary
London · 3d ago

Valda Energy
Commercial Insights Analyst
£30k – £35k
Bicester · 3d ago

Valda Energy
Graduate Pricing Analyst
£26k – £28k
Bicester · 3d ago

Valda Energy
BI Analyst
£30k – £35k
Bicester · 3d ago
Brand Addition
BI Report Developer
Competitive salary
Manchester · 3d ago

Rightmove
Data Engineer
Competitive salary
London · 4d ago

Rightmove
Analytics Engineer
Competitive salary
London · 4d ago

LiveScore
Data Engineer
Competitive salary
London · 4d ago

Thames Water
Business Intelligence Analyst
£ – £65k
Reading · 4d ago

9Fin
Data Engineering Lead
Competitive salary
London · 5d ago

Bending Spoons
Data analyst
£112k – £251k
London · 5d ago

Carter Jonas
Data Analyst
Competitive salary
Birmingham · 5d ago

Saga
Graduate Data Analyst
£28k – £30k
Folkestone · 5d ago

Saga
Data Analyst
£35k – £40k
Folkestone · 5d ago

Wheely
Data Engineer
Competitive salary
London · 5d ago

easyJet
Data Engineer
Competitive salary
Luton · 5d ago

EXL
GCP Data Engineer
Competitive salary
Edinburgh · 5d ago

Envision
Data Test Engineer
Competitive salary
Remote · 5d ago

Xapo Bank
Data Analyst - Client Coverage Group
Competitive salary
Remote · 5d ago

Twinkl
Growth Analytics Lead
Competitive salary
Remote · 6d ago
Department for Business & Trade
Data Quality Lead – Active SC Clearance
£550 – £
London · 6d ago
Pangaea Data
Data Engineer (Healthcare Data)
Competitive salary
London · 6d ago
Synthesia
Senior Data Engineer
Competitive salary
London · 6d ago
St Austell Brewery
BI Analyst
£28k – £31k
Saint Austell · 6d ago
St Austell Brewery
BI Analyst Lead
£35k – £40k
Saint Austell · 6d ago
Blackpool and The Fylde College (B&FC)
BI Analyst
£39k – £42k
Bispham · 6d ago
Hunter Douglas
Junior Product Configuration Analyst
Competitive salary
Colwick · 1w ago
Hunter Douglas
Product Configuration Test Analyst
Competitive salary
Colwick · 1w ago
easyJet
Business Analyst - Web & Content Management Systems
Competitive salary
Luton · 1w ago
Fractal
Senior Data Engineer
Competitive salary
London · 1w ago
Fractal
Data Analyst / BI Developer
Competitive salary
London · 1w ago
Fractal
Business Analyst
Competitive salary
London · 1w ago
UCS College Group
Data Engineer
£29k – £31k
Taunton · 1w ago
InHealth
Senior Data Analyst
£40k – £45k
Middlewich · 1w ago
Riverside
Business Analyst
£55k – £61k
Liverpool · 1w ago
Minute Media
Analytics Engineer
Competitive salary
London · 1w ago
Minute Media
Senior Data Analyst (LTV)
Competitive salary
London · 1w ago
Minute Media
Data Analyst - AI & Agent Traffic Analytics
Competitive salary
London · 1w ago
Moneyfarm
Analytics Engineer
Competitive salary
London · 1w ago
ProSapient
Analytics Engineer (Data Platform)
Competitive salary
London · 1w ago
Calderdale College
Data Analyst
£27k – £30k
Halifax · 1w ago
NatWest Group
Data API Engineer
£74k – £111k
London · 1w ago
Buzz Bingo
Digital Web Analyst
Competitive salary
Gibraltar, UK · 1w ago
LiveWest
Assistant Data Analyst
£29k – £30k
Tolvaddon · 1w ago
Rentokil
Business Analyst
Competitive salary
Crawley · 1w ago
Altro
Senior Data Architect
Competitive salary
Letchworth Garden City · 1w ago
Sizewell C
Data Analytics Lead
Competitive salary
London · 1w ago
Capita
Senior Business Analyst
Competitive salary
Remote · 1w ago
Pipedrive
Director of Marketing Analytics
Competitive salary
London · 1w ago
Trustpilot
Director of Engineering
Competitive salary
London · 1w ago
Nebius
Senior Analyst (Agentic Search)
Competitive salary
London · 1w ago
Recraft
ML Data Engineer
Competitive salary
London · 1w ago
Livestock Information
PowerBI Developer
£50k – £54k
UK-based · 1w ago
Zempler Bank
Data Analyst
Competitive salary
London · 1w ago
Zempler Bank
Associate Operations Data Analyst
Competitive salary
Liverpool · 1w ago

Kubrick Group
Senior Databricks Engineer
Competitive salary
London · 2w ago

Funding Circle
Analytics Lead, Existing Customer Management
Competitive salary
London · 2w ago
Funding Circle
Senior Product Analyst
Competitive salary
London · 2w ago

Funding Circle
Analytics Engineer
Competitive salary
London · 2w ago

Virgin Media O2
Senior Data Engineer
Competitive salary
London · 2w ago

Zopa Bank
Data Analyst (BI & Operations)
Competitive salary
London · 2w ago

Zempler Bank
Data Analyst
Competitive salary
London · 2w ago

Zempler Bank
Associate Operations Data Analyst
Competitive salary
Liverpool · 2w ago

QuantSpark
Data Engineer
Competitive salary
London · 2w ago

London Stock Exchange Group
Senior Data Engineer
Competitive salary
London · 2w ago

Ovo Energy
Lead Data Analyst (Data & AI Platform)
£62k – £70k
London · 2w ago

Ebury
Data Manager - Financial Crime
Competitive salary
London · 2w ago

Compare the Market
Data Delivery Manager (Data & AI)
Competitive salary
London · 2w ago

Zopa
Senior Product Analyst (Current Account)
Competitive salary
London · 2w ago

Dojo
Senior Data Team Lead
Competitive salary
London · 2w ago

Checkout.com
Analyst, Insights & Analytics
Competitive salary
London · 2w ago

Checkout.com
Analytics Engineer
Competitive salary
London · 2w ago

Tem Energy
Staff Analytics Engineer
£101k – £101k
Remote · 2w ago

Zoopla
Data Analyst
Competitive salary
London · 2w ago

Rightmove
Senior Product Analyst
Competitive salary
London · 2w ago

Starling Bank
Analytics Engineer (Finance)
Competitive salary
London · 2w ago

Meta
Product Growth Analyst
Competitive salary
London · 3w ago
Pangaea Data
Data Engineer (Healthcare Data)
Salary
Competitive salary
Work type
Hybrid
Level
mid
Category
Data Engineering
About the role
As Data Engineer you will join Pangaea’s team to design and develop integrated applications for its PALLUX platform
About Pangaea Data
Pangaea Data (Pangaea) is a South San Francisco and London based business founded by Dr Vibhor Gupta and Prof Yike Guo (Director Data Science Institute at Imperial College London; Provost, Hong Kong University of Science and Technology). They have worked in medicine and computing for over 20 years and have raised over $300 million through their academic research, including a $110 million grant focused on development work on large language models in medicine. Pangaea’s AI platform, PALLUX, is configured on clinical guidelines to find more untreated (undiagnosed, miscoded, at-risk) and under-treated patients with hard-to-diagnose conditions for screening and treatment at the point of care. Pangaea’s advisors include industry veterans from healthcare and the life sciences, including Lord David Prior (former chairman, NHS England) and Mr. Andy Palmer (former CIO, Novartis).
The Role
As Data Engineer (Healthcare Data), you will join Pangaea’s team to lead and support the development of reliable, scalable, and secure data solutions. The ideal candidate will be experienced with healthcare data standards (e.g. FHIR, OMOP), possess a strong understanding of data privacy regulations (e.g., HIPAA, GDPR), and have technical expertise to design and implement data pipelines, storage systems, and integrations.
This role will continue to evolve as the business grows, but in the short term it will also involve development of the software product and collaboration with the clinical and scientific team. A strong software engineering background and knowledge in AI, especially Machine Learning and Natural Language Processing, is essential. For the right candidate, this is a senior technical position with scope to grow into a leadership role.
Key technical responsibilities will include:
- Design, implement, and maintain ETL pipelines to collect, clean, and transform healthcare data from various sources such as EHR systems, APIs, and databases
- Ensure data quality and integrity through robust testing and validation processes
- Optimize storage solutions for structured and unstructured healthcare data using databases (e.g., MongoDB) and cloud-based data warehouses (e.g., Azure Cosmos, Azure Fabric)
- Collect and maintain gold standard datasets for evaluation and benchmarking with clear instructions, version control, and API documentations.
- Maintain strict compliance with data privacy regulations such as HIPAA, GDPR, and other local healthcare policies
- Work closely with the clinical team to understand data requirements and translate them into technical solutions
- Collaborate with the AI team to provide clean, well-structured datasets for research, and AI/ML models
- Stay up-to-date with the latest data engineering technologies and best practices
Mandatory Requirements
Technical skills:
- Experience working with Electronic Health Records (EHR) systems (e.g. Epic, Cerner)
- A university qualification (Bachelors, Masters, Doctorate) with at least two years of university study in Computer Science, Informatics, Data Science, Engineering, or related
- Experience in data engineering, with a focus on healthcare data preferred
- Familiarity with NoSQL databases (e.g., MongoDB) and relational databases (e.g., PostgreSQL, MySQL)
- 5+ years in Python and SQL work
- Knowledge of ETL tools (e.g., Apache Airflow) and cloud platforms (e.g., AWS, Azure, GCP).
- Understand data modelling concepts and best practices. Experience with healthcare data standards (e.g., HL7, FHIR, ICD, SNOMED, DICOM) preferred
- Excellent problem-solving and communication skills
Personal traits:
- Ability to communicate complex ideas effectively, both verbally and written
- Ability to engage all levels of the company and the customers’ organizations
- Ability to work collaboratively in a team environment
Nice to Have
- 3-5 years experience of managing teams
- Experience working on large-scale, commercial software development projects is a plus
- Experience with research communities and/or efforts, including having published papers (being listed as author) at AI/ML/NLP/CV conferences (e.g. Bio-IT, NeuraIPS, ICML, ICLR, ACL, CVPR and KDD) and journals
- Experience and knowledge of deploying AI and Data solutions for healthcare and pharmaceuticals at scale is desirable
Perks and Benefits
- Flexible working hours
- Salary dependent on experience
- Package of attractive benefits including private medical insurance and monthly travel card
- You will join a dedicated highly renowned team offering you the opportunity to grow and develop your professional skills and profile
- You will have the opportunity to learn about building a startup business from experienced professionals and serial entrepreneurs
Application Contact Information
Your application should include a CV and cover letter highlighting your relevant experiences and motivations. Please send this to careers@pangaeadata.ai