PhD position - Large Language Models & natural language processing to unlock the potential of healthcare

Updated: 21 days ago
Deadline: 27 Mar 2026

14 Mar 2026
Job Information
Organisation/Company

University Medical Center Utrecht (UMC Utrecht)
Research Field

Computer science » Programming
Medical sciences » Medicine
Researcher Profile

First Stage Researcher (R1)
Application Deadline

27 Mar 2026 - 22:59 (UTC)
Country

Netherlands
Type of Contract

Temporary
Job Status

Not Applicable
Hours Per Week

36.0
Is the job funded through the EU Research Framework Programme?

Not funded by a EU programme
Is the Job related to staff position within a Research Infrastructure?

No

Offer Description

The Methods of Epidemiological Research team at the Julius Center of the UMC Utrecht is seeking an enthusiastic and ambitious PhD candidate. In this project, you will focus on developing, validating, and applying fine-grained natural language processing (NLP) and large language model (LLM) methods to unlock information from Dutch electronic health record (EHR) free text for secondary use in research.
Electronic health records contain a wealth of relevant patient information in unstructured free-text notes. While structured (coded) fields are commonly reused for research, a substantial amount of nuanced and context-rich patient information remains locked in the large amount of narrative texts in EHRs.
NLP and LLM-based methods offer great promise to unlock this important patient information, e.g. for secondary use in (bio)medical and epidemiological research. However, current methods and approaches typically map free text to structured data using rule-based methods, which may lead to inaccurate classifications. In addition, data for secondary use is currently mapped to coarse-grained ontologies or common data models (e.g., the Observational Medical Outcomes Partnership Common Data Model), which may lead to substantial information loss. Moreover, most medical NLP/LLM tools are developed and validated in English, leaving a major gap for Dutch EHR data. This PhD project aims to address these challenges by developing and validating fine-grained information extraction approaches for Dutch EHR texts, minimizing information loss while ensuring robustness, transparency, and practical usability.
You will work at the intersection of epidemiology, clinical research, data science, and AI, with a strong methodological focus.


Where to apply
Website
https://www.academictransfer.com/en/jobs/359346/phd-position-large-language-mod…

Requirements
Specific Requirements

We are looking for a candidate who:

  • Holds an MSc degree in artificial intelligence, natural language processing, computer science, biostatistics, data science, (clinical) epidemiology, biomedical sciences, or a related field
  • Has strong interest in methodological research in healthcare
  • Has experience with NLP and/or language modeling techniques
  • Has solid programming skills (e.g., Python)
  • Is motivated to work at the interface of AI and clinical research
  • Enjoys interdisciplinary collaboration
  • Has excellent written and spoken English skills
  • Proficiency in Dutch is a pre-requisite given the focus on unlocking Dutch EHR texts
  • Experience with (bio)medical research, ontologies, semantic parsing, or privacy-preserving AI methods, is considered a plus.

Additional Information
Benefits

The maximum salary for this position (36 - 36 hours) is € 3.108,00 gross per month based on full-time employment.
In addition, we offer an annual benefit of 8.3%, holiday allowance, travel expenses and career opportunities. The terms of employment are in accordance with the Cao University Medical Centers (UMC).


Additional comments

Contact our colleague:
Carl Moons
0887550495
KMOONS@UMCUTRECHT.NL


Website for additional job details

https://www.academictransfer.com/359346/

Work Location(s)
Number of offers available
1
Company/Institute
UMC Utrecht
Country
Netherlands
City
Utrecht
Postal Code
3584CX
Street
Heidelberglaan 100
Geofield


Contact
City

Utrecht
Website

http://www.umcutrecht.nl/
Street

Heidelberglaan 100
Postal Code

3584 CX

STATUS: EXPIRED

  • X (formerly Twitter)
  • Facebook
  • LinkedIn
  • Whatsapp

  • More share options
    • E-mail
    • Pocket
    • Viadeo
    • Gmail
    • Weibo
    • Blogger
    • Qzone
    • YahooMail



Similar Positions