Resources
Database Credentialed Access
CXR-PRO: MIMIC-CXR with Prior References Omitted
Vignav Ramesh, Nathan Chi, Pranav Rajpurkar
generation free-text radiology reports references to priors retrieval large language models
Published: Nov. 23, 2022. Version: 1.0.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-GPT-3_5-Generated-Discharge-Summaries-for-Low-Resource-Codes
Matúš Falis, Aryo Pradipta Gema, Hang Dong, Luke Daines, Siddharth Basetti, Michael Holder, Rose Penfold, Alexandra Birch, Beatrice Alex
icd coding large language model data augmentation
Published: Dec. 16, 2024. Version: 1.0.0
Database Credentialed Access
MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing
Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Harshita Sharma, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay
vision-language processing chest x-ray phrase grounding localization
Published: Nov. 15, 2024. Version: 1.1.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark
deidentification critical care electronic health record natural language processing clinical notes mimic
Published: Jan. 6, 2023. Version: 2.2
Database Credentialed Access
RadVLM Instruction Dataset
Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Sonia Laguna, Alain Ryser, Koji Fujimoto, Mizuho Nishio, Thomas Sutter, Julia Vogt, Jonas Kluckert, Thomas Frauenfelder, Christian Bluethgen, Farhad Nooralahzadeh, Michael Krauthammer
chest x-rays vision-language models medical ai
Published: Sept. 25, 2025. Version: 1.0.0
Database Credentialed Access
MedVAL-Bench: Expert-Annotated Medical Text Validation Benchmark
Asad Aali, Vasiliki Bikia, Maya Varma, Nicole Chiou, Sophie Ostmeier, Arnav Singhvi, Magdalini Paschali, Ashwin Kumar, Andrew Johnston, Karimar Amador Martinez, Eduardo Perez Guerrero, Paola Cruz Rivera, Sergios Gatidis, Christian Bluethgen, Eduardo Pontes Reis, Eddy Zandee van Rilland, Poonam Hosamani, Kevin Keet, Minjoung Go, Evelyn Ling, David Larson, Curtis Langlotz, Roxana Daneshjou, Jason Hom, Sanmi Koyejo, Emily Alsentzer, Akshay Chaudhari
Published: Nov. 14, 2025. Version: 1.0.1
Database Credentialed Access
MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images
Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi
question answering machine learning evaluation chest x-ray radiology benchmark electronic health records multimodal deep learning visual question answering
Published: July 19, 2024. Version: 1.0.0
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Jean-Benoit Delbrouck
Published: Sept. 12, 2025. Version: 1.0.0
Database Credentialed Access
RadVLM Instruction Dataset
Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Sonia Laguna, Alain Ryser, Koji Fujimoto, Mizuho Nishio, Thomas Sutter, Julia Vogt, Jonas Kluckert, Thomas Frauenfelder, Christian Bluethgen, Farhad Nooralahzadeh, Michael Krauthammer
chest x-rays vision-language models medical ai
Published: Sept. 25, 2025. Version: 1.0.0