Resources
Database Credentialed Access
CXR-PRO: MIMIC-CXR with Prior References Omitted
Vignav Ramesh, Nathan Chi, Pranav Rajpurkar
generation free-text radiology reports references to priors retrieval large language models
Published: Nov. 23, 2022. Version: 1.0.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-GPT-3_5-Generated-Discharge-Summaries-for-Low-Resource-Codes
Matúš Falis, Aryo Pradipta Gema, Hang Dong, Luke Daines, Siddharth Basetti, Michael Holder, Rose Penfold, Alexandra Birch, Beatrice Alex
icd coding large language model data augmentation
Published: Dec. 16, 2024. Version: 1.0.0
Database Credentialed Access
MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing
Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Harshita Sharma, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez Valle, Hoifung Poon, Ozan Oktay
vision-language processing chest x-ray phrase grounding localization
Published: Nov. 15, 2024. Version: 1.1.0
Database Credentialed Access
MIMIC-IV-Note: Deidentified free-text clinical notes
Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark
deidentification critical care electronic health record natural language processing clinical notes mimic
Published: Jan. 6, 2023. Version: 2.2
Database Credentialed Access
RadVLM Instruction Dataset
Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Sonia Laguna, Alain Ryser, Koji Fujimoto, Mizuho Nishio, Thomas Sutter, Julia Vogt, Jonas Kluckert, Thomas Frauenfelder, Christian Bluethgen, Farhad Nooralahzadeh, Michael Krauthammer
chest x-rays vision-language models medical ai
Published: Sept. 25, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images
Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi
question answering chest x-ray radiology electronic health records machine learning multimodal deep learning evaluation visual question answering benchmark
Published: July 19, 2024. Version: 1.0.0
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Jean-Benoit Delbrouck
Published: Sept. 12, 2025. Version: 1.0.0
Database Credentialed Access
Chest ImaGenome Dataset
Joy Wu, Nkechinyere Agu, Ismini Lourentzou, Arjun Sharma, Joseph Paguio, Jasper Seth Yao, Edward Christopher Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo Anthony Celi, Tanveer Syeda-Mahmood, Mehdi Moradi
scene graph visual dialogue object detection semantic reasoning bounding box knowledge graph explainability reasoning relation extraction chest disease progression cxr chest x-ray radiology machine learning multimodal deep learning visual question answering
Published: July 13, 2021. Version: 1.0.0