Resources
Database Credentialed Access
Medical-CXR-VQA dataset: A Large-Scale LLM-Enhanced Medical Dataset for Visual Question Answering on Chest X-Ray Images
Xinyue Hu, Lin Gu, Kazuma Kobayashi, liangchen liu, Mengliang Zhang, Tatsuya Harada, Ronald Summers, Yingying Zhu
Published: Jan. 21, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-ECG-Ext-ICD: Diagnostic labels for MIMIC-IV-ECG
Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp
machine learning electrocardiography mimic
Published: Aug. 30, 2024. Version: 1.0.1
Database Credentialed Access
CORAL: expert-Curated medical Oncology Reports to Advance Language model inference
Madhumita Sushil, Vanessa Kennedy, Divneet Mandair, Brenda Miao, Travis Zack, Atul Butte
artificial intelligence information extraction oncology natural language processing large language models electronic health records
Published: Feb. 7, 2024. Version: 1.0
Database Open Access
KINECAL
Sean Maudsley-Barton, Moi Hoon Yap
balance posturography clinical tests postural sway falls-risk age-related changes
Published: June 8, 2023. Version: 1.0.3
Database Restricted Access
MIMIC-IV-Ext-Apixaban-Trial-Criteria-Questions
Elizabeth Woo, Michael Craig Burkhart, Emily Alsentzer, Brett Beaulieu-Jones
clinical q and a evaluation set clinical trial eligibility
Published: April 30, 2025. Version: 1.0.0
Database Contributor Review
CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools
Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger
de-identification clinical ner anonymization
Published: April 20, 2024. Version: 1.0.1
Database Open Access
Facial and oral temperature data from a large set of human subject volunteers
Quanzeng Wang, Yangling Zhou, Pejman Ghassemi, Dwith Chenna, Michelle Chen, Jon Casamento, Joshua Pfefer, David Mcbride
clinical accuracy receiver operating characteristic curve infectious disease epidemics thermography fever screening inner canthus elevated body temperature facial maximum temperatures infrared thermograph pearson correlation coefficients thermometry
Published: May 24, 2023. Version: 1.0.0
Database Credentialed Access
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
Daeun Kyung, Hyunseung Chung, Seongsu Bae, Jiho Kim, Jae Ho Sohn, Taerim Kim, Soo Kim, Edward Choi
electronic health records multi-turn dialogue llm simulation doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0
Database Credentialed Access
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Jean-Benoit Delbrouck
Published: Sept. 12, 2025. Version: 1.0.0
Database Restricted Access
HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis
Meishar Meisel, Benjamin Alfred Cohen, Meital Baskin, Beatrice Tiosano, Joachim Behar, Eran Berkowitz
Published: Sept. 9, 2025. Version: 1.0.0