Resources
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence natural language processing clinical notes electronic health records large language models brief hospital course long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Restricted Access
LATTE-CXR: Locally Aligned TexT and imagE, Explainable dataset for Chest X-Rays
Elham Ghelichkhan, Tolga Tasdizen
eye-tracking chest x-ray dataset automatically generated dataset caption-guided object detection image captioning with region-level description grounded radiology report generation phrase grounding xai multi-modal learning local visual-language models localization
Published: Feb. 4, 2025. Version: 1.0.0
Database Credentialed Access
RaDialog Instruct Dataset
Chantal Pellegrini, Ege Özsoy, Benjamin Busam, Nassir Navab, Matthias Keicher
medical image understaning radiology chatbot radiology report generation radiology assistant large vision-language models
Published: July 12, 2024. Version: 1.1.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0
Database Open Access
CPAP Pressure and Flow Data from a Local Trial of 30 Adults at the University of Canterbury
Ella Guy, Jennifer Knopp, Geoff Chase
peep cpap respiratory mechanics pulmonary mechanics respiratory modelling biomedical engineering
Published: March 24, 2022. Version: 1.0.1
Database Credentialed Access
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions
Daeun Kyung, Hyunseung Chung, Seongsu Bae, Jiho Kim, Jae Ho Sohn, Taerim Kim, Soo Kim, Edward Choi
electronic health records multi-turn dialogue llm simulation doctor-patient consultation
Published: Oct. 18, 2025. Version: 1.0.0
Database Restricted Access
TN-Mammo: A Multi-view Mammography Dataset for Breast Density Classification
Binh Nguyen, Cat Le, Loc Vu, Quynh Nguyen, Ha-Hieu Pham, Phuong Anh Vu, Thuan Huynh, Cao Tien Dung, Nghiem Diep Tuong, Byung-Woo Hong
Published: Oct. 4, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp
Jing Wang, Xing Niu, Tong Zhang, Jie Shen, Juyong Kim, Jeremy Weiss
mimic clinical event annotation time series temporal annotation
Published: Sept. 29, 2025. Version: 1.0.0
Database Credentialed Access
RadVLM Instruction Dataset
Nicolas Deperrois, Hidetoshi Matsuo, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Sonia Laguna, Alain Ryser, Koji Fujimoto, Mizuho Nishio, Thomas Sutter, Julia Vogt, Jonas Kluckert, Thomas Frauenfelder, Christian Bluethgen, Farhad Nooralahzadeh, Michael Krauthammer
chest x-rays vision-language models medical ai
Published: Sept. 25, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
Zhenbang Wu, Anant Dadu, Mike Nalls, Faraz Faghri, Jimeng Sun
large language models medical question answering instruction tuning
Published: Sept. 9, 2025. Version: 1.0.0