Resources
Database Contributor Review
CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools
Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger
de-identification clinical ner anonymization
Published: April 20, 2024. Version: 1.0.1
Database Restricted Access
MIMIC-Eye: Integrating MIMIC Datasets with REFLACX and Eye Gaze for Multimodal Deep Learning Applications
Chihcheng Hsieh, Chun Ouyang, Jacinto C Nascimento, Joao Pereira, Joaquim Jorge, Catarina Moreira
Published: March 23, 2023. Version: 1.0.0
Database Credentialed Access
RadGraph2: Tracking Findings Over Time in Radiology Reports
Adam Dejl, Sameer Khanna, Patricia Therese Pile, Kibo Yoon, Steven QH Truong, Hanh Duong, Agustina Saenz, Pranav Rajpurkar
chest x-rays relation extraction disease progression information extraction radiology reports named entity recognition
Published: Aug. 8, 2024. Version: 1.0.0
Model Credentialed Access
EntityBERT: BERT-based Models Pretrained on MIMIC-III with or without Entity-centric Masking Strategy for the Clinical Domain
Chen Lin, Steven Bethard, Guergana Savova, Timothy Miller, Dmitriy Dligach
Published: March 17, 2022. Version: 1.0.1
Database Credentialed Access
MIMIC-IV-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp
Jing Wang, Xing Niu, Tong Zhang, Jie Shen, Juyong Kim, Jeremy Weiss
mimic clinical event annotation time series temporal annotation
Published: Sept. 29, 2025. Version: 1.0.0
Database Credentialed Access
RadNLI: A natural language inference dataset for the radiology domain
Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz, Dan Jurafsky
Published: June 29, 2021. Version: 1.0.0
Database Credentialed Access
National Institutes of Health Stroke Scale (NIHSS) Annotations for the MIMIC-III Database
Jiayang Wang, Xiaoshuo Huang, Lin Yang, Jiao Li
Published: Jan. 25, 2021. Version: 1.0.0
Database Restricted Access
HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis
Meishar Meisel, Benjamin Alfred Cohen, Meital Baskin, Beatrice Tiosano, Joachim Behar, Eran Berkowitz
Published: Sept. 9, 2025. Version: 1.0.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0
Model Credentialed Access
Characterization of Stigmatizing Language in Medical Records
Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze
clinical natural language processing domain transfer bias stigmatizing language large language models mimic
Published: Nov. 6, 2023. Version: 1.0.0