Resources
Database Credentialed Access
MedNLI - A Natural Language Inference Dataset For The Clinical Domain
Chaitanya Shivade
natural language inference recognizing textual entailment
Published: Oct. 1, 2019. Version: 1.0.0
Challenge Credentialed Access
BioNLP Workshop 2023 Shared Task 1A: Problem List Summarization
Yanjun Gao, Dmitriy Dligach, Timothy Miller, Majid Afshar
bionlp clinical natural language processing electronic health record summarization
Published: Nov. 12, 2023. Version: 2.0.0
Database Restricted Access
CXRGraph: Using Information Extraction to Normalize the Training Data for Automatic Radiology Report Generation
Yuxiang Liao, Hoisang Heung, Hantao Liu, Irena Spasic
relation extraction information extraction natural language processing named entity recognition structured radiology report
Published: Feb. 3, 2025. Version: 1.0.0
Database Credentialed Access
LLaVA-Rad MIMIC-CXR Annotations
Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu-Hsin Wei, Tristan Naumann, Muhao Chen, Matthew Lungren, Akshay Chaudhari, Serena Yeung, Curtis Langlotz, Sheng Wang, Hoifung Poon
Published: Jan. 24, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-III-Ext-VeriFact-BHC: Labeled Propositions From Brief Hospital Course Summaries for Long-form Clinical Text Evaluation
Philip Chung, Akshay Swaminathan, Alex Goodell, Yeasul Kim, Momsen Reincke, Lichy Han, Ben Deverett, Mohammad Amin Sadeghi, Abdel badih El Ariss, Marc Ghanem, David Seong, Andrew Lee, Caitlin Coombes, Brad Bradshaw, Mahir Sufian, Hyo Jung Hong, Teresa Nguyen, Mohammad Rasouli, Komal Kamra, Mark Burbridge, James McAvoy, Roya Saffary, Stephen Parnell Ma, Dev Dash, James Xie, Ellen Wang, Cliff Schmiesing, Nigam Shah, Nima Aghaeepour
artificial intelligence natural language processing clinical notes electronic health records large language models brief hospital course long-form text chart review text reranking atomic claim hybrid retrieval clinical informatics clinical medicine fact verification retrieval-augmented generation logical atomism text embedding formal logic llm-as-a-judge llm evaluation
Published: April 9, 2025. Version: 1.0.0
Database Credentialed Access
Medication Extraction Labels for MIMIC-IV-Note Clinical Database
Akshay Goel, Almog Gueta, Omry Gilon, Sofia Erell, Amir Feder
Published: Dec. 12, 2023. Version: 1.0.0
Database Credentialed Access
MIMIC-Ext-DrugDetection
Fabrice Harel-Canada, Nanyun Peng, David Goodman, Ruby Romero, Allan Nguyen, Brandon Moghanian, Anabel Salimian
ehr mimic-iv substance use clinical notes methamphetamine multi-label cocaine drug detection polysubstance use prescription opioid misuse cannabis benzodiazepine misuse injection drug use heroin mimic-iii
Published: Sept. 25, 2025. Version: 1.0.0
Challenge Credentialed Access
CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays
Gregory Holste, Mingquan Lin, Song Wang, Yiliang Zhou, Yishu Wei, Hao Chen, Atlas Wang, Yifan Peng
disease classification artificial intelligence chest x-ray deep learning computer-aided diagnosis long-tailed learning cardiopulmonary disease zero-shot learning
Published: March 19, 2025. Version: 2.0.0
Database Credentialed Access
RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain
Pavel Blinov, Aleksandr Nesterov, Galina Zubkova, Arina Reshetnikova, Vladimir Kokh, Chaitanya Shivade
natural language inference recognizing textual entailment russian language
Published: April 1, 2022. Version: 1.0.0