Resources
Database Contributor Review
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
prescriptions, exams, tertiary care, natural language processing, clinical notes
Published: July 14, 2022. Version: 1.1
Database Credentialed Access
Immunosuppressive Condition and Medication Annotations for Admission Notes in the MIMIC-III Database
Published: Aug. 4, 2025. Version: 1.0.0
Database Restricted Access
Application of Med-PaLM 2 in the refinement of MIMIC-CXR labels
Published: Feb. 4, 2025. Version: 1.0.0
Database Credentialed Access
MIMIC-IV-Ext Cardiac Disease
Published: May 6, 2025. Version: 1.0.0
Database Credentialed Access
Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization
Published: Sept. 30, 2022. Version: 1.0.0
Model Credentialed Access
Transformer models trained on MIMIC-III to generate synthetic patient notes
Published: May 27, 2020. Version: 1.0.0
Database Credentialed Access
Annotated Social Determinants of Health Dataset for Adverse Pregnancy Outcomes
Published: Aug. 4, 2025. Version: 1.0.0
Model Credentialed Access
Fine-tuning foundational models to code diagnoses from veterinary health records
transformers, natural language processing, large language models, foundational models, one health, diagnoses, snomed-ct, veterinary medicine, omop cdm, veterinary medical records, clinical coding
Published: Jan. 25, 2026. Version: 1.0.0
Database Credentialed Access
Phenotype Annotations for Patient Notes in the MIMIC-III Database
patient classification, natural language processing
Published: March 5, 2020. Version: 1.20.03
Database Credentialed Access
RadGraph: Extracting Clinical Entities and Relations from Radiology Reports
entity and relation extraction, graph, multi-modal, natural language processing, radiology
Published: June 3, 2021. Version: 1.0.0