Resources


Database Open Access

CUILESS2016

A corpus of Concept Unique Identifier concepts taken from the SemEval2015 Task 14.

concept umls snomed

Published: Jan. 24, 2018. Version: 1.0.0


Database Open Access

CUILESS2016

A corpus of Concept Unique Identifier concepts taken from the SemEval2015 Task 14.

concept umls snomed

Published: Jan. 24, 2018. Version: 1.0.0


Database Credentialed Access

CHIFIR: Cytology and Histopathology Invasive Fungal Infection Reports

Vlada Rozova, Anna Khanina, Jasmine Teng, Joanne Teh, Leon Worth, Monica Slavin, karin thursky, Karin Verspoor

A corpus of cytology and histopathology reports annotated for terminology relevant to fungal infections. Ideal for validation of named entity recognition and relation extraction methods.

nlp invasive fungal infections information extraction clinical documentation

Published: July 12, 2023. Version: 1.0.0


Database Restricted Access

REFLACX: Reports and eye-tracking data for localization of abnormalities in chest x-rays

Ricardo Bigolin Lanfredi, Mingyuan Zhang, William Auffermann, Jessica Chan, Phuong-Anh Duong, Vivek Srikumar, Trafton Drew, Joyce Schroeder, Tolga Tasdizen

This dataset contains 3032 cases of eye-tracking data collected while five radiologists dictated reports for frontal chest x-rays, synchronized timestamped dictation transcription, and manual labels for validation of localization of abnormalities.

computer vision eye tracking radiology report chest x-rays radiology machine learning reflacx fixations gaze deep learning

Published: Sept. 27, 2021. Version: 1.0.0


Database Open Access

MIMIC-IV demo data in the OMOP Common Data Model

Michael Kallfelz, Anna Tsvetkova, Tom Pollard, Manlik Kwong, Gigi Lipori, Vojtech Huser, Jeffrey Osborn, Sicheng Hao, Andrew Williams

Preliminary work to transform a MIMIC-IV demo dataset to the OMOP Common Data Model

omop common data model

Published: June 21, 2021. Version: 0.9


Database Credentialed Access

MIMIC-III and eICU-CRD: Feature Representation by FIDDLE Preprocessing

Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, Michael Sjoding, Jenna Wiens

Features and labels from MIMIC-III and eICU-CRD produced by FIDDLE, an EHR preprocessing pipeline.

preprocessing machine learning electronic health record

Published: April 28, 2021. Version: 1.0.0


Database Open Access

Safety and Preliminary Efficacy of Intranasal Insulin for Cognitive Impairment in Parkinson Disease and Multiple System Atrophy

Vera Novak, Peter Novak

Dataset collected as part of a study that aimed to determine the effects of intranasal insulin on cognition and motor performance in Parkinson's disease.

atrophy insulin randomized controlled trial cognitive assessment motor assessment parkinsons

Published: April 10, 2019. Version: 1.0


Software Open Access

Measurement of Global Electrical Heterogeneity

The Global Electrical Heterogeneity (GEH) concept is based on the theory of Wilson’s electrical gradient vector, which characterizes the degree of heterogeneity of the total recovery time across the ventricles.

Published: May 2, 2018. Version: 1.0.0


Challenge Credentialed Access

Analysis of Clinical Text: Task 14 of SemEval 2015

Guergana Savova

This is the dataset for SemEval 2014 and 2015, Analysis of Clinical Text

semeval nlp

Published: Dec. 28, 2014. Version: 2.0