Database Open Access

eICU Collaborative Research Database Demo

Alistair Johnson Tom Pollard Omar Badawi Jesse Raffa

Published: May 6, 2021. Version: 2.0.1


When using this resource, please cite: (show more options)
Johnson, A., Pollard, T., Badawi, O., & Raffa, J. (2021). eICU Collaborative Research Database Demo (version 2.0.1). PhysioNet. https://doi.org/10.13026/4mxk-na84.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

This project is a demo version of the eICU Collaborative Research Database, a multi-center database comprised of deidentified health data for over 200,000 admissions to ICUs across the United States between 2014-2015. The database includes vital sign measurements, care plan documentation, severity of illness measures, diagnosis information, and treatment information. Data is collected through the Philips eICU program, a critical care telehealth program that delivers information to caregivers at the bedside. This open access demo allows researchers to ascertain whether the eICU Collaborative Research Database is suitable for their work. It includes over 2,500 unit stays selected from 20 of the larger hospitals in the eICU Collaborative Research Database.


Background

Philips Healthcare has developed an eICU telehealth system for critically ill patients, which provides 24 hour support for caregivers at the bedside. It is a supplement — not a replacement — to the bedside team, and the data utilized by the remote caregivers is archived for research purposes. A subset of the data is made available to researchers via the eICU Collaborative Research Database, a multi-center intensive care unit database covering over 200,000 admissions to ICUs monitored by eICU Programs across the United States [1]. The database is deidentified, and includes vital sign measurements, care plan documentation, APACHE severity of illness measures, diagnosis information, and treatment details. This open access demo allows researchers to ascertain whether the eICU Collaborative Research Database is suitable for their work.


Methods

The demo dataset contains data associated with (ICU) stays for over 2,500 unit stays selected from 20 of the larger hospitals in the eICU Collaborative Research Database. All tables are deidentified to meet the safe harbor provision of the US Health Insurance Portability and Accountability Act (HIPAA). These provisions include the removal of all protected health information. Hospital and unit identifiers have also been removed to protect the privacy of contributing organizations. The schema was established in collaboration with Privacert (Cambridge, MA), who certified the re-identification risk as meeting safe harbor standards (HIPAA Certification no. 1031219-2).


Data Description

Data include vital signs, laboratory measurements, medications, APACHE components, care plan information, admission diagnosis, patient history, time-stamped diagnoses from a structured problem list, and similarly chosen treatments. Data from each patient is collected into a common warehouse only if certain "interfaces" are available. Each interface is used to transform and load a certain type of data: vital sign interfaces incorporate vital signs, laboratory interfaces provide measurements on blood samples, and so on. It is important to be aware that different care units may have different interfaces in place, and that the lack of an interface will result in no data being available for a given patient, even if those measurements were made in reality. The data is provided as CSVs that can be loaded into a relational database using code provided in the eICU Code Repository [2]. An gzipped SQLite version of the database is also provided in the sqlite folder for convenience.


Usage Notes

The demo provides an opportunity to review the structure and content of the eICU Collaborative Research Database before deciding whether or not to carry out an analysis on the full version. The demo also provides a useful sample dataset to demonstrate that code developed for the eICU Collaborative Research Database is functional and returning expected results (for example, for verifying behaviour in unit tests and demonstrating reproducibility in published papers).


Release Notes

This demo is associated with eICU-CRD v2.0.

Version 2.0.1

  • Added an SQLite3 version of the dataset.
  • Removed unnecessary spacing in the admissionDx, allergy, and customLab tables.

Acknowledgements

The authors would like to thank the Philips eICU Research Institute and Philips Healthcare for contribution of the data. The authors would also like to thank Andrew A. Kramer for insightful comments regarding the data and Dina Demner-Fushman for helpful feedback on the deidentification process.


Conflicts of Interest

The work was supported by grants NIH-R01-EB017205, NIH-R01-EB001659, and NIH-R01-GM104987 from the National Institutes of Health. The MIT Laboratory for Computational Physiology received funding from Philips Healthcare to undertake work on the database described in this paper. O.B. is an employee of Philips Healthcare.


References

  1. Pollard TJ, Johnson AEW, Raffa JD, Celi LA, Mark RG and Badawi O. The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Scientific Data (2018). DOI: http://dx.doi.org/10.1038/sdata.2018.178.
  2. eICU Code Repository on GitHub. https://github.com/MIT-LCP/eicu-code [Accessed: 1 April 2021]

Parent Projects
eICU Collaborative Research Database Demo was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Corresponding Author
You must be logged in to view the contact information.
Versions
  • 2.0 - July 15, 2019
  • 2.0.1 - May 6, 2021

Files

Total uncompressed size: 130.6 MB.

Access the files
Folder Navigation: <base>
Name Size Modified
sqlite
LICENSE.txt (download) 25.2 KB 2021-05-06
SHA256SUMS.txt (download) 2.7 KB 2021-05-06
admissionDx.csv.gz (download) 93.6 KB 2021-05-06
admissiondrug.csv.gz (download) 111.4 KB 2019-05-23
allergy.csv.gz (download) 34.6 KB 2021-05-06
apacheApsVar.csv.gz (download) 66.6 KB 2019-05-23
apachePatientResult.csv.gz (download) 209.1 KB 2019-05-23
apachePredVar.csv.gz (download) 52.4 KB 2019-05-23
carePlanCareProvider.csv.gz (download) 56.4 KB 2019-05-23
carePlanEOL.csv.gz (download) 289 B 2019-05-23
carePlanGeneral.csv.gz (download) 351.9 KB 2019-05-23
carePlanGoal.csv.gz (download) 30.4 KB 2019-05-23
carePlanInfectiousDisease.csv.gz (download) 1.5 KB 2019-05-23
customLab.csv.gz (download) 634 B 2021-05-06
diagnosis.csv.gz (download) 352.1 KB 2019-05-23
hospital.csv.gz (download) 759 B 2019-05-23
infusiondrug.csv.gz (download) 394.2 KB 2019-05-23
intakeOutput.csv.gz (download) 1.7 MB 2019-07-12
lab.csv.gz (download) 5.5 MB 2019-05-23
medication.csv.gz (download) 1.4 MB 2019-05-23
microLab.csv.gz (download) 3.5 KB 2019-07-12
note.csv.gz (download) 259.7 KB 2019-07-12
nurseAssessment.csv.gz (download) 1.0 MB 2019-07-12
nurseCare.csv.gz (download) 488.1 KB 2019-07-12
nurseCharting.csv.gz (download) 16.1 MB 2019-07-12
pastHistory.csv.gz (download) 139.1 KB 2019-05-23
patient.csv.gz (download) 133.6 KB 2019-05-23
physicalExam.csv.gz (download) 759.0 KB 2019-07-12
respiratoryCare.csv.gz (download) 57.1 KB 2019-07-12
respiratoryCharting.csv.gz (download) 1.6 MB 2019-07-12
treatment.csv.gz (download) 450.1 KB 2019-05-23
vitalAperiodic.csv.gz (download) 2.5 MB 2019-05-23
vitalPeriodic.csv.gz (download) 18.5 MB 2019-05-23