PhysioNet/CinC Challenge 2009: Training Set

The training set consists of 60 records selected from the MIMIC II Database. Each record belongs to one of four groups as shown below. For each record, the table below lists the MIMIC II Clinical Database subject name, the corresponding MIMIC II Waveform Database waveform and numerics record names, T0 (which would be the time of the end of the record if it were part of the test set), and the sex and age of the subject. As part of the de-identification process, we have replaced all dates in these records with surrogate dates, most of which (for the next several years, at least) are in the future.

Update (24 August 2009): Ages for all subjects in the MIMIC II Database are available here; Franco Chiarugi pointed out that these were inconsistent with those originally given on this page. The table below has now been corrected.

The object of the challenge is to use information available before T0 to predict if the patient will have an acute hypotensive episode (AHE) that begins during the forecast window (the hour following T0). In this training set, all available information gathered after T0 is provided, and the AHE of interest (if present) begins midway through the forecast window, about 30 minutes after T0.
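As a minimal sketch of the definition above, the forecast window for a record can be derived from its T0. It assumes the T0 values are written as day/month/year followed by a 24-hour time, as in the tables on this page; the function name `forecast_window` is only illustrative.

```python
from datetime import datetime, timedelta

# Assumed layout of the T0 column: DD/MM/YYYY HH:MM (surrogate dates).
T0_FORMAT = "%d/%m/%Y %H:%M"

def forecast_window(t0_text):
    """Return (start, end) of the one-hour forecast window that follows T0."""
    t0 = datetime.strptime(t0_text, T0_FORMAT)
    return t0, t0 + timedelta(hours=1)

# T0 of record a40439 (group H1), taken from the table on this page.
start, end = forecast_window("04/09/2008 18:30")
print("forecast window:", start, "to", end)
```

Only data time-stamped before `start` should be used as input to a predictor; in the training set, the AHE of interest (if any) begins roughly halfway between `start` and `end`.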

Individual clinical, waveform, and numerics records belonging to the training set can be accessed by following the links in the tables below. (If you are using one of the PhysioNet mirrors and the links don't work, please try another mirror or the master PhysioNet server at MIT.) For convenience, all of the clinical records in the training set can be downloaded as a tarball (about 27 MB; for information on unpacking tarballs, see the PhysioNet FAQ). The waveform and numerics records are much larger; you may wish to explore them using WFDB software to read regions of interest directly from the PhysioNet server rather than downloading them in their entirety. If you decide to download these records, the PhysioBank ATM can provide tarballs or zip files of each record; it can also generate .mat files (for Matlab or Octave) of segments of up to one million samples (about 2 hours and 20 minutes of a waveform record, or, in most cases, an entire numerics record).
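The one-million-sample limit mentioned above translates into time as samples divided by sampling rate. The sketch below works this out under two assumptions: numerics are sampled once per minute (as described for the plots on this page), and waveforms are sampled at 125 Hz, a rate that is not stated on this page and should be checked against each record's header.

```python
from datetime import timedelta

def segment_duration(n_samples, samples_per_second):
    """Time span covered by n_samples at the given sampling rate."""
    return timedelta(seconds=n_samples / samples_per_second)

MAX_SAMPLES = 1_000_000            # .mat segment limit quoted above

# ASSUMPTION: 125 Hz waveform sampling (not stated on this page).
waveform_span = segment_duration(MAX_SAMPLES, 125)

# Numerics: one sample per minute, i.e. 1/60 samples per second.
numerics_span = segment_duration(MAX_SAMPLES, 1 / 60)

print("waveform segment:", waveform_span)
print("numerics segment:", numerics_span.days, "days")
```

Under the 125 Hz assumption, a full segment covers a little over two hours of waveform data, which is in the same range as the figure quoted above; at one sample per minute, the same limit covers far more than the length of a typical numerics record.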

A set of plots illustrating mean ABP for each record is available as a tarball (about 14 MB), or individual plots can be viewed here. Both thumbnails in .png format and highly detailed but much larger .pdf plots are provided. To make them easily comparable, all records are plotted to the same scales (0 to 200 hours elapsed time, 0 to 200 mmHg), but some of these records are shorter and others are considerably longer than 200 hours. The mean ABP in these plots comes from the low-resolution time series of ABP measurements made once per minute provided in the "numerics record" associated with each waveform record; in each plot, 0 on the time axis corresponds to the beginning of the numerics record. The clinical, waveform, and numerics records generally do not begin at the same time; to find T0 on a plot, compare T0 with the start time of the numerics record. A record's start time appears in the first line of its header (.hea) file; you can also obtain it with wfdbdesc, if you have installed the WFDB software package, or from the PhysioBank ATM.
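Locating T0 on a plot amounts to subtracting the numerics record's start time from T0. The following is a minimal sketch of that arithmetic; the start time used here is hypothetical (it is not the real start of any record), and the format string assumes both timestamps have been written as DD/MM/YYYY HH:MM.

```python
from datetime import datetime

def hours_from_start(start_text, t0_text, fmt="%d/%m/%Y %H:%M"):
    """Elapsed hours from the numerics record's start to T0 (plot x-coordinate)."""
    start = datetime.strptime(start_text, fmt)
    t0 = datetime.strptime(t0_text, fmt)
    return (t0 - start).total_seconds() / 3600.0

# HYPOTHETICAL start time, for illustration only; read the real one from
# the numerics record's header, wfdbdesc, or the PhysioBank ATM.
example_start = "02/09/2008 06:30"
example_t0 = "04/09/2008 18:30"    # T0 of record a40439 from the table

x = hours_from_start(example_start, example_t0)
print(f"T0 falls at {x:.1f} hours on the plot's time axis")
```

If the result exceeds 200, T0 lies beyond the right-hand edge of the plot, since all plots share the 0 to 200 hour scale.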

In many cases mean ABP is not available for the first few hours of the record, and there may be occasional gaps later on, but records with significant gaps after the ABP first becomes available were excluded.

For additional information, see the Challenge home page, the MIMIC II home page, and our tutorial, Using the MIMIC II Database.

Group H1 (acute hypotensive episode in forecast window, treated with pressors)

Columns: clinical record; waveform record; numerics record; T0 (surrogate date/time); sex/age
s21775 a40439 a40439n 04/09/2008 18:30 M 75
s20658 a40493 a40493n 20/07/2016 08:00 F 72
s22466 a40764 a40764n 01/09/2013 11:20 F 76
s05336 a40834 a40834n 01/06/2011 21:00 M 40
s06349 a40928 a40928n 15/07/2010 19:20 F 89
s08718 a41200 a41200n 08/04/2015 20:05 M 88
s20794 a41447 a41447n 19/10/2014 09:10 M 85
s24799 a41770 a41770n 24/11/2009 09:50 M 66
s26318 a41835 a41835n 04/01/2012 09:45 M 65
s14204 a41882 a41882n 13/10/2013 12:30 F 89
s25699 a41925 a41925n 22/09/2012 13:40 M 35
s07125 a42277 a42277n 12/09/2017 21:40 M 53
s19208 a42397 a42397n 18/10/2010 11:55 F 78
s12821 a42410 a42410n 24/09/2015 13:10 F 77
s06637 * a42928n 05/11/2011 12:40 M 78

Group H2 (AHE in forecast window, not treated with pressors)

Columns: clinical record; waveform record; numerics record; T0 (surrogate date/time); sex/age
s02395 a40006 a40006n 03/06/2011 21:15 F 80
s08779 a40012 a40012n 11/11/2014 02:45 M 58
s23641 a40050 a40050n 03/07/2014 17:20 M 90+
s24924 a40051 a40051n 13/04/2014 16:50 F 79
s00439 a40064 a40064n 15/01/2013 08:30 F 82
s23015 a40076 a40076n 09/07/2012 06:05 M 68
s19603 a40096 a40096n 21/11/2008 12:25 F 75
s02172 a40099 a40099n 22/10/2014 22:15 M 32
s26105 a40113 a40113n 16/09/2016 09:35 M 47
s23594 a40119 a40119n 29/05/2010 19:35 F 66
s21817 a40125 a40125n 11/08/2012 16:55 F 73
s24984 a40127 a40127n 12/09/2009 09:20 M 56
s25602 a40154 a40154n 13/12/2009 04:10 F 77
s23591 a40164 a40164n 04/10/2015 12:00 M 85
s15687 a40172 a40172n 03/11/2015 02:20 F 90+

Group C1 (records not containing acute hypotensive episodes)

Columns: clinical record; waveform record; numerics record; T0 (surrogate date/time); sex/age
s17765 a40282 a40282n 30/06/2016 20:00 M 51
s04860 a40473 a40473n 27/03/2010 21:40 F 57
s26097 a40551 a40551n 05/04/2014 13:00 F 42
s00318 a40802 a40802n 15/07/2009 19:15 M 58
s14495 a40921 a40921n 06/10/2015 09:00 M 59
s22888 a41137 a41137n 07/08/2015 22:35 M 48
s26296 a41177 a41177n 01/03/2017 22:20 M 47
s06180 * a41385n 10/11/2008 20:30 F 45
s07468 a41434 a41434n 03/09/2012 16:30 F 71
s24004 a41466 a41466n 23/06/2010 04:45 M 66
s03133 a41495 a41495n 27/06/2015 22:00 M 46
s02280 a41664 a41664n 20/01/2016 18:10 M 58
s09672 a41934 a41934n 30/07/2015 23:05 F 46
s19418 a42141 a42141n 07/01/2011 15:10 F 39
s15465 a42259 a42259n 01/02/2013 00:25 M 67

Group C2 (AHE, but not in forecast window)

Columns: clinical record; waveform record; numerics record; T0 (surrogate date/time); sex/age
s05786 a40207 a40207n 31/10/2012 06:20 M 54
s25222 a40215 a40215n 21/09/2012 23:00 F 60
s02561 a40225 a40225n 25/10/2012 14:00 F 81
s24923 a40234 a40234n 12/12/2008 23:00 F 82
s16019 a40260 a40260n 24/03/2015 09:15 F 90+
s04286 a40264 a40264n 28/10/2016 21:00 M 61
s17069 a40277 a40277n 05/03/2016 05:00 F 61
s07860 a40306 a40306n 28/08/2009 09:00 F 36
s23020 a40329 a40329n 22/05/2014 18:00 F 84
s24431 a40355 a40355n 21/09/2010 13:00 M 22
s22657 a40374 a40374n 25/08/2012 06:45 M 88
s09341 a40376 a40376n 05/05/2011 01:25 F 62
s05126 a40384 a40384n 21/07/2009 14:15 F 69
s10611 a40408 a40408n 26/06/2012 15:50 M 78
s05289 a40424 a40424n 10/12/2010 18:10 M 76

* There are no high-resolution signals available for record a42928 (only once-per-minute time series of derived measurements). Technical limitations of the data acquisition process occasionally do not permit recording the high-resolution signals, although the derived measurements can be recorded, as happened in this case.

The same situation occurred with record a41385, for which the high-resolution signals were recorded only for a very brief interval (less than a minute).
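For readers who want to work with the group tables programmatically, the rows above can be parsed with a few lines of code. This is a sketch only: the `TrainingRecord` class and `parse_row` function are illustrative names, the date layout is assumed to be DD/MM/YYYY, a "*" in the waveform column is treated as "no usable waveform record", and an age of "90+" is stored as 90 with a flag marking it as a lower bound.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

@dataclass
class TrainingRecord:
    clinical: str
    waveform: Optional[str]      # None when the table shows "*"
    numerics: str
    t0: datetime                 # surrogate date/time
    sex: str
    age: int                     # 90 when the table shows "90+"
    age_is_lower_bound: bool     # True only for "90+"

def parse_row(line):
    """Parse one whitespace-separated data row from the group tables."""
    clinical, waveform, numerics, date, time, sex, age = line.split()
    return TrainingRecord(
        clinical=clinical,
        waveform=None if waveform == "*" else waveform,
        numerics=numerics,
        t0=datetime.strptime(f"{date} {time}", "%d/%m/%Y %H:%M"),
        sex=sex,
        age=int(age.rstrip("+")),
        age_is_lower_bound=age.endswith("+"),
    )

# Two rows copied from the tables above.
no_waveform = parse_row("s06637 * a42928n 05/11/2011 12:40 M 78")
oldest = parse_row("s23641 a40050 a40050n 03/07/2014 17:20 M 90+")
print(no_waveform)
print(oldest)
```

Keeping the waveform field optional makes the two footnoted records easy to filter out in analyses that need high-resolution signals.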