Database Credentialed Access
Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset
Ziming Wei , Luke Sagers , Caroline McKenna , Ted Pak , Chanu Rhee , Michael Klompas , Sanjat Kanjilal
Published: Nov. 4, 2025. Version: 1.0.0
When using this resource, please cite:
(show more options)
Wei, Z., Sagers, L., McKenna, C., Pak, T., Rhee, C., Klompas, M., & Kanjilal, S. (2025). Predictors of Hospital Onset Infection: A Matched Retrospective Cohort Dataset (version 1.0.0). PhysioNet. RRID:SCR_007345. https://doi.org/10.13026/k70x-0m81
Please include the standard citation for PhysioNet:
(show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220. RRID:SCR_007345.
Abstract
This repository contains a de-identified and curated patient-level dataset for modeling the impact of fine-grained environmental and patient-level factors on nosocomial acquisition of a wide range of drug-susceptible and drug-resistant pathogens. The data are derived from the clinical data warehouses of the Mass General Brigham health system and span all adult inpatients admitted between May 2015 and July 2024.
We use a matched case/control design and provide two datasets reflecting distinctive but complementary strategies to model our outcome. The first, which we call the environmental analysis, investigates the impact of the hospital unit microbiome, using colonization pressure as a proxy, and matches on prior antibiotic exposure to equalize its selective impact across cases and controls. The second strategy, which we call the patient analysis, investigates the impact of prior fine-grained antibiotic exposures and matches controls spatiotemporally to cases based on their time of entry into the same ward, to equalize the impact of the hospital microbiome.
Each observation in the dataset corresponds to a hospitalization. We look at nosocomial acquisition of 11 common drug-susceptible and drug-resistant pathogens, with cases having a culture positive for the target pathogen between day 3 and day 30 after admission into their room.
Background
Nosocomial acquisition of pathogens is a common and serious complication of medical care. On any given day in the United States, approximately one in 31 hospitalized patients has at least one healthcare-associated infection, underscoring the clinical and public health importance of robust prediction and prevention frameworks [1].
Prior research has shown that both environmental and patient-specific factors are important predictors. From an infection control epidemiology standpoint, two recurring drivers are (i) ward-level exposure, captured by constructs such as colonization pressure [2] and prior-room occupant effects, and (ii) patient-level exposure, including recent and cumulative antibiotic use [3–5]. For example, Gu et al. (2023) showed that prior bed occupants colonized or infected with multidrug-resistant organisms significantly increased the risk of acquisition in subsequent occupants, supporting the notion that microenvironment-level pressure matters [5]. Similarly, Jolivet (2020) showed that colonization pressure in the previous week was associated with acquisition of extended-spectrum beta-lactamase-producing Enterobacterales in medical and surgical intensive care units, but not for MRSA [6].
However, the results of these studies have been difficult to operationalize, as they lack the granularity to inform decisions for patients who have a multitude of risk factors. Furthermore, the absence of publicly available data and open-source code makes reconstructing these models infeasible. We seek to fill this gap by making available two datasets that approach the problem of predicting hospital-acquired infection from electronic health record (EHR) data in unique but complementary ways. This dual-matched design, with one cohort matched by antibiotic exposure and the other by hospital unit, enables a separation of environmental and pharmacologic risk factors rarely possible in prior resources. We also publish a highly annotated code base of an end-to-end pipeline for reproducing this data locally. Together, these datasets represent one of the largest publicly available EHR datasets published for investigating hospital-acquired infection, covering 11 target pathogens and estimating risk from 14 different antibiotic classes and 9 different colonization pressure groups. The scale, high level of curation, and code base make this dataset uniquely suited for both mechanistic and machine learning studies of nosocomial infection risk.
The environmental analysis seeks to estimate the impact of a hospital unit's microbiome on the risk of hospital-acquired infection by matching controls to cases across 14 different classes of antibiotics. By doing so, we equalize the selective pressure of antibiotic exposure across cases and controls. Given the finite number of indications for antibiotics, it also serves to equalize a number of other confounders, such as comorbidity burden. Our primary predictor is colonization pressure, which represents the prevalence of an organism (or set of organisms) in ward co-occupants at the time of entry by the case or control.
The patient analysis is the converse of the environmental analysis and seeks to understand the impact of exposure to a given antibiotic class on the risk of hospital-acquired infection. Consistent with recent cohort data on hospital-onset Clostridioides difficile infection, antibiotic exposure is a key modifiable predictor of risk, supporting the need to model class-specific exposure histories [4]. Here, controls are selected to be on the same hospital unit in the same month and year as cases, thereby eliminating differences that may occur due to exposure to a unit's microbiome. The primary predictors are prior exposure to a given antibiotic class.
For both analyses, we apply a number of other criteria to ensure both cases and controls did not have prior evidence of infection and were not being actively treated for infection before the observation period. Controls were also matched on demographics, prior surgery, and length of stay in the room.
Methods
Study design
The analysis consists of a matched case-control design [7, 8].
Cohort Selection
Cases and controls were identified based on their room occupancy status, microbiology results, and antibiotic exposure histories.
Time zero (T0, where the subscript represents days) was defined as the timestamp of entry into the qualifying room. This room always represented the first room in a patient’s hospital stay lasting >48 hours, excluding any time in the emergency department.
Inclusion Criteria
- Single qualifying room stay: Patients must have stayed in only one room for >48 hours during the observation period (T+3 to T+30). They may be discharged from their room prior to the end of the observation period.
- No clinical evidence of active infection at the time of admission: Patients must not have received any antibiotics from T–7 to T+3.
- No clinical evidence of prior colonization: No clinical or surveillance cultures positive for any target organism in the previous 6 months (or 12 months if the organism was S. aureus or an Enterococcus species).
Exclusion Criteria
- Admissions limited to emergency room visits or with no ward stay >48 hours in duration.
- Admissions where the patient stayed in >1 room for >48 hours.
- Any antibiotic exposure between day –7 and day +3.
- Positive clinical or surveillance culture with a target organism within 6 months prior to T0 (12 months for S. aureus or Enterococcus species).
- Missing demographic data (age, sex) or invalid date/time stamps preventing accurate assignment of relative days.
Case Definition
- Cases were patients with identification of a target organism (defined below) in any clinical or surveillance culture between T+3 and T+30, regardless of whether discharge occurred before day 30.
Control Definition
- Controls were patients who met inclusion criteria but did not have evidence of the target organism in any clinical or surveillance culture between T+3 and T+30. Matching strategy described below.
Matching Strategy
For both analyses, controls were matched to cases using predefined demographic, clinical, and temporal criteria to ensure comparable exposure windows and infection risk contexts.
A maximum of three controls were selected per case (1:≤3 matching ratio), depending on the number of eligible controls within the cohort. A single control could be matched to multiple cases if it satisfied all matching criteria, allowing reuse across case sets to preserve cohort representativeness and maximize matching completeness.
Environmental analysis
- Controls were matched to cases based on:
- Demographics (age ±5 years, sex)
- Prior surgery (of any type) in the previous 90 days (1/0)
- Length of stay in room (±3 days) or length of time to case's index culture (whichever was shorter)
- Antibiotic exposure, stratified by:
- Number of courses (defined as ≥2 days of an antibiotic, ≥7 days apart from a different course)
- Antibiotic class (and corresponding members):
- Anti-anaerobic
- Metronidazole
- Penicillins
- Penicillin, amoxicillin, ampicillin
- Extended-spectrum penicillins
- Amoxicillin-clavulanate, ampicillin-sulbactam, piperacillin-tazobactam
- Anti-staphylococcal penicillins
- Dicloxacillin, nafcillin, oxacillin, cefazolin
- Cephalosporins
- Cephalexin, cefadroxil, cefaclor, cefuroxime, cefprozil, cefpodoxime, cefdinir, cefotetan, cefixime, cefoxitin
- Extended-spectrum cephalosporins
- Cefotaxime, ceftazidime, ceftriaxone, cefepime, ceftaroline, cefiderocol
- Carbapenems
- Ertapenem, imipenem, meropenem
- Fluoroquinolones
- Moxifloxacin, ciprofloxacin, levofloxacin, delafloxacin
- Folate inhibitors
- Sulfamethoxazole, trimethoprim, trimethoprim-sulfamethoxazole
- Glycopeptides
- Systemic: vancomycin
- Anti-C. difficile: vancomycin (oral)
- Lincosamides
- Clindamycin
- Macrolides
- Azithromycin, clarithromycin
- Tetracyclines
- Tetracycline, demeclocycline, doxycycline, minocycline, tigecycline, omadacycline, eravacycline
- Anti-anaerobic
Patient analysis
- Controls were matched to cases based on:
- Hospital ward
- Admission month and year
- Length of stay in room (±3 days) or length of time to case's index culture (whichever was shorter)
Outcomes
Nosocomial acquisition (T+3 to T+30) of one of the following target pathogens identified in any clinical or surveillance culture:
- Escherichia coli (susceptible to β-lactams)
- Extended-spectrum β-lactamase-producing (ESBL) E. coli
- Klebsiella pneumoniae (susceptible to β-lactams)
- Extended-spectrum β-lactamase-producing (ESBL) K. pneumoniae
- Clostridioides difficile
- Vancomycin-susceptible Enterococcus faecalis
- Vancomycin-resistant E. faecium
- Methicillin-susceptible Staphylococcus aureus (MSSA)
- Methicillin-resistant S. aureus (MRSA)
- Drug-susceptible Pseudomonas aeruginosa (susceptible to ceftazidime, cefepime, piperacillin-tazobactam, and carbapenems)
- Drug-resistant P. aeruginosa (resistant to at least one β-lactam)
Antibiotic susceptibility phenotypes were derived from the 32nd edition of the M100 document, published by the Clinical and Laboratory Standards Institute.
Model covariates
Environmental analysis
- Colonization pressure, defined as the time-weighted prevalence of a specified organism, or an organism set, in ward co-occupants, given by:
- A ward co-occupant was defined as a person with a >48-hour stay in the 30-day period prior to entry into the ward by the study participant (T-30 to T0).
- = 1 if the ward co-occupant had the target organism within the past 6 months; = 0 if not.
- is the time interval in days between the date the organism was identified in the co-occupant and the date of entry into the ward by the study participant (case or control).
- λ is a tunable decay parameter controlling the influence of time (set to 0.01 by default); this accounts for the diminishing effect of prior organisms over time.
- The look-back period for and was set to 6 months to account for the possibility of organism carriage.
- CP was calculated separately for the following nine organisms and groups (drug-susceptible, drug-resistant, and ESBL phenotypes are defined as above for the target organisms):
- Drug-susceptible Enterobacterales
- ESBL Enterobacterales
- Vancomycin-susceptible Enterococcus (VSE) species
- Vancomycin-resistant Enterococcus (VRE) species
- C. difficile
- MSSA
- MRSA
- Drug-susceptible P. aeruginosa
- Drug-resistant P. aeruginosa
- Example of use:
- Assume patientX enters wardA on T0, and there are five patients who resided on wardA between T-30 and T0 who had C. difficile in the previous 6 months, at a median of 60 days prior to T0.
- The corresponding colonization pressure for C. difficile on wardA for the day patientX is admitted would therefore be 2.744.
- Assume patientX enters wardA on T0, and there are five patients who resided on wardA between T-30 and T0 who had C. difficile in the previous 6 months, at a median of 60 days prior to T0.
- Comorbidities defined by the Elixhauser Index [9]
Patient analysis
- Age
- Sex
- Comorbidities defined by the Elixhauser Index [9]
- Number of courses of the following antibiotic classes in the previous 90 days (members of each class defined above):
- Anti-anaerobic
- Penicillins
- Extended-spectrum penicillins
- Anti-staphylococcal penicillins
- Cephalosporins
- Extended-spectrum cephalosporins
- Carbapenems
- Fluoroquinolones
- Folate inhibitors
- Glycopeptides
- Lincosamides
- Macrolides
- Tetracyclines
Data Description
Note that the dataset HO_infxn_analysis.csv contains data for both the environmental and patient analyses.
Sample Sizes
Environmental analysis
- 14,923 cases were matched to 28,480 control entries for 11 pathogens. The same control can be matched to different pathogens during the matching process.
| Pathogen | Case | Control |
| E. coli | 3,585 | 5,693 |
| ESBL E. coli | 996 | 2,299 |
| K. pneumoniae | 1,704 | 3,684 |
| ESBL K. pneumoniae | 238 | 376 |
| Vancomycin-susceptible E. faecalis | 1,823 | 3,082 |
| Vancomycin-resistant E. faecium | 244 | 366 |
| C. difficile | 592 | 881 |
| MSSA | 2,956 | 6,504 |
| MRSA | 1,101 | 2,397 |
| Drug-susceptible P. aeruginosa | 1,309 | 2,533 |
| Drug-resistant P. aeruginosa | 379 | 665 |
Patient analysis
- 16,933 cases were matched to 39,372 control entries for 11 pathogens. The same control can be matched to different pathogens during the matching process.
| Pathogen | Case | Control |
| E. coli | 4,576 | 9,875 |
| ESBL E. coli | 1,034 | 2,511 |
| K. pneumoniae | 1,723 | 4,194 |
| ESBL K. pneumoniae | 336 | 804 |
| Vancomycin-susceptible E. faecalis | 2,384 | 5,698 |
| Vancomycin-resistant E. faecium | 389 | 962 |
| C. difficile | 1,094 | 2,558 |
| MSSA | 2,375 | 5,387 |
| MRSA | 1,102 | 2,656 |
| Drug-susceptible P. aeruginosa | 1,456 | 3,615 |
| Drug-resistant P. aeruginosa | 464 | 1,112 |
Feature Dictionary
analysis- Description: Matching protocol
- Type: Categorical
- Encoding: N/A
- Units: N/A
- Range: environmental / patient. Descriptions of both analyses in Methods
run- Description: The target pathogen
- Type: Categorical
- Encoding: N/A
- Units: N/A
- Range: The 11 pathogen targets described in the Methods section
group- Description: Binary indicator of whether the entry is a case or a control
- Type: Categorical
- Encoding: N/A
- Units: N/A
- Range: Case / Control
group_binary- Description: Binary indicator of whether the entry is a case or a control
- Type: Categorical
- Encoding: 1: Case / 0: Control
- Units: N/A
- Range: 1 / 0
group_index- Description: Index to identify cases and controls in the same group
- Type: Categorical
- Encoding: 1: Case, 0: Control
- Units: N/A
- Range: 1 / 0
deidentified_patient_id- Description: De-identified patient ID (see Data De-identification)
- Type: Integer
- Encoding: N/A
- Units: N/A
- Range: Environmental analysis: 1–34,338; Patient analysis: 1–49,583
deidentified_month- Description: De-identified month of entry into a hospital unit; dates internally consistent for a patient (see Data De-identification)
- Type: Integer
- Encoding: N/A
- Units: N/A
- Range: 1–12
deidentified_year- Description: De-identified year of entry into a hospital unit; dates internally consistent for a patient (see Data De-identification)
- Type: Integer
- Encoding: N/A
- Units: N/A
- Range: 2000–2100
dept_code- Description: De-identified code for hospital ward (see Data De-identification)
- Type: Integer
- Encoding: N/A
- Units: N/A
- Range: 1–344
duration- Description: Length of stay in the first room
- Type: Numerical
- Encoding: N/A
- Units: days
- Range: 2–30
time_to_infxn- Description: Time between entry into the room (T₀) and the date of positive culture with the pathogen of interest. Applies to cases only; values for controls are set to N/A.
- Type: Numerical
- Encoding: N/A
- Units: days
- Range: 3.1–30 / N/A
matching_duration- Description: Length of time for matching controls to cases; ±3 days of the length of stay (
duration) or the timestamp for index culture (time_to_infxn), whichever is shorter. - Type: Numerical
- Encoding: N/A
- Units: days
- Range: 3.1–30
- Description: Length of time for matching controls to cases; ±3 days of the length of stay (
age- Description: Age of the patient at T₀; patients ≥90 are grouped into a single category
- Type: Numerical
- Encoding: N/A
- Units: years
- Range: 0–90 / >90
sex- Description: Sex of the patient
- Type: Categorical
- Encoding: N/A
- Units: N/A
- Range: Male / Female
any_surgery- Description: Any prior surgery in the previous 90 days
- Type: Binary
- Encoding: 1: Had prior surgery, 0: No prior surgery
- Units: N/A
- Range: 0 / 1
any_abx_0_60- Description: Binary indicator if the patient received any antibiotics in the previous 60 days
- Type: Binary
- Encoding: 1: Had prior antibiotic exposure, 0: No prior antibiotic exposure
- Units: N/A
- Range: 0 / 1
{abx_name}_0_60- Description: Number of courses of
{abx_name}in the previous 60 days, where each course is at least 2 days in duration - Type: Integer
- Encoding: N/A
- Units: Number of courses
- Range:
{abx_name}-specific ranges
- Description: Number of courses of
{pathogen_name}_cp- Description: Ward-level colonization pressure of
{pathogen_name}. Calculated by the time-weighted 6-month prevalence of{pathogen_name}in ward co-occupants who occupied the same ward as a case or control in the previous 30 days - Type: Numerical
- Encoding: N/A
- Units: N/A
- Range:
{pathogen_name}-specific ranges
- Description: Ward-level colonization pressure of
elix_index_mortality- Description: Elixhauser Index calculated based on [9]
- Type: Numerical
- Encoding: N/A
- Units: N/A
- Range: Environmental: –35–101; Patient: –33–101
elix_{elix_name}- Description: Indicator of individual Elixhauser category
- Type: Binary
- Encoding: 1: Specific Elixhauser comorbidity (
{elix_name}) present, 0: Specific Elixhauser comorbidity ({elix_name}) absent - Units: N/A
- Range: 0 / 1
Dependent Files (not included)
micro.csv- Description: Microbiology culture information
- Features in final dataset derived from this dataset:
pathogen– Pathogen IDantibiotics– Antibiotic used for susceptibility testinglab_result– Susceptibility testing result
dems.csv- Description: Demographic information
- Features in final dataset derived from this dataset:
age– Age of the patient at T₀sex– Sex of the patient
ADT.csv- Description: Admission, discharge, and transfer timestamps for all hospital encounters
- Features in final dataset derived from this dataset:
admission_date– Admission timestamp for the patientadmission_loc– Admission location or source of the patient
abx.csv- Description: Antibiotic prescriptions
- Features in final dataset derived from this dataset:
any_abx_0_60– Whether the patient was treated with antibiotics within 60 days{abx_name}_0_60– Number of courses of{abx_name}in the previous 60 days (refers to antibiotic class)
elix.csv- Description: Elixhauser comorbidities
- Features in final dataset derived from this dataset:
elix_{elix_name}– Value of individual Elixhauser categoryelix_index_mortality– Elixhauser Index calculated based on [9]
procedure.csv- Description: Procedural history
- Features in final dataset derived from this dataset:
cpt– CPT code of the procedure prescribed to the patient
cpt.csv- Description: Surgical procedure information
- Features in final dataset derived from this dataset:
any_surgery– Prior history of surgical procedures
- Dependent files:
procedure.csv
col_pressure.csv- Description: Ward-level colonization pressure
- Features in final dataset derived from this dataset:
{pathogen_name}_cp– Calculated by the time-weighted 6-month prevalence of{pathogen_name}in patients who occupied the same ward as a case or control in the previous 30 days
- Dependent files:
micro.csvADT.csv
Data De-identification
- Patient and encounter identifiers were replaced with randomly generated integers to remove all direct identifiers, and
deidentified_patient_idvalues were generated independently for the environmental and patient analyses to prevent cross-linkage between datasets. - Hospital ward was replaced with randomly generated integers.
- To de-identify temporal information in the dataset, each patient was assigned a randomly generated “anchor date” uniformly sampled between January 1, 2000, and January 1, 2100. This anchor date replaced the patient’s earliest recorded date in the dataset. All subsequent dates for the same patient were shifted by the same number of days relative to this anchor date, thereby preserving temporal intervals between events while removing any linkage to real calendar dates.
- For additional privacy protection, patients aged >90 years were recoded into a single category (“>90”).
Usage Notes
Getting Started
To start in R:
Read in the files
environmental_df <- read.csv('HO_infxn_environmental_analysis.csv')
Calculate the mean C. difficile colonization pressure in cases and controls for different target pathogens
cdiff_cp_summary <- environmental_df %>% group_by(run, group) %>% summarize(mean_cp = mean(CDiff_cp, na.rm = TRUE), n = n())
print(cdiff_cp_summary)
Source Code for Data Processing and Analysis
All code necessary for building the clean dataset from dependent raw datasets and running the models is available on GitHub [11]. A static snapshot of the code at the time of initial data submission is included in this repository.
Data Usage
This dataset is a de-identified version of the dataset used in [10]. It enables studies of hospital-acquired infection due to infection history of ward co-occupants and prior antibiotic exposures. It was derived from the Mass General Brigham system, and while designed for generalizable research use, results may be influenced by site-specific factors and the matched case–control design.
Release Notes
Version 1.0.0: Initial public release of the dataset.
Ethics
This study was approved by the Institutional Review Board (IRB) of Massachusetts General Brigham healthcare, protocol 2017P000682.
Conflicts of Interest
The authors declare no competing interests.
References
- Centers for Disease Control and Prevention. Current HAI progress report [Internet]. Healthcare-Associated Infections (HAIs). 2024. Available from: https://www.cdc.gov/healthcare-associated-infections/php/data/progress-report.html [Accessed October 7th, 2025]]
- Bonten MJM. Colonization pressure: a critical parameter in the epidemiology of antibiotic-resistant bacteria. Critical Care (London, England) 2012;16:142. https://doi.org/10.1186/cc11417.
- Mitchell BG, Dancer SJ, Anderson M, Dehn E. Risk of organism acquisition from prior room occupants: a systematic review and meta-analysis. Journal of Hospital Infection. 2015 Nov;91(3):211–7.
- Gilboa M, Regev-Yochay G, Meltzer E, Cohen I, Peretz Y, Zilberman-Daniels T, et al. Antibiotic use and the risk of hospital-onset Clostridioides difficile infection. JAMA Network Open. 2025 Aug 8;8(8):e2525252.
- Gu GY, Chen M, Pan JC, Xiong XL. Risk of multidrug-resistant organism acquisition from prior bed occupants in the intensive care unit: a meta-analysis. Journal of Hospital Infection [Internet]. 2023 Sep 1 [cited 22 Nov 2023];139:44–55. Available from: https://www.sciencedirect.com/science/article/pii/S0195670123002050.
- Jolivet S, Lolom I, Bailly S, et al. Impact of colonization pressure on acquisition of extended-spectrum β-lactamase-producing Enterobacterales and methicillin-resistant Staphylococcus aureus in two intensive care units: a 19-year retrospective surveillance. Journal of Hospital Infection. 2020;105:10–6. https://doi.org/10.1016/j.jhin.2020.02.012.
- Merrick R, McKerr C, Song J, Donnelly K, Gerrard R, Morgan M, Williams C, Craine N. Transferring inpatients between wards drives large nosocomial COVID-19 outbreaks, Wales, 2020–22: a matched case–control study using routine and enhanced surveillance data. Journal of Hospital Infection. 2024 Mar;145:1–10. doi:10.1016/j.jhin.2023.11.014. Epub 2023 Dec 9. PMID: 38081454.
- Hoo GSR, Cai Y, Quek YC, Teo JQ, Choudhury S, Koh TH, Lim TP, Marimuthu K, Ng OT, Kwa AL. Predictors and outcomes of healthcare-associated infections caused by carbapenem-nonsusceptible Enterobacterales: a parallel matched case–control study. Frontiers in Cellular and Infection Microbiology. 2022 Feb 24;12:719421. doi:10.3389/fcimb.2022.719421. PMID: 35281438; PMCID: PMC8907832.
- Yurkovich M, Avina-Zubieta JA, Thomas J, Gorenchtein M, Lacaille D. A systematic review identifies valid comorbidity indices derived from administrative health data. Journal of Clinical Epidemiology. 2015 Jan;68(1):3–14.
- Sagers L, Wei Z, McKenna C, Chan C, Agan AA, Pak R, et al. Hospital unit colonization pressure and nosocomial acquisition of drug susceptible and drug resistant pathogens.
- Kanjilal S. Hospital Onset Infection Analyses [Internet]. GitHub; 2024. Available from: https://github.com/sanjatkanjilal/hospital_onset_infection_analyses [Accessed 28 Oct 2025].
Access
Access Policy:
Only credentialed users who sign the DUA can access the files.
License (for files):
PhysioNet Credentialed Health Data License 1.5.0
Data Use Agreement:
PhysioNet Credentialed Health Data Use Agreement 1.5.0
Required training:
CITI Data or Specimens Only Research
Discovery
DOI (version 1.0.0):
https://doi.org/10.13026/k70x-0m81
DOI (latest version):
https://doi.org/10.13026/pa4z-5293
Topics:
electronic health records
infection control
clinical machine learning
infectious diseases
hospital onset infection
colonization pressure
Project Website:
https://github.com/sanjatkanjilal/hospital_onset_infection_analyses
Corresponding Author
Files
- be a credentialed user
- complete required training:
- CITI Data or Specimens Only Research You may submit your training here.
- sign the data use agreement for the project