Background

JMI

JMIR Med Inform

JMIR Medical Informatics

2291-9694

JMIR Publications

Toronto, Canada

v9i7e29986

34086596

10.2196/29986

Original Paper

Ambulatory Risk Models for the Long-Term Prevention of Sepsis: Retrospective Study

Eysenbach

Gunther

Walsh

Joseph

Lee

Jewel Y

MSc 1

https://orcid.org/0000-0002-3539-923X

Molani

Sevda

PhD 1

https://orcid.org/0000-0001-8227-4416

Fang

Chen

PhD 1

https://orcid.org/0000-0001-9425-5626

Jade

Kathleen

ND 1

https://orcid.org/0000-0002-9482-2536

O'Mahony

D Shane

MD 2

https://orcid.org/0000-0003-2168-7262

Kornilov

Sergey A

PhD 1

https://orcid.org/0000-0002-8378-110X

Mico

Lindsay T

MSc 3

https://orcid.org/0000-0003-0486-700X

Hadlock

Jennifer J

MD 1

Institute for Systems Biology

401 Terry Ave N

Seattle, WA, 98109

United States jhadlock@isbscience.org

https://orcid.org/0000-0001-6103-7606

1 Institute for Systems Biology

Seattle, WA

United States 2 Swedish Center for Research and Innovation Swedish Medical Center

Seattle, WA

United States 3 Providence St Joseph Health

Renton, WA

United States

Corresponding Author: Jennifer J Hadlock jhadlock@isbscience.org

7 2021

8 7 2021

9 7

e29986

4 5 2021 26 5 2021 2 6 2021

©Jewel Y Lee, Sevda Molani, Chen Fang, Kathleen Jade, D Shane O'Mahony, Sergey A Kornilov, Lindsay T Mico, Jennifer J Hadlock. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 08.07.2021.

2021

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.

Background

Sepsis is a life-threatening condition that can rapidly lead to organ damage and death. Existing risk scores predict outcomes for patients who have already become acutely ill.

Objective

We aimed to develop a model for identifying patients at risk of getting sepsis within 2 years in order to support the reduction of sepsis morbidity and mortality.

Methods

Machine learning was applied to 2,683,049 electronic health records (EHRs) with over 64 million encounters across five states to develop models for predicting a patient’s risk of getting sepsis within 2 years. Features were selected to be easily obtainable from a patient’s chart in real time during ambulatory encounters.

Results

The models showed consistent prediction scores, with the highest area under the receiver operating characteristic curve of 0.82 and a positive likelihood ratio of 2.9 achieved with gradient boosting on all features combined. Predictive features included age, sex, ethnicity, average ambulatory heart rate, standard deviation of BMI, and the number of prior medical conditions and procedures. The findings identified both known and potential new risk factors for long-term sepsis. Model variations also illustrated trade-offs between incrementally higher accuracy, implementability, and interpretability.

Conclusions

Accurate implementable models were developed to predict the 2-year risk of sepsis, using EHR data that is easy to obtain from ambulatory encounters. These results help advance the understanding of sepsis and provide a foundation for future trials of risk-informed preventive care.

sepsis machine learning electronic health records risk prediction clinical decision making prevention risk factors

Introduction

Sepsis is a life-threatening condition characterized by a systemic immunological response to infection. Each year, more than 1.7 million adults in the United States develop sepsis, and nearly 16% of them die [1]. It is the leading cause of death in hospitals worldwide and puts a huge burden on health care systems [2-4]. Research to date has primarily focused on the inpatient setting, where timely treatment can improve sepsis-associated mortality and morbidity [5-9]. Commonly used risk scores, such as the systemic inflammatory response syndrome (SIRS) score [10], quick sequential organ failure assessment (qSOFA) score [11], and modified early warning score (MEWS) [12], offer benefit once patients are acutely ill, but are less useful for early detection [13-16]. Advanced machine learning has led to more efficient models based on data from larger populations and a greater number of risk factors [17-21], but these are designed for emergency and inpatient settings [21-27].

Better risk models are needed to support community-acquired sepsis prevention. In 2016, Wang et al were the first to develop a risk score for long-term sepsis [28]. Using the REGARDS cohort (n=30,239), they predicted an individual’s 10-year risk of sepsis (REGARD SRS), with a bootstrapped C index of 0.703. The REGARD SRS and SSRS rely on demographic and medical history features that could be obtained by patient self-report, but they also depend on clinical laboratory results from blood and urine, including laboratory tests, such as cystatin-C and high-sensitivity C-reactive protein, which are not routinely measured in community-dwelling patients. Thus, there is a pressing need for a noninvasive solution to guide interventions for preventing sepsis, including immunization, education on infection prevention, and early symptom recognition [29,30]. Published guidelines currently recommend these interventions for some patients, such as those who will be experiencing neutropenia secondary to chemotherapy or posttransplant immunosuppression [31,32], but many other patients at high risk are overlooked. An implementable model that works on real-world patient data could support risk stratification for population health outreach or at the point of care.

Given the increased adoption of electronic health records (EHRs) in ambulatory care [33], a wealth of longitudinal phenotype and exposure data is now accessible to support predictive analytics. Sepsis risk research can move beyond inpatient encounters toward investigation of long-term patient trajectories. Historical data can support more accurate models for clinical decision support and improved resource stewardship. Yet, accuracy is only one dimension of model quality. Two other considerations are implementability in real-world settings and biomedical relevance for discovery of new hypotheses about the mechanisms of disease, prevention, and treatment.

In this study, we developed EHR-based models using supervised machine learning methods to predict the long-term risk of sepsis, investigating both time-invariant and temporal synopsis features. For each model, we reported results for both performance and feature importance, and discussed trade-offs between accuracy, interpretability, implementability, and biomedical relevance. This research investigated the potential to predict long-term sepsis risk in ways that can inform clinical decisions and lead to a better understanding of the disease.

Methods Data and Study Setting

Providence St. Joseph Health (PSJH) is a community health system that includes over 51 hospitals and 1085 clinics. This retrospective study used clinical data from PSJH EHRs for patients who presented for health care at Providence, Swedish, or Kadlec sites in Alaska, California, Montana, Oregon, and Washington. Research was conducted within a Health Insurance Portability and Accountability Act (HIPAA)-secure data platform, after date shifting had been applied to reduce the risk of reidentification. Dates were shifted using a randomly selected offset per patient of up to ±365 days. All time windows below were defined on postshifted dates. Procedures were approved by the Institutional Review Board (IRB) at PSJH (IRB Study Number STUDY2019000389). Records were included for patients who presented for health care at least one time between 2017 and 2019. Our prediction model used records from patients over 18 years of age during a 3-year observation window starting in 2014 to predict sepsis in a 2-year window, starting in 2017. Patient age was calculated for the prediction window start date. Patients with no valid birth date or no encounters prior to 2014 were excluded. Our final study cohort consisted of 2,683,049 patients, including 1,558,851 (58.1%) women and 1,124,198 (41.9%) men, and the median age was 51.36 years. Over 64,000,000 encounters were collected from the cohort patients for feature extraction.

Feature and Label Extraction

Features represent information about the data used as model inputs, and the label is the outcome that the model is trained to predict. In this study, we selected features that can be easily obtained from EHRs, including previously reported long-term risk factors for sepsis [34] and potential risk factors for investigation. Binary outcome variables were used in labeling for classification (1 for sepsis and 0 for no sepsis). Sepsis was defined using the Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT) [35] hierarchical terminology system. The label was set to 1 if the parent concept for sepsis, SNOMED CT identifier (SCTID = 91302008), or any of its descendants was found in the problem list during the prediction window.

The following features were extracted from the observation window: sex, age, ethnicity, race, height, weight, BMI, ambulatory vital signs, history of medical conditions, hospital length of stay, encounters, problem list entries, medical history entries, medication orders, and procedures. Medical conditions were considered present if the SNOMED CT parent concept or any of its descendant concepts were found in the problem list during the observation window. The sepsis feature was included to investigate whether having a history of sepsis is a risk factor for developing sepsis in the future. Ratio features with repeated observations (eg, BMI, vital signs, and hospital length of stay) were transformed through statistical aggregation (minimum, maximum, mean, and standard deviation). All features are defined in Table 1 and categorized into four feature sets as follows: basic, vital signs, medical history, and health care delivery data. In total, 49 features were entered into the supervised machine learning process.

Table 1

Definitions of features used for models in the study for the observation window.

Category		Definition
Basic features
	Sex	Male (1), female (0), missing (−1)
	Age	Age calculated at the start of the prediction window
	Race	Native Hawaiian/Pacific Islander, American Indian/Alaska Native, Asian, Black/African American (1); White (0); other/missing (−1)
	Ethnicity	Hispanic/Latino (1), not Hispanic/Latino (0), missing (−1)
	Height	Last observed height
	Weight	Last observed weight
	Std_BMI	Standard deviation of BMI
Vital sign features
	BP_sys	Average and standard deviation of systolic blood pressure
	BP_dia	Average and standard deviation of diastolic blood pressure
	BT	Average and standard deviation of body temperature
	HR	Average and standard deviation of heart rate
	RR	Average and standard deviation of respiratory rate
Medical history features
	Sepsis	Sepsis (SCTID^a 91302008)
	Pneumonia	Pneumonia (SCTID 233604007)
	Bacterial infection	Bacterial infectious disease (SCTID 87628006)
	Fungal infection	Mycosis (SCTID 3218000)
	Protein-energy malnutrition	Deficiency of macronutrients (SCTID 238107002)
	Cancer	Malignant neoplastic disease (SCTID 363346000)
	COPD^b	Chronic obstructive lung disease (SCTID 13645005)
	Diabetes	Diabetes mellitus (SCTID 73211009)
	Chronic kidney disease	Chronic kidney disease (SCTID 709044004)
	Hypertension	Hypertensive disorder, systemic arterial (SCTID 38341003)
	Deep vein thrombosis	Deep venous thrombosis (SCTID 128053003)
	Arteriosclerosis	Arteriosclerotic vascular disease (SCTID 72092001)
	Peripheral artery disease	Peripheral arterial occlusive disease (SCTID 399957001)
	Coronary artery disease	Coronary arteriosclerosis (SCTID 53741008)
	Heart attack	Myocardial infarction (SCTID 22298006)
	Atrial fibrillation	Atrial fibrillation (SCTID 49436004)
	Stroke	Cerebrovascular accident (SCTID 230690007)
	Heart failure	Heart failure (SCTID 84114007)
Health care delivery features
	n_encounter	Total count of clinical encounters
	n_hospitalization	Total count of hospitalizations
	LOS	Average, minimum, maximum, and standard deviation of length of hospital stay
	n_problem	Total count of problem list entries
	u_problem	Number of unique problem list entries
	n_medical_hx	Total count of medical history entries
	u_medical_hx	Number of unique medical history entries
	n_medication	Total count of prescription medication orders
	u_medication	Number of unique prescription medication orders
	n_procedure	Total count of ordered medical procedures
	u_procedure	Number of unique ordered medical procedures

^aSCTID: Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT) identifier.

^bCOPD: chronic obstructive pulmonary disease.

Machine Learning

Data preprocessing and cleaning were conducted as follows. Missing data in categorical features (sex, race, and ethnicity) were assigned to be −1. Missing data in height, weight, and vital signs were imputed using the carry-forward method if previous observations were available; otherwise, median imputation was used. Outliers in height and weight were detected by calculating the modified z-score based on median absolute deviation (MAD) [36] in equation 1 with a threshold of 3.5. Both outliers and missing data were imputed with the median. Equation 1 is as follows:

Mi = 0.6745 (xi−x̃) MAD (1)

where MAD is the median absolute deviation and x̃ is the median of x.

Patients diagnosed with sepsis accounted for only about 0.8% of the cohort, leading to extremely imbalanced data. To ensure the validity of the model but, at the same time, overcome the class imbalance in the medical data set, we reserved 20% of the original data as a test set and undersampled the other 80% of the data by randomly selecting the same number of patients from the majority class (no sepsis) as the minority class (sepsis) to construct a balanced training set. The train/test split process is shown in Figure 1. This training set was then trained with several machine learning methods, including gradient boosting (GB), support vector machine (SVM), and logistic regression (LR), and validated with 10-fold cross validation. Four models were constructed with different combinations of feature sets. Model 1 used only the basic features. Sequentially, we added vital sign features to model 2, medical history features to model 3, and health care delivery data features to model 4.

Figure 1

Training, validation, and test split for modeling of the long-term risk of sepsis.

Model Performance Evaluation

All classification models were built using scikit-learn [37], an open-source Python machine learning library. Widely adopted performance measures, such as area under the receiver operating characteristic curve (AUROC), precision, sensitivity (or recall), specificity, and likelihood ratio, were used to evaluate the discrimination ability of our prediction models. Appropriate measures were selected based on the class distribution in the models. We also analyzed relative feature importance using the following three methods: (1) Shapley Additive exPlanations (SHAP) algorithm, (2) permutation testing, and (3) model coefficients from L1-regularized logistic regression (L1-LR). SHAP, an algorithm developed from coalition game theory, calculates the average marginal contribution of a feature across all possible coalitions [38]. Permutation testing estimates feature importance by calculating the drop in the performance after permuting the feature. A feature is considered important if shuffling its values increases the model prediction error. Shapley values and permutation feature importance computed on test data avoid the systematic bias in feature selection found with mean decrease impurity–based measures [39]. We also retrieved coefficients from L1-LR to investigate the relevance and directionality of features. LR with L1 regularization is a sparse linear model in which coefficients for unimportant features are reduced to zero [40], and the sign of the coefficient suggests positive or negative association with the model outcome (sepsis) [41].

Results

Table 2 shows the results of 10-fold cross-validation based on training data using GB, SVM, and LR. The results show a consistent trend of model performance, increasing as more features were added. GB slightly outperformed linear classifiers (SVM and LR) in all four models. The best AUROC of 0.8216 was achieved by model 4. The trained GB models were then used to make predictions on the 20% test data set, and they were evaluated with precision, sensitivity (or recall), specificity, positive and negative likelihood ratios, and diagnostic odds ratios because of the highly imbalanced class distribution (Table 3). The test set prevalence was 0.0079 with the population size of 536610. The results showed that the positive likelihood ratio ranged from 2.1135 to 2.8897, and the negative likelihood ratio ranged from 0.3192 to 0.4997. Sensitivity and specificity in each model had similar results in the training set and test set for predicting the sepsis outcome.

Table 2

Ten-fold cross-validation results on the training set.

Model and classifier		Precision	Sensitivity	Specificity	AUROC^a	Ten-fold error (%)
Model 1 (basic)
	GB^b	0.6727	0.6725	0.6725	0.7349	0.29%
	SVM^c	0.6607	0.6606	0.6606	0.7167	0.27%
	LR^d	0.6569	0.6565	0.6565	0.7134	0.29%
Model 2 (basic + VS^e)
	GB	0.6947	0.6946	0.6946	0.7595	0.28%
	SVM	0.6812	0.6811	0.6811	0.7425	0.29%
	LR	0.6776	0.6775	0.6775	0.7399	0.26%
Model 3 (basic + VS + MHX^f)
	GB	0.7008	0.7006	0.7006	0.7671	0.20%
	SVM	0.6897	0.6868	0.6868	0.7502	0.17%
	LR	0.6893	0.6891	0.6891	0.7523	0.18%
Model 4 (basic + VS + MHX + HCD^g)
	GB	0.7483	0.7481	0.7481	0.8216	0.27%
	SVM	0.7191	0.7169	0.7169	0.7910	0.26%
	LR	0.7185	0.7175	0.7175	0.7835	0.19%

^aAUROC: area under the receiver operating characteristic curve.

^bGB: gradient boosting.

^cSVM: support vector machine.

^dLR: logistic regression.

^eVS: vital signs.

^fMHX: medical history.

^gHCD: health care delivery data.

Table 3

Prediction results and 95% confidence intervals for the test set using the trained gradient boosting model.

Model	Precision, value (95% CI)	Sensitivity, value (95% CI)	Specificity, value (95% CI)	LR+^a, value (95% CI)	LR−^b, value (95% CI)	DOR^c
Model 1 (basic)	0.0165 (0.0159-0.0171)	0.6552 (0.6407-0.6694)	0.6900 (0.6887-0.6912)	2.1135 (2.0670-2.1611)	0.4997 (0.4793-0.5209)	4
Model 2 (basic + VS^d)	0.0177 (0.0171-0.0184)	0.6862 (0.6721-0.7001)	0.6980 (0.6968-0.6993)	2.2724 (2.2256-2.3202)	0.4495 (0.4299-0.4701)	5
Model 3 (basic + VS + MHX^e)	0.0184 (0.0177-0.0190)	0.6874 (0.6733-0.7012)	0.7084 (0.7071-0.7096)	2.3570 (2.3086-2.4065)	0.4413 (0.4220-0.4615)	5
Model 4 (basic + VS + MHX + HCD^f)	0.0224 (0.0217-0.0231)	0.7653 (0.7523-0.7779)	0.7352 (0.7340-0.7363)	2.8897 (2.8401-2.9401)	0.3192 (0.3023-0.3371)	9

^aLR+: positive likelihood ratio.

^bLR−: negative likelihood ratio.

^cDOR: diagnostic odds ratio.

^dVS: vital signs.

^eMHX: medical history.

^fHCD: health care delivery data.

To ensure the stability and reliability of the model, SHAP and permutation testing methods were implemented on the GB model. These methods improve the interpretability of the black box model and give a reasonable explanation for the prediction of each outcome. The results for the SHAP algorithm are shown in Figures 2-5. In addition, L1-LR and permutation results for model 4 are presented in Figure 6. In models 1-3, where health care delivery data features were not used, SHAP showed age as the dominant feature for predicting sepsis. Other important features included sex, ethnicity, respiratory rate, heart rate, standard deviation of BMI, history of sepsis, diabetes, and chronic kidney disease. In model 4, where health care delivery data features were added, the most predictive features were the number of unique entries (u_medical_hx), followed by age, the total count of medical history entries (n_medical_hx), the total count of encounters (n_encounter), sex, and the total count of ordered medical procedures (n_procedure). The important features identified in the SHAP algorithm have high permutation importance and high absolute values of coefficients learned by L1-LR models. The sign of the coefficients showed the directionality of those features. Moreover, the average diastolic blood pressure (avg_BP_dia) and the total count of encounters (n_encounter) were assigned with a negative coefficient in all three models, which implied the effect of high values for these features in decreasing the risk of developing sepsis.

Figure 2

The Shapley Additive exPlanations (SHAP) algorithm results for long-term sepsis risk in model 1. (A) The influence of higher and lower values of the feature on the patient's outcome. The left side of this graph represents reduced risk of developing sepsis, and the right side of the graph represents increased risk of developing sepsis. Red dots represent higher values of the feature, and blue dots represent lower values of the feature. Nominal classes are binary (0,1). (B) The ranking of feature importance indicated by SHAP.

Figure 3

The Shapley Additive exPlanations (SHAP) algorithm results for long-term sepsis risk in model 2. (A) The influence of higher and lower values of the feature on the patient's outcome. The left side of this graph represents reduced risk of developing sepsis, and the right side of the graph represents increased risk of developing sepsis. Red dots represent higher values of the feature, and blue dots represent lower values of the feature. Nominal classes are binary (0,1). (B) The ranking of feature importance indicated by SHAP. BP: blood pressure; BT: body temperature; HR: heart rate; RR: respiratory rate.

Figure 4

The Shapley Additive exPlanations (SHAP) algorithm results for long-term sepsis risk in model 3. (A) The influence of higher and lower values of the feature on the patient's outcome. The left side of this graph represents reduced risk of developing sepsis, and the right side of the graph represents increased risk of developing sepsis. Red dots represent higher values of the feature, and blue dots represent lower values of the feature. Nominal classes are binary (0,1). (B) The ranking of feature importance indicated by SHAP. BP: blood pressure; BT: body temperature; COPD: chronic obstructive pulmonary disease; HR: heart rate; RR: respiratory rate.

Figure 5

The Shapley Additive exPlanations (SHAP) algorithm results for long-term sepsis risk in model 4. (A) The influence of higher and lower values of the feature on the patient's outcome. The left side of this graph represents reduced risk of developing sepsis, and the right side of the graph represents increased risk of developing sepsis. Red dots represent higher values of the feature, and blue dots represent lower values of the feature. Nominal classes are binary (0,1). (B) The ranking of feature importance indicated by SHAP. BP: blood pressure; BT: body temperature; COPD: chronic obstructive pulmonary disease; HR: heart rate; LOS: length of hospital stay; RR: respiratory rate.

Figure 6

The L1-regularized logistic regression (L1-LR) algorithm results (A) and permutation testing results for long-term sepsis risk in model 4 (B). BP: blood pressure; BT: body temperature; COPD: chronic obstructive pulmonary disease; HR: heart rate; LOS: length of hospital stay; RR: respiratory rate.

Discussion Principal Findings

In this study, we constructed four interpretable implementable EHR-based models to predict the 2-year risk of sepsis in adults. Each model performed well, considering the complexity of the features included. As expected, model 4 with all 49 features outperformed the others, with an AUROC of 0.8216 achieved by the GB algorithm in the training set. Due to the low prevalence of sepsis outcomes in the 20% test set, the precision was low in all models. However, the positive likelihood ratio of 2.8897 and negative likelihood ratio of 0.3192 achieved by model 4 showed that our model has the ability to identify patients with higher risk of sepsis. The dominant features in this model, accounting for more than half of the feature importance, were the numbers of unique and total medical history entries (u_medical_hx and n_medical_hx), and age. Medical history features suggest an increased burden of underlying health conditions, and aging is the most substantial risk factor for multimorbidity [42]. Comorbidities are known to be significantly higher in patients with sepsis compared to those without sepsis [1,43], but previous models have not included multimorbidity as a distinct feature. Another strong predictor in model 4 was the total number of ordered medical procedures (n_procedure). Procedures, particularly those that are invasive, increase the risk of hospital-acquired infections, and may also be indicative of health status and multimorbidity. The total number of encounters (n_encounter), which was assigned a negative coefficient in L1-LR, was also a strong predictor in model 4. Although it requires further investigation, one possible reason could be that a greater number of health care visits is associated with better access to preventative health care.

Age, ethnicity, sex, average heart rate (avg_HR), and standard deviation of BMI (std_BMI) were the most important features in models 2 and 3. In addition to increasing the risk of multimorbidity, age is a known independent risk factor for sepsis incidence, severity, and outcomes [44]. Whether ethnicity represents a sepsis risk factor is not yet established. Results from epidemiological studies are contrasting [45-47]. Ethnicity may also be associated with socioeconomic status, a health determinant recently found to be associated with a higher rate of hospital admissions for infection [48]. Future tracking of health-related social needs in structured EHR data [49] will support deeper investigation. A higher resting heart rate, which is common in infection, is also a risk factor for all-cause mortality [50] and may suggest a poorer health status. Patients with higher average heart rates may have had infections during previous encounters. Obesity and malnourishment are known risk factors for sepsis [51], but the standard deviation of BMI (change over time) is a new potential risk factor and merits investigation. In models 3 and 4, basic features and vital signs (age, ethnicity, sex, BMI, and heart rate) appeared to be more stronger predictors than well-established medical conditions known to be sepsis risk factors, including heart failure [52], chronic kidney disease [53], chronic obstructive pulmonary disease (COPD) [54], and diabetes [55,56]. Taken together, these findings suggest the possibility that sepsis risk is associated with not only age and medical conditions, but also vital signs and features related to health care delivery.

Although the highest performance was achieved with the health care delivery data features set, it has limited usefulness for discovering potential risk factors given its reliance on aggregated features, such as the number of medical history entries. Inclusion of these aggregated features weakens other predictors that are potentially more biomedically informative, including medical conditions and biomarkers, such as vital signs. The second-best performing model (model 3) identified a subset of biomarkers as strong predictors, including the standard deviation of BMI and average resting heart rate.

In models 3 and 4 that incorporated medical history, the conditions with greater importance for long-term sepsis risk were history of sepsis, heart failure, chronic kidney disease, pneumonia, COPD, and diabetes. In contrast, the most impactful chronic diseases in the REGARDS 10-year prediction score were chronic lung disease, followed by diabetes and peripheral artery disease [28,34]. The difference in risk factors between REGARDS and our models may reflect a different population sample and prediction window, but could also reflect differing definitions for conditions. For example, Wang et al used laboratory markers (estimated glomerular filtration rate, urinary albumin-to-creatinine ratio, and cystatin-C) for chronic kidney disease [28]. We selected diagnostic codes, which are less precise, but more likely to be consistently implementable on EHR data. SNOMED CT was selected because it is a medically curated semantic ontology, which is structured as a directed acyclic graph and used in EHRs across many countries. These codes can be mapped to ICD-10 codes, but different health care systems would likely benefit from retraining and retesting the model for their specific population.

The primary goal of this study was to investigate whether readily available EHR data can predict the long-term risk of developing sepsis during ambulatory visits in real time. Performance could also be useful for assessing population health. Interpretability was a secondary concern, and the feature importance estimates discussed above should be taken as exploratory. Relationships identified in the models reflected shared information content, but not necessarily biomedical relevance or causality. However, feature importance models suggested new insights on potential risk factors for sepsis that merit further investigation.

Limitations

The studied population may have sample bias toward patients with continuous care within one health care system. There are also many common issues with structured EHR data that hamper the extraction of accurate information, including missing data, erroneous data, differences in EHR conventions among providers, and changes in how data are stored in EHRs over time [57]. These were only partially offset by terminology mapping, data removal, or imputation.

Using EHR diagnostic codes to identify sepsis patients also has limitations. First, it may miss cases where patients had sepsis at a different health care system. Second, because there is no confirmatory diagnostic test for sepsis, this model included patients who were treated empirically for sepsis but might not have had it. Third, variations in sepsis diagnosis, documentation, and coding practices could lead to missing sepsis labels [58]. Fourth, it does not differentiate between severe and milder forms of sepsis, or between hospital-acquired and community-acquired sepsis [43].

Future models can take advantage of the Adult Sepsis Event surveillance definition optimized for EHRs, which was recently released by the CDC [1,59]. This criterion uses objective clinical data to identify severe sepsis in hospitalized patients and displays superior sensitivity to diagnostic codes [1]. Lastly, our definition of ambulatory vital signs may include those that were taken in urgent or emergency situations. This is valid for prediction on real-world EHR data, but future models could better distinguish urgent encounters from those that are more likely to represent outpatient baseline.

Conclusions

Strategies for long-term sepsis risk prediction are needed to advance the understanding of the disease and guide efforts for prevention. We used retrospective EHR data from 2,683,049 adults across five US states to develop models for predicting adult patients’ long-term risk of sepsis. Our models achieved a high AUROC and suggested new insights into potential long-term risk factors, including changes in BMI and a higher mean heart rate in ambulatory settings. These models could be implemented at a low cost, requiring only information that is easy to obtain from EHRs in real time. Ambulatory patients at the highest risk for sepsis could benefit from personalized preventative approaches, including increased emphasis on immunization, and education on reducing the risk of infection and recognizing early symptoms of sepsis. This implementable model provides a path toward clinical trials of risk-informed interventions for long-term sepsis prevention.

Abbreviations

AUROC

area under the receiver operating characteristic curve

COPD

chronic obstructive pulmonary disease

EHR

electronic health record

gradient boosting

L1-LR

L1-regularized logistic regression

logistic regression

MAD

median absolute deviation

PSJH

Providence St. Joseph Health

SCTID

Systematized Nomenclature of Medicine-Clinical Terms identifier

SHAP

Shapley Additive exPlanations

SNOMED CT

Systematized Nomenclature of Medicine-Clinical Terms

SVM

support vector machine

This work was funded in part by the Washington Research Foundation. We thank Ryan T Roper and Venkata R Duvvuri for their design and implementation assistance for biomedical concept extraction. We are grateful to Providence St. Joseph Health for sharing their data, data engineering expertise, and computational resources. We appreciate the technical assistance of Mark Premo, Jennifer Jones, and Andrey Dubovoy. We would also like to acknowledge SNOMED International for developing and maintaining SNOMED CT.

None declared.

Rhee

Dantes

Epstein

Murphy

Seymour

Iwashyna

Kadri

Angus

Danner

Fiore

Jernigan

Martin

Septimus

Warren

Karcz

Chan

Menchaca

Wang

Gruber

Klompas

CDC Prevention Epicenter Program

Incidence and Trends of Sepsis in US Hospitals Using Clinical vs Claims Data, 2009-2014

JAMA 2017 10 03 318 13 1241 1249

10.1001/jama.2017.13836

28903154

2654187

PMC5710396

Angus

Linde-Zwirble

Lidicker

Clermont

Carcillo

Pinsky

Epidemiology of severe sepsis in the United States: analysis of incidence, outcome, and associated costs of care

Crit Care Med 2001 07 29 7 1303 10

10.1097/00003246-200107000-00002

11445675

Novosad

Sapiano

Grigg

Lake

Robyn

Dumyati

Felsen

Blog

Dufort

Zansky

Wiedeman

Avery

Dantes

Jernigan

Magill

Fiore

Epstein

Vital Signs: Epidemiology of Sepsis: Prevalence of Health Care Factors and Opportunities for Prevention

MMWR Morb Mortal Wkly Rep 2016 08 26 65 33 864 9

10.15585/mmwr.mm6533e1

27559759

Fleischmann

Scherag

Adhikari

NKJ

Hartog

Tsaganos

Schlattmann

Angus

Reinhart

International Forum of Acute Care Trialists

Assessment of Global Incidence and Mortality of Hospital-treated Sepsis. Current Estimates and Limitations

Am J Respir Crit Care Med 2016 02 01 193 3 259 72

10.1164/rccm.201504-0781OC

26414292

Rivers

Nguyen

Havstad

Ressler

Muzzin

Knoblich

Peterson

Tomlanovich

Early Goal-Directed Therapy in the Treatment of Severe Sepsis and Septic Shock

N Engl J Med 2001 11 08 345 19 1368 1377

10.1056/nejmoa010307

Nguyen

Corbett

Steele

Banta

Clark

Hayes

Edwards

Cho

Wittlake

Implementation of a bundle of quality indicators for the early management of severe sepsis and septic shock is associated with decreased mortality*

Critical Care Medicine 2007 35 4 1105 1112

10.1097/01.ccm.0000259463.33848.3d

Sebat

Musthafa

Johnson

Kramer

Shoffner

Eliason

Henry

Spurlock

Effect of a rapid response system for patients in shock on time to treatment and mortality during 5 years*

Critical Care Medicine 2007 35 11 2568 2575

10.1097/01.ccm.0000287593.54658.89

Coba

Whitmill

Mooney

Horst

Brandt

Digiovine

Mlynarek

McLellan

Boleski

Yang

Conway

Jordan

(The Henry Ford Hospital Sepsis Collaborative Group)

Resuscitation bundle compliance in severe sepsis and septic shock: improves survival, is better late than never

J Intensive Care Med 2011 01 10 26 5 304 13

10.1177/0885066610392499

21220270

0885066610392499

Dellinger

Levy

Rhodes

Annane

Gerlach

Opal

Sevransky

Sprung

Douglas

Jaeschke

Osborn

Nunnally

Townsend

Reinhart

Kleinpell

Angus

Deutschman

ClS

Machado

Rubenfeld

Webb

Beale

Vincent

Moreno

Surviving Sepsis Campaign Guidelines Committee including the Pediatric Subgroup

Surviving sepsis campaign: international guidelines for management of severe sepsis and septic shock: 2012

Crit Care Med 2013 02 41 2 580 637

10.1097/CCM.0b013e31827e83af

23353941

00003246-201302000-00024

Bone

Toward an epidemiology and natural history of SIRS (systemic inflammatory response syndrome)

JAMA 1992 268 24 3452 5

1460735

Heim

Newport

Heit

Graham

Wilcox

Bonsall

Miller

Nemeroff

Pituitary-adrenal and autonomic responses to stress in women after sexual and physical abuse in childhood

JAMA 2000 08 02 284 5 592 7

10.1001/jama.284.5.592

10918705

joc92111

Subbe

Kruger

Rutherford

Gemmel

Validation of a modified Early Warning Score in medical admissions

QJM 2001 10 94 10 521 6

10.1093/qjmed/94.10.521

11588210

Finkelsztein

Jones

Pabón

Delgado

Nakahira

Arbo

Berlin

Schenck

Choi

AMK

Siempos

Comparison of qSOFA and SIRS for predicting adverse outcomes of patients with suspicion of sepsis outside the intensive care unit

Crit Care 2017 03 26 21 1 73

10.1186/s13054-017-1658-5

28342442

10.1186/s13054-017-1658-5

PMC5366240

Rodriguez

Greenwood

Nuckton

Darger

Shofer

Troeger

Jung

Speich

Valencia

Kilgannon

Fernandez

Baumann

Comparison of qSOFA with current emergency department tools for screening of patients with sepsis for critical illness

Emerg Med J 2018 06 02 35 6 350 356

10.1136/emermed-2017-207383

29720475

emermed-2017-207383

van der Woude

van Doormaal

Hutten

J Nellen

Holleman

Classifying sepsis patients in the emergency department using SIRS, qSOFA or MEWS

Neth J Med 2018 05 76 4 158 166

29845938

Khwannimit

Bhurayanontachai

Vattanavanit

Comparison of the accuracy of three early warning scores with SOFA score for predicting mortality in adult sepsis and septic shock patients admitted to intensive care unit

Heart Lung 2019 05 48 3 240 244

10.1016/j.hrtlng.2019.02.005

30902348

S0147-9563(18)30334-0

Henry

Hager

Pronovost

Saria

A targeted real-time early warning score (TREWScore) for septic shock

Sci Transl Med 2015 08 05 7 299 299ra122

10.1126/scitranslmed.aab3719

26246167

7/299/299ra122

Calvert

Desautels

Chettipally

Barton

Hoffman

Jay

Mao

Mohamadlou

Das

High-performance detection and early prediction of septic shock for alcohol-use disorder patients

Ann Med Surg (Lond) 2016 06 8 50 5

10.1016/j.amsu.2016.04.023

27489621

S2049-0801(16)30041-3

PMC4960347

Desautels

Calvert

Hoffman

Jay

Kerem

Shieh

Shimabukuro

Chettipally

Feldman

Barton

Wales

Das

Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach

JMIR Med Inform 2016 09 30 4 3 e28

10.2196/medinform.5909

27694098

v4i3e28

PMC5065680

Horng

Sontag

Halpern

Jernite

Shapiro

Nathanson

Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning

PLoS One 2017 4 6 12 4 e0174708

10.1371/journal.pone.0174708

28384212

PONE-D-15-13098

PMC5383046

Delahanty

Alvarez

Flynn

Sherwin

Jones

Development and Evaluation of a Machine Learning Model for the Early Identification of Patients at Risk for Sepsis

Ann Emerg Med 2019 04 73 4 334 344

10.1016/j.annemergmed.2018.11.036

30661855

S0196-0644(18)31486-0

Calvert

Price

Chettipally

Barton

Feldman

Hoffman

Jay

Das

A computational approach to early sepsis detection

Comput Biol Med 2016 07 01 74 69 73

10.1016/j.compbiomed.2016.05.003

27208704

S0010-4825(16)30112-3

Rothman

Levy

Dellinger

Jones

Fogerty

Voelker

Gross

Marchetti

Beals

Sepsis as 2 problems: Identifying sepsis at admission and predicting onset in the hospital using an electronic medical record-based acuity score

J Crit Care 2017 04 38 237 244

10.1016/j.jcrc.2016.11.037

27992851

S0883-9441(16)30277-5

Komorowski

Celi

Badawi

Gordon

Faisal

The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care

Nat Med 2018 11 22 24 11 1716 1720

10.1038/s41591-018-0213-5

30349085

10.1038/s41591-018-0213-5

Islam

Nasrin

Walther

Yang

Prediction of sepsis patients using machine learning approach: A meta-analysis

Comput Methods Programs Biomed 2019 03 170 1 9

10.1016/j.cmpb.2018.12.027

30712598

S0169-2607(18)31602-X

Calvert

Saber

Hoffman

Das

Machine-Learning-Based Laboratory Developed Test for the Diagnosis of Sepsis in High-Risk Patients

Diagnostics (Basel) 2019 02 13 9 1 20

10.3390/diagnostics9010020

30781800

diagnostics9010020

PMC6468682

Fleuren

Klausch

TLT

Zwager

Schoonmade

Guo

Roggeveen

Swart

Girbes

ARJ

Thoral

Ercole

Hoogendoorn

Elbers

PWG

Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy

Intensive Care Med 2020 03 21 46 3 383 400

10.1007/s00134-019-05872-y

31965266

10.1007/s00134-019-05872-y

PMC7067741

Wang

Donnelly

Griffin

Levitan

Shapiro

Howard

Safford

Derivation of Novel Risk Prediction Scores for Community-Acquired Sepsis and Severe Sepsis*

Critical Care Medicine 2016 44 7 1285 1294

10.1097/ccm.0000000000001666

Choy

Agcaoili

Halimi

Impact of community-based education on sepsis

Crit Care 2009 13 Suppl 4 P42

10.1186/cc8098

Kempker

Wang

Martin

Sepsis is a preventable public health problem

Crit Care 2018 05 06 22 1 116

10.1186/s13054-018-2048-3

29729670

10.1186/s13054-018-2048-3

PMC5936625

Taplitz

Kennedy

Bow

Crews

Gleason

Hawley

Langston

Nastoupil

Rajotte

Rolston

Strasfeld

Flowers

Outpatient Management of Fever and Neutropenia in Adults Treated for Malignancy: American Society of Clinical Oncology and Infectious Diseases Society of America Clinical Practice Guideline Update

JCO 2018 05 10 36 14 1443 1453

10.1200/jco.2017.77.6211

Avery

Michaels

AST Infectious Diseases Community of Practice

Strategies for safe living following solid organ transplantation-Guidelines from the American Society of Transplantation Infectious Diseases Community of Practice

Clin Transplant 2019 09 06 33 9 e13519

10.1111/ctr.13519

30844096

Office-based Physician Electronic Health Record Adoption

Health IT Dashboard 2020-12-05

https://dashboard.healthit.gov/quickstats/pages/physician-ehr-adoption-trends.php

Wang

Donnelly

Yende

Levitan

Shapiro

Dai

Zhao

Heiss

Odden

Newman

Safford

Validation of the REGARDS Severe Sepsis Risk Score

J Clin Med 2018 12 11 7 12 536

10.3390/jcm7120536

30544923

jcm7120536

PMC6306847

SNOMED Internationan 2020-12-03

https://www.snomed.org/

Crosby

How to Detect and Handle Outliers

Technometrics 1994 08 36 3 315 316

10.1080/00401706.1994.10485810

Pedregosa

Varoquaux

Gramfort

Michel

Thirion

Grisel

Blondel

Prettenhofer

Weiss

Dubourg

Vanderplas

Passos

Cournapeau

Brucher

Perrot

Duchesnay

Scikit-learn: Machine Learning in Python

Journal of Machine Learning Research 2011 12 85 2825 2830

Lundberg

Erion

Chen

DeGrave

Prutkin

Nair

Katz

Himmelfarb

Bansal

Lee

From Local Explanations to Global Understanding with Explainable AI for Trees

Nat Mach Intell 2020 01 17 2 1 56 67

10.1038/s42256-019-0138-9

32607472

PMC7326367

Breiman

Last

Rice

Random forests: finding quasars

Statistical Challenges in Astronomy 2006

New York, NY

Springer

243 254

Lee

Abbeel

Efficient L1 Regularized Logistic Regression

AAAI 2021-06-20

https://www.aaai.org/Papers/AAAI/2006/AAAI06-064.pdf

Fonti

Belitser

Feature Selection using LASSO

VU Amsterdam 2017

2021-06-20

https://beta.vu.nl/nl/Images/werkstuk-fonti_tcm235-836234.pdf

Navickas

Petric

Feigl

Seychell

Multimorbidity: What do we know? What should we do?

J Comorb 2016 02 17 6 1 4 11

10.15256/joc.2016.6.72

29090166

joc.2016.6.72

PMC5556462

Rhee

Wang

Zhang

Fram

Kadri

Klompas

Epidemiology of Hospital-Onset Versus Community-Onset Sepsis in U.S. Hospitals and Association With Mortality

Critical Care Medicine 2019 47 9 1169 1176

10.1097/ccm.0000000000003817

Martin

Mannino

Moss

The effect of age on the development and outcome of adult sepsis

Crit Care Med 2006 01 34 1 15 21

10.1097/01.ccm.0000194535.82812.ba

16374151

00003246-200601000-00003

Barnato

Alexander

Linde-Zwirble

Angus

Racial Variation in the Incidence, Care, and Outcomes of Severe Sepsis

Am J Respir Crit Care Med 2008 02 177 3 279 284

10.1164/rccm.200703-480oc

Mayr

Yende

Linde-Zwirble

Peck-Palmer

Barnato

Weissfeld

Angus

Infection rate and acute organ dysfunction risk as explanations for racial differences in severe sepsis

JAMA 2010 06 23 303 24 2495 503

10.1001/jama.2010.851

20571016

303/24/2495

PMC3910506

Chaudhary

Donnelly

Wang

Racial Differences in Sepsis Mortality at U.S. Academic Medical Center–Affiliated Hospitals*

Critical Care Medicine 2018 46 6 878 883

10.1097/ccm.0000000000003020

Donnelly

Lakkur

Judd

Levitan

Griffin

Howard

Safford

Wang

Association of Neighborhood Socioeconomic Status With Risk of Infection and Sepsis

Clin Infect Dis 2018 06 01 66 12 1940 1947

10.1093/cid/cix1109

29444225

4850937

PMC6248765

Committee on the Recommended Social and Behavioral Domains and Measures for Electronic Health Records Board on Population Health and Public Health Practice Institute of Medicine

Capturing Social and Behavioral Domains and Measures in Electronic Health Records: Phase 2 2015

Washington, DC

National Academies Press (US)

Zhang

Shen

Resting heart rate and all-cause and cardiovascular mortality in the general population: a meta-analysis

CMAJ 2016 02 16 188 3 E53 E63

10.1503/cmaj.150535

26598376

cmaj.150535

PMC4754196

Wang

Griffin

Judd

Shapiro

Safford

Obesity and risk of sepsis: a population-based cohort study

Obesity (Silver Spring) 2013 12 05 21 12 E762 9

10.1002/oby.20468

23526732

PMC3795990

Walker

AMN

Drozd

Hall

Patel

Paton

Lowry

Gierula

Byrom

Kearney

Sapsford

Witte

Kearney

Cubbon

Prevalence and Predictors of Sepsis Death in Patients With Chronic Heart Failure and Reduced Left Ventricular Ejection Fraction

J Am Heart Assoc 2018 10 16 7 20 e009684

10.1161/jaha.118.009684

Wang

Gamboa

Warnock

Muntner

Chronic kidney disease and risk of death from infection

Am J Nephrol 2011 8 22 34 4 330 6

10.1159/000330673

21860228

000330673

PMC3169360

Inghammar

Engström

Ljungberg

Löfdahl

Roth

Egesten

Increased incidence of invasive bacterial disease in chronic obstructive pulmonary disease compared to the general population--a population based cohort study

BMC Infect Dis 2014 03 25 14 1 163

10.1186/1471-2334-14-163

24661335

1471-2334-14-163

PMC3976148

Tiwari

Pratyush

Gahlot

Singh

Sepsis in diabetes: A bad duo

Diabetes Metab Syndr 2011 10 5 4 222 7

10.1016/j.dsx.2012.02.026

25572769

S1871-4021(12)00027-6

Frydrych

Bian

O'Lone

Ward

Delano

Obesity and type 2 diabetes mellitus drive immune dysfunction, infection development, and sepsis mortality

J Leukoc Biol 2018 08 01 104 3 525 534

10.1002/jlb.5vmr0118-021rr

Bayley

Belnap

Savitz

Masica

Shah

Fleming

Challenges in using electronic health record data for CER: experience of 4 learning organizations and solutions applied

Med Care 2013 08 51 8 Suppl 3 S80 6

10.1097/MLR.0b013e31829b1d48

23774512

Rhee

Klompas

Sepsis trends: increasing incidence and decreasing mortality, or changing denominator?

J Thorac Dis 2020 02 12 Suppl 1 S89 S100

10.21037/jtd.2019.12.51

32148931

jtd-12-S1-S89

PMC7024753

Rhee

Zhang

Kadri

Murphy

Martin

Overton

Seymour

Angus

Dantes

Epstein

Fram

Schaaf

Wang

Klompas

Sepsis Surveillance Using Adult Sepsis Events Simplified eSOFA Criteria Versus Sepsis-3 Sequential Organ Failure Assessment Criteria*

Critical Care Medicine 2019 47 3 307 314

10.1097/ccm.0000000000003521