JMIR Medical Informatics (JMIR Med Inform), ISSN 2291-9694

JMIR Publications, Toronto, Canada

Article ID: v8i11e6924; PMID: 33231554; doi: 10.2196/medinform.6924

Original Paper

Temporal Design Patterns for Digital Phenotype Cohort Selection in Critical Care: Systematic Literature Assessment and Qualitative Synthesis

Gunther Eysenbach; Lisa Jordan; Guoqian Jiang

Daniel Capurro, MD, PhD (1), https://orcid.org/0000-0002-9256-1256
Mario Barbe, MD, MSc (3, 4), https://orcid.org/0000-0002-9755-803X
Claudio Daza, BSc, MPH (2), https://orcid.org/0000-0002-9480-4020
Josefa Santa Maria, MD (2), https://orcid.org/0000-0002-5425-1716
Javier Trincado, MD (2), https://orcid.org/0000-0001-9659-5472

1 School of Computing and Information Systems, Centre for Digital Transformation of Health, University of Melbourne, Melbourne, Australia
2 Department of Internal Medicine, School of Medicine, Pontificia Universidad Catolica de Chile, Santiago, Chile
3 Department of Biomedical Informatics, Clinica Alemana, Santiago, Chile
4 Instituto de Ciencias e Innovación en Medicina, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago, Chile

Corresponding Author: Daniel Capurro, dcapurro@unimelb.edu.au
School of Computing and Information Systems, Centre for Digital Transformation of Health, University of Melbourne, Room 3.24, Level 3, Doug McDonnel (Building 168), Parkville Campus, Melbourne, 3010, Australia. Phone: 61 8344 4504

JMIR Med Inform 2020;8(11):e6924; published 24 November 2020

Article history dates: 9 July 2020, 17 August 2020, 30 August 2020, 28 October 2020

©Daniel Capurro, Mario Barbe, Claudio Daza, Josefa Santa Maria, Javier Trincado. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 24.11.2020.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.

Background

Inclusion criteria for observational studies frequently contain temporal entities and relations. The use of digital phenotypes to create cohorts in electronic health record–based observational studies requires rich functionality to capture these temporal entities and relations. However, such functionality is not usually available or requires complex database queries and specialized expertise to build them.

Objective

The purpose of this study is to systematically assess observational studies reported in critical care literature to capture design requirements and functionalities for a graphical temporal abstraction-based digital phenotyping tool.

Methods

We iteratively extracted attributes describing patients, interventions, and clinical outcomes. We qualitatively synthesized studies, identifying all temporal and nontemporal entities and relations.

Results

We extracted 362 temporal and nontemporal entities from 28 primary studies. We generated a synthesis of entities, relations, and design patterns.

Conclusions

We report on the observed types of clinical temporal entities and their relations as well as design requirements for a temporal abstraction-based digital phenotyping system. The results can be used to inform the development of such a system.

Keywords: digital phenotyping; clinical data; temporal abstraction

Introduction

The increasing costs of health care [1] and the rapid advance of new discoveries create the need for streamlining the identification of effective health interventions. The evidence-based clinical practice paradigm promotes the generation of such knowledge through high-quality randomized controlled trials and systematic reviews [2]. However, when we consider the amount of resources required to conduct a randomized controlled trial [3], alternative ways to assess the effectiveness of clinical interventions become attractive.

The broad adoption of electronic health records (EHRs) [4] allows researchers to analyze routinely collected electronic clinical data to conduct comparative effectiveness research. A health care system that systematically analyzes clinical data to generate and test hypotheses should be able to learn from itself, becoming a learning health care system [5]. However, converting a traditional health care system into a learning one faces several organizational, societal, and data-related barriers.

“Good data” is a relative concept [6] because it depends on who the user is and what the data are being used for. When EHR data are collected primarily for direct patient care and not with the explicit objective of generating knowledge, most of the captured information is stored as free text or in other unstructured formats, limiting its reuse potential. Our group has estimated that 75% of all data elements required for calculating clinical quality measures are not available as structured and computable database fields [7]. Similar results have been found for the clinical information required by clinical trial or cohort eligibility criteria [8]. The combination of data and rules used to specify such criteria is called a phenotyping algorithm [9]; digital phenotypes are the cornerstone of generating new knowledge from routinely collected clinical data and of a learning health care system.

The value of structured data lies in its capacity to be computed without major processing; therefore, several attempts have been made to overcome the lack of structured clinical data. A review by Shivade et al [10] reported that the most frequently used methods to automatically identify patient cohorts based on EHR phenotypes were rule-based systems, natural language processing, and machine learning techniques. In that review, a majority of studies used diagnostic codes to select eligible patients. However, although wide variations are seen, diagnostic codes frequently have poor sensitivity and specificity for determining patients’ conditions [11].

Despite current advances in the area, cohort-building systems require a significant amount of effort to develop and test. In real scenarios, the most commonly used strategy to deal with limited EHR data quality is to combine simple rules with manual verification of clinical data from patient records [12]. Thus, the field is still open to new and complementary approaches to identifying patient cohorts based on digital phenotypes.

Clinical researchers face many barriers when querying clinical databases to find patients who match a specific cohort definition. One problem is that querying clinical databases is a complex task requiring multiple interactions between clinical researchers and database experts. Among those complexities, inclusion criteria frequently define temporal patterns of clinical events, which need convoluted temporal database queries [13]. Such temporal queries are needed in up to 40% of studies [8]. Finding patients who meet certain temporal patterns of clinical events can be a barrier when systems that easily support this feature are not available, but it can also be a very powerful tool to accurately retrieve patient cohorts based on these temporal digital phenotypes. However, systems that easily support this feature are not readily available.

In this study, we systematically reviewed the critical care literature to characterize the temporal representation of inclusion criteria, interventions, and outcomes used by clinical researchers when designing a clinical study. The product of this review is a set of basic temporal entities, temporal relations, and the resulting temporal phenotype design patterns. The results can be used to inform the design of temporal abstraction-based digital phenotyping systems.

Methods

Data Source

We conducted a systematic literature review of published articles in the critical care domain. Using the Web of Science Journal Citation Reports, we selected the top 5 critical care journals according to their impact factor. Paired reviewers (MB, CD, JT, and JS) manually reviewed all publications and decided on inclusion or exclusion according to the criteria described in the following section. Disagreements were resolved by consensus.

Types of Studies Included

We included retrospective studies conducted in intensive care unit settings which used data obtained from EHRs, clinical databases generated from EHRs, or through manual chart abstractions. We excluded studies which presented exclusively outpatient or emergency department data.

Data Extraction

For every included study, paired reviewers (MB, CD, JT, and JS) used a purpose-built online form to manually identify and extract all elements characterizing the study’s inclusion criteria, the interventions or exposures being studied (or the comparison group), and the primary outcomes as defined by the original study authors, following the Patient/Population, Intervention, Comparison, Outcome (PICO) framework [14]. Each attribute was then classified according to its clinical type (diagnosis, vital sign, laboratory result, medication, etc). When these elements contained a temporal dimension as defined by Boland et al [15], they were abstracted as temporal intervals or instants. For example, if a study included patients who underwent mechanical ventilation, the inclusion criterion would be abstracted as a mechanical ventilation interval because mechanical ventilation occurs over a period of time; a single dose of antibiotics would be abstracted as a drug administration instant. Attributes that were not suitable to be represented as temporal attributes, such as sex and race, were represented as nontemporal patient attributes. A representation of the data extraction process can be seen in Figure 1.

Figure 1

Overview of the data extraction process.

When possible, if an interval or instant was itself an abstraction of lower-level concepts, it was decomposed into its parts according to the description explicitly provided or cited by the authors. If the paper gave no details, we used standard definitions when available. For example, when systemic inflammatory response syndrome [16] was used as an inclusion criterion, we abstracted its components as determined by the definition of systemic inflammatory response syndrome in use at the time of the study: body temperature, heart rate, respiratory rate, arterial CO₂ pressure, and white blood cell count. If standard definitions were not available, we did not decompose the interval and extracted it as the authors described it. Clinical events stored in free-text format, whether because they are traditionally stored in this form or because it is the only available format (eg, radiology reports or surgical protocols), were not represented in the abstractions.

To minimize variability in the data extraction process, all researchers completed an initial training period. Researchers classified the identified elements (inclusion criteria, interventions or exposures, and outcomes) using the framework described above. Discrepancies in concept extraction and temporal representation were resolved by group agreement. We computed descriptive statistics for the extracted concepts and the temporal elements obtained.

The abstraction process was conducted iteratively and continued until the point of saturation. We predefined saturation as being met when including additional studies did not add any new types of temporal elements.
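The saturation rule can be sketched as a simple stopping condition. The following is a minimal illustration only, not the procedure the reviewers actually ran: the function name, the `patience` parameter (how many consecutive studies without new types trigger the stop), and the example study contents are all assumptions.

```python
def review_until_saturation(studies, extract_types, patience=1):
    """Review studies in order until no new temporal element types appear.

    studies: iterable of study records (any shape accepted by extract_types).
    extract_types: function returning the set of element types in one study.
    patience: consecutive studies with no new types needed to stop
              (hypothetical; the source does not state a number).
    Returns (number of studies reviewed, set of types seen).
    """
    seen = set()
    reviewed = 0
    without_new = 0
    for study in studies:
        reviewed += 1
        new_types = extract_types(study) - seen
        seen |= new_types
        without_new = 0 if new_types else without_new + 1
        if without_new >= patience:
            break
    return reviewed, seen


# Hypothetical studies, each listed by the element types it contains.
example_studies = [
    ["instant"],
    ["instant", "bounded interval"],
    ["moving window interval"],
    ["instant", "bounded interval"],   # adds nothing new: saturation
    ["instant-based interval"],        # never reached with patience=1
]
reviewed, seen = review_until_saturation(example_studies, set)
```

A larger `patience` value makes the stopping rule more conservative, at the cost of reviewing more studies.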

Finally, researchers systematically documented temporal and nontemporal relationships between the identified temporal elements, which allowed us to identify the temporal query design patterns present in the literature. We then documented the functionality required for a novel temporal abstraction-based system to identify patient cohorts, interventions/exposures, and outcomes in large clinical databases.

Results

Data Extraction

After iteratively extracting clinical concepts, the point of saturation, where no new types of temporal elements were identified, was reached after reviewing 28 primary studies. We obtained a total of 362 clinical entities, of which 48.6% (n=176) were inclusion criteria, 24.3% (n=88) were interventions or exposures, and 27.1% (n=98) were outcomes. Abstracted entities were further classified into categories according to their clinical type; these categories are described, with examples, in Table 1. The therapeutic intervention (26.2%, 95/362), laboratory/diagnostic test (20.7%, 75/362), and vital sign (11.3%, 41/362) categories together covered almost 60% of all entities.

Table 1

Categories, examples, and frequencies of identified clinical entities (N=362).

Classification	Example	Count, n (%)
Therapeutic intervention	Drugs or procedures: vancomycin, orotracheal intubation	95 (26.2)
Laboratory/diagnostic tests	Serum creatinine, hematocrit	75 (20.7)
Vital signs	Body temperature, respiratory rate, central venous pressure	41 (11.3)
Diagnosis	Pneumonia, urinary infection	35 (9.7)
Patient location	Intensive care unit hospitalization, patient transfer	26 (7.2)
Clinical scores	APACHE II^a, Cerebral Performance Category	25 (6.9)
Nontemporal attribute	Sex, ethnicity	17 (4.7)
Death	In-hospital deaths, 30-day mortality	15 (4.1)
Physical examination finding	Pupil diameter, abdominal pain	11 (3.0)
Past medical history	History of trauma	7 (1.9)
Disposition	Discharge to home, institution, or other health center	5 (1.4)
Other	Appropriate antibiotic usage	10 (2.8)

^aAPACHE II: Acute Physiology And Chronic Health Evaluation II.

Temporal Entities—Instants and Intervals

Of the 362 abstracted entities, 328 could be classified as clinical instants or intervals. Most entities could be abstracted as instants (54.1%, 196/362). This type of abstraction represents a clinical event that has a timestamp but no duration. For example, one inclusion criterion in this category was “the presence of arterial lactate > 2.5 mmol/L.” A further 36.5% (132/362) of abstracted entities were intervals. This type of abstraction represents a clinical event that has a duration greater than 0, defined by a start and an end time. An example of a clinical interval is “noninvasive mechanical ventilation for at least 48 hours.”
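As an illustration of the two entity types, the sketch below models an instant and an interval as small Python data classes and evaluates the two example criteria quoted above. The class names, field names, concept labels, and sample values are hypothetical and are not taken from any existing phenotyping system.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass(frozen=True)
class Instant:
    """A clinical event with a timestamp and no duration."""
    concept: str                 # hypothetical label, e.g. "arterial_lactate"
    time: datetime
    value: Optional[float] = None


@dataclass(frozen=True)
class Interval:
    """A clinical event with a start, an end, and a duration greater than 0."""
    concept: str
    start: datetime
    end: datetime

    def __post_init__(self) -> None:
        if self.end <= self.start:
            raise ValueError("an interval needs a duration greater than 0")

    @property
    def duration(self) -> timedelta:
        return self.end - self.start


# Instant criterion: "the presence of arterial lactate > 2.5 mmol/L"
lactate = Instant("arterial_lactate", datetime(2020, 1, 1, 8, 0), value=3.1)
meets_lactate_criterion = lactate.value is not None and lactate.value > 2.5

# Interval criterion: "noninvasive mechanical ventilation for at least 48 hours"
niv = Interval("noninvasive_ventilation",
               datetime(2020, 1, 1, 9, 0), datetime(2020, 1, 3, 12, 0))
meets_niv_criterion = niv.duration >= timedelta(hours=48)
```

Rejecting zero-length intervals in `__post_init__` keeps the distinction between the two entity types explicit.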

Types of Clinical Intervals

Further analysis of clinical intervals showed that they can also be subdivided into 3 different categories:

Instant-based intervals: clinical intervals that are abstractions of identical instants. An example is a hypothermia interval, which is an abstraction of multiple instants of low body temperature measurements. Sometimes specific conditions have to be met to abstract this kind of interval, for example, a time interval during which a patient receives more than 100 mL/hour of intravenous fluids. On other occasions, the instants were used only as categorical variables, regardless of the quantity, for example, a patient receiving a normal saline infusion.

Bounded intervals: clinical intervals that are abstractions of specific instants defining their start and end times. An example of this is a hospitalization interval, where the start is defined by an admission instant and the end is defined by a discharge instant. Additional arithmetic operations may need to be applied to these intervals, for example, a clinical interval describing a hospitalization longer than 7 days.

Moving window intervals: clinical intervals where a specific condition needs to be met during a predefined window of time. An example of this was an oliguria interval, in which the condition oliguria (urinary output < 0.5 mL/kg/hour) has to be met during a 6-hour window. This denomination is consistent with previous descriptions [17].
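The first two interval types can be sketched as follows. This is a minimal illustration under stated assumptions: the 36.0 °C hypothermia threshold, the 2-hour maximum gap between measurements, the function names, and the sample data are all hypothetical, since the reviewed studies do not prescribe them.

```python
from datetime import datetime, timedelta


def instant_based_intervals(measurements, condition, max_gap):
    """Merge consecutive instants that satisfy `condition` into intervals.

    measurements: list of (timestamp, value) pairs sorted by timestamp.
    max_gap: largest gap tolerated between two instants of one interval
             (an assumption; no gap rule is given in the source).
    Returns (start, end) pairs; runs of a single instant are dropped
    because an interval must have a duration greater than 0.
    """
    runs = []
    start = last = None
    for time, value in measurements:
        qualifies = condition(value)
        if qualifies and start is not None and time - last <= max_gap:
            last = time                   # extend the open run
            continue
        if start is not None:
            runs.append((start, last))    # close the open run
            start = last = None
        if qualifies:
            start = last = time           # open a new run
    if start is not None:
        runs.append((start, last))
    return [(s, e) for s, e in runs if e > s]


def bounded_interval(start_instant, end_instant):
    """Build an interval from the two instants that bound it."""
    if end_instant <= start_instant:
        raise ValueError("the end instant must come after the start instant")
    return (start_instant, end_instant)


# Instant-based interval: hourly temperatures, hypothetical threshold 36.0 °C.
base = datetime(2020, 3, 1, 0, 0)
temperatures = [36.8, 35.2, 34.9, 35.0, 36.5, 34.8]
measurements = [(base + timedelta(hours=h), t) for h, t in enumerate(temperatures)]
hypothermia = instant_based_intervals(
    measurements, lambda t: t < 36.0, max_gap=timedelta(hours=2))

# Bounded interval: hospitalization from admission to discharge, then an
# arithmetic condition on its length ("longer than 7 days").
stay = bounded_interval(datetime(2020, 3, 1, 10, 0), datetime(2020, 3, 10, 10, 0))
longer_than_7_days = (stay[1] - stay[0]) > timedelta(days=7)
```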

Graphic examples of the 3 types of intervals are presented in Figure 2.

Figure 2

Three observed categories of clinical temporal intervals.
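A moving window interval such as the oliguria example can be sketched as a sliding check over hourly urine output. The sketch assumes hourly charting and interprets the condition as the mean output over the window falling below the threshold; both are assumptions, and the function name and sample values are hypothetical.

```python
def oliguria_windows(hourly_urine_ml, weight_kg, window_hours=6, threshold=0.5):
    """Return the start indices (hours) of windows that meet the condition.

    A window qualifies when its mean urine output, in mL/kg/hour, is below
    `threshold` (0.5 mL/kg/hour over 6 hours in the oliguria example).
    """
    starts = []
    for i in range(len(hourly_urine_ml) - window_hours + 1):
        window = hourly_urine_ml[i:i + window_hours]
        rate = sum(window) / (weight_kg * window_hours)
        if rate < threshold:
            starts.append(i)
    return starts


# Hypothetical 70 kg patient: 0.5 mL/kg/hour corresponds to 210 mL per 6 hours.
hourly_urine_ml = [60, 50, 30, 30, 30, 30, 30, 30, 80, 90]
windows = oliguria_windows(hourly_urine_ml, weight_kg=70.0)
```

In this example, the windows starting at hours 1 and 2 qualify (200 mL and 180 mL over 6 hours), whereas the others exceed 210 mL.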

Within-Interval Calculations

In a small subset of intervals, arithmetic calculations were needed to abstract them correctly. For example, abstracting an interval of pulse pressure variation (PPV) within a defined range would require calculating PPV (%) = 100 × 2 × (PP_max − PP_min)/(PP_max + PP_min) at each instant before executing the abstraction. Other within-interval calculations included counting the number of instants occurring inside an abstracted interval. An example would be an outcome defined as the number of chest x-rays performed on each patient during their stay in the intensive care unit: the interval is of type bounded (admission to and discharge from the intensive care unit), and the additional instants (chest x-rays) occurring within the interval need to be counted.
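Both kinds of within-interval calculation can be sketched in a few lines. The function names, the 10% to 30% PPV range, and the sample timestamps are hypothetical and serve only to illustrate the arithmetic.

```python
from datetime import datetime


def pulse_pressure_variation(pp_max, pp_min):
    """PPV (%) = 100 x 2 x (PP_max - PP_min) / (PP_max + PP_min)."""
    return 100 * 2 * (pp_max - pp_min) / (pp_max + pp_min)


def count_instants_within(instants, start, end):
    """Count the instants that fall inside a bounded interval."""
    return sum(1 for t in instants if start <= t <= end)


# PPV is computed at each instant, and only then compared with the range.
ppv = pulse_pressure_variation(pp_max=50.0, pp_min=40.0)
ppv_in_range = 10.0 <= ppv <= 30.0   # hypothetical range

# Chest x-rays counted within an intensive care unit stay.
icu_admission = datetime(2020, 5, 1, 8, 0)
icu_discharge = datetime(2020, 5, 6, 14, 0)
chest_xrays = [
    datetime(2020, 4, 30, 22, 0),   # before admission, not counted
    datetime(2020, 5, 1, 9, 0),
    datetime(2020, 5, 3, 7, 30),
    datetime(2020, 5, 7, 10, 0),    # after discharge, not counted
]
xray_count = count_instants_within(chest_xrays, icu_admission, icu_discharge)
```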

Temporal and Atemporal Relations

We explored the temporal relations between instants and intervals and, as expected, all of them conformed to the temporal logic described by Allen [18]. Briefly, Allen described 13 possible temporal relations between a pair of intervals. Examples of these are the before, equal, and overlap temporal relations, among others. Graphic examples are presented in Figure 3.

Figure 3

Examples of temporal relations.
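For reference, the 13 relations described by Allen [18] can be computed from the start and end points of two intervals. The sketch below is one possible encoding; the function name and the relation labels are our own choices, and both intervals are assumed to have a duration greater than 0.

```python
def allen_relation(a, b):
    """Name the Allen relation that holds between intervals a and b.

    a, b: (start, end) pairs with start < end, on any comparable time scale.
    """
    (a_start, a_end), (b_start, b_end) = a, b
    if a_end < b_start:
        return "before"
    if b_end < a_start:
        return "after"
    if a_end == b_start:
        return "meets"
    if b_end == a_start:
        return "met-by"
    if a_start == b_start and a_end == b_end:
        return "equals"
    if a_start == b_start:
        return "starts" if a_end < b_end else "started-by"
    if a_end == b_end:
        return "finishes" if a_start > b_start else "finished-by"
    if b_start < a_start and a_end < b_end:
        return "during"
    if a_start < b_start and b_end < a_end:
        return "contains"
    # The intervals share part of their extent without nesting.
    return "overlaps" if a_start < b_start else "overlapped-by"


# Hours since admission, hypothetical values.
ventilation = (2, 50)
icu_stay = (0, 120)
relation = allen_relation(ventilation, icu_stay)
```

Here `relation` evaluates to "during" because the ventilation interval lies strictly inside the intensive care unit stay.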

In addition, some intervals were constructed by combinations of Boolean relations between intervals and instants. For example, to represent a pediatric sepsis interval as defined by the International Pediatric Sepsis Consensus Conference [19], the definition used by the study authors, we needed the Boolean relation AND between 6 different instants, each of them temporally related to an instant-based interval.

Some of the extracted entities did not have a temporal component and were termed nontemporal patient attributes. Examples of these are age, race, and sex.

Finally, 5.5% (20/362) of the extracted concepts could not be represented using the proposed framework. For example, the outcome appropriate antimicrobial administration, defined as whether the isolated bacteria were susceptible to the administered antibiotic, implies a qualitative interpretation of a laboratory examination, which is outside the scope of a temporal representation of clinical entities.

Nested Queries

One additional functionality that was particularly salient was the need to perform nested queries. In a nested query, a query uses the output of another query as its input. Observational studies frequently explore the effect of a specific exposure; this study design involves creating 2 patient cohorts that are identical except for the exposure. When the outcome is assessed in both cohorts, a nested query is the most natural way to satisfy this requirement:

SELECT (Outcome Phenotype) FROM (Exposure Cohort)

In this case Outcome Phenotype and Exposure Cohort are both themselves queries.
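The same nesting can be sketched with plain Python functions, where the outcome query takes the output of a cohort query as its input. The record fields, cohort definitions, and patient data below are hypothetical and only illustrate the composition.

```python
# Hypothetical patient records; field names are illustrative only.
patients = [
    {"id": 1, "ventilated_hours": 72, "early_antibiotics": True, "died_in_hospital": False},
    {"id": 2, "ventilated_hours": 12, "early_antibiotics": True, "died_in_hospital": False},
    {"id": 3, "ventilated_hours": 96, "early_antibiotics": False, "died_in_hospital": True},
    {"id": 4, "ventilated_hours": 60, "early_antibiotics": True, "died_in_hospital": True},
]


def base_cohort(records):
    """Inclusion criterion shared by both cohorts."""
    return [p for p in records if p["ventilated_hours"] >= 48]


def exposure_cohort(records):
    """Patients in the base cohort who received the exposure."""
    return [p for p in base_cohort(records) if p["early_antibiotics"]]


def comparison_cohort(records):
    """Patients in the base cohort who did not receive the exposure."""
    return [p for p in base_cohort(records) if not p["early_antibiotics"]]


def outcome_phenotype(cohort):
    """Outcome query; its input is the output of a cohort query."""
    return [p["id"] for p in cohort if p["died_in_hospital"]]


# Nested queries: the same outcome phenotype is applied to both cohorts.
exposed_with_outcome = outcome_phenotype(exposure_cohort(patients))
unexposed_with_outcome = outcome_phenotype(comparison_cohort(patients))
```

Keeping the outcome phenotype as a separate query means that it is defined once and assessed identically in both cohorts.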

Design Patterns

The combination of temporal entities (instants and intervals), temporal relations, and nontemporal patient attributes can be used to describe the different observed patterns. A graphic description is presented in Figure 4.

Intervals can be temporally related to either instants or intervals. The same was observed for instants. We observed all 13 temporal relations described by Allen [18]. We call a pair of temporally related temporal entities (ie, Interval–Relation–Interval) a basic pattern. These basic patterns can, in turn, be related to other temporal entities or other temporal patterns. Those relations can be either temporal or through Boolean operators (AND, OR, Exclusive OR, NOT).
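One way to sketch this composition is to treat every pattern as a predicate over a patient record, so that basic patterns and Boolean combinations share the same shape and can be nested freely. All names below are hypothetical, only two of the temporal relations are implemented for brevity, and a patient is reduced to a mapping from concept to a (start, end) pair.

```python
from typing import Callable, Dict, Tuple

Patient = Dict[str, Tuple[float, float]]   # concept -> (start, end), in hours
Pattern = Callable[[Patient], bool]


def before(first: str, second: str) -> Pattern:
    """Basic pattern: `first` ends before `second` starts."""
    def check(patient: Patient) -> bool:
        if first not in patient or second not in patient:
            return False
        return patient[first][1] < patient[second][0]
    return check


def during(inner: str, outer: str) -> Pattern:
    """Basic pattern: `inner` lies strictly inside `outer`."""
    def check(patient: Patient) -> bool:
        if inner not in patient or outer not in patient:
            return False
        return (patient[outer][0] < patient[inner][0]
                and patient[inner][1] < patient[outer][1])
    return check


def all_of(*patterns: Pattern) -> Pattern:          # AND
    return lambda patient: all(p(patient) for p in patterns)


def any_of(*patterns: Pattern) -> Pattern:          # OR
    return lambda patient: any(p(patient) for p in patterns)


def exactly_one_of(a: Pattern, b: Pattern) -> Pattern:   # Exclusive OR
    return lambda patient: a(patient) != b(patient)


def negate(pattern: Pattern) -> Pattern:            # NOT
    return lambda patient: not pattern(patient)


patient = {"icu_stay": (0, 120), "intubation": (4, 5),
           "vasopressor_infusion": (10, 30)}
phenotype = all_of(
    during("intubation", "icu_stay"),
    before("intubation", "vasopressor_infusion"),
    negate(before("vasopressor_infusion", "intubation")),
)
matches = phenotype(patient)
```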

Intervals can use external variables as a condition to be met either before or after being abstracted. An example of the first case is the abstraction of an interval of reduced urinary output (<0.5 mL/kg/hour), in which each urinary output instant needs to be checked against the patient’s body weight (the external variable) before being added to the interval. An example of the second case is a total dose of prednisone of less than 10 mg/kg: this interval is abstracted from individual instants of prednisone administration and, after the interval is abstracted, it is checked against the patient’s body weight (the external variable). A final case was seen when an internal calculation, using only information contained within the interval, was needed to generate the required attribute for the interval. An example would be an interval comprising a series of chest x-rays, for which the number of x-rays is counted at the end to create the interval total number of chest x-rays per week.

Figure 4

Examples of identified clinical temporal design patterns. ICU: intensive care unit.
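The three uses of calculations in these patterns (an external variable applied before abstraction, an external variable applied after abstraction, and an internal calculation) can be sketched as follows. The function names and sample values are hypothetical, and the urinary output threshold is assumed to be the 0.5 mL/kg/hour used in the oliguria example.

```python
def low_output_instants(urine_instants, weight_kg, threshold=0.5):
    """External variable applied BEFORE abstraction.

    Each (hour, mL in that hour) instant is normalized by body weight and
    kept only if it is below the threshold in mL/kg/hour.
    """
    return [hour for hour, ml in urine_instants if ml / weight_kg < threshold]


def total_dose_below(doses_mg, weight_kg, limit_mg_per_kg):
    """External variable applied AFTER abstraction.

    The doses are first aggregated over the interval, and the total is then
    checked against body weight.
    """
    return sum(doses_mg) / weight_kg < limit_mg_per_kg


def count_within(event_days, start_day, end_day):
    """Internal calculation using only data contained within the interval."""
    return sum(1 for day in event_days if start_day <= day < end_day)


weight_kg = 80.0   # the external variable

urine_instants = [(0, 60.0), (1, 30.0), (2, 35.0), (3, 90.0)]
qualifying_hours = low_output_instants(urine_instants, weight_kg)

prednisone_doses_mg = [200.0, 200.0, 150.0]
low_total_dose = total_dose_below(prednisone_doses_mg, weight_kg, limit_mg_per_kg=10)

xray_days = [0, 2, 3, 9]
xrays_first_week = count_within(xray_days, start_day=0, end_day=7)
```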

Discussion

Main Findings

This study presents a systematic, literature-based assessment of design requirements to develop a temporal abstraction-based digital phenotyping tool. Such a tool would facilitate retrospective clinical studies in critical care using routinely collected electronic clinical data by enabling a rich description of clinical phenotypes. Once validated, these temporally abstracted digital phenotypes should be able to correctly represent patient cohorts, clinical interventions or exposures, as well as relevant clinical outcomes. The iterative nature of this review, which was conducted until reaching information saturation, adds robustness to its findings.

The initial findings of this review are consistent with previous research describing the nature of temporal clinical entities, in the form of clinical instants and intervals [20], as well as temporal relationships between these entities [18]. Other temporal abstraction-based digital phenotyping systems have been described in the past [21,22]; however, there are no reports that their development has been informed by systematically reviewing observational studies. As a consequence, this study adds 3 additional functionalities that may facilitate the creation of digital phenotypes for observational research.

First, this review shows that 3 subtypes of clinical intervals (instant-based, bounded, and moving window) are necessary to adequately represent digital phenotypes. Second, in addition to these interval subtypes, there is a need to perform calculations both within a clinical interval and with data external to the interval being abstracted. Third, nested queries need to be supported when building digital phenotypes for observational studies. Other findings of this systematic review confirm the need to query for temporal relations and Boolean relations as described by Mo et al [23] in their desiderata for digital phenotyping. Finally, it is essential to highlight the need to generate high-quality temporal metadata during routine clinical documentation because temporal queries are an essential component of digital phenotyping.

Limitations

The main limitation of this review is its focus only on intensive care studies. We chose this setting given the temporal density of clinical data collected during critical care episodes. We cannot claim that these findings will be similar in other clinical domains; that statement would need to be explicitly verified in additional studies. A second limitation is the exclusion of inclusion criteria based on free text contained in clinical notes or reports. This was an explicit decision given our goal of designing a digital phenotyping system able to abstract higher-level concepts from structured data without relying on free text. We still need to demonstrate the feasibility of this approach [24].

Abbreviations

APACHE II: Acute Physiology And Chronic Health Evaluation II
EHR: electronic health record
PICO: Patient/Population, Intervention, Comparison, Outcome
PPV: pulse pressure variation

Acknowledgments

This work was supported by DC’s CONICYT-FONDECYT (Chile) grant (no. 11130577).

Authors' Contributions

MB contributed with data analysis, manuscript write-up, and final document review. CD, JS, and JT contributed with data analysis and final document review. DC contributed with study conceptualization and design, data analysis, manuscript write-up, final document review, and decision to submit.

Conflicts of Interest

None declared.

References

1. Schäferhoff, Martinez, Ogbuoji, Sabin, Yamey. Trends in global health financing. BMJ 2019 May 20;365:l2185. doi: 10.1136/bmj.l2185. PMID: 31109918. PMCID: PMC6526392.
2. Westreich, Edwards, Lesko, Cole, Stuart. Target Validity and the Hierarchy of Study Designs. Am J Epidemiol 2019 Feb 1;188(2):438-443. doi: 10.1093/aje/kwy228. PMID: 30299451. PMCID: PMC6357801.
3. Tunis, Stryer, Clancy. Practical clinical trials: increasing the value of clinical research for decision making in clinical and health policy. JAMA 2003 Sep 24;290(12):1624-32. doi: 10.1001/jama.290.12.1624. PMID: 14506122.
4. Palabindala, Pamarthy, Jonnalagadda. Adoption of electronic health records and barriers. J Community Hosp Intern Med Perspect 2016;6(5):32643. doi: 10.3402/jchimp.v6.32643. PMID: 27802857. PMCID: PMC5089148.
5. Budrionis, Bellika. The Learning Healthcare System: Where are we now? A systematic review. J Biomed Inform 2016 Dec;64:87-92. doi: 10.1016/j.jbi.2016.09.018. PMID: 27693565.
6. Kahn, Callahan, Barnard, Bauck, Brown, Davidson, Estiri, Goerg, Holve, Johnson, Liaw, Hamilton-Lopez, Meeker, Ong, Ryan, Shang, Weiskopf, Weng, Zozus, Schilling. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. EGEMS (Wash DC) 2016;4(1):1244. doi: 10.13063/2327-9214.1244. PMID: 27713905. PMCID: PMC5051581.
7. Capurro, Yetisgen, van Eaton, Black, Tarczy-Hornoch. Availability of structured and unstructured clinical data for comparative effectiveness research and quality improvement: a multisite assessment. EGEMS (Wash DC) 2014;2(1):1079. doi: 10.13063/2327-9214.1079. PMID: 25848594. PMCID: PMC4371483.
8. Ross, Carini, Sim. Analysis of eligibility criteria complexity in clinical trials. Summit Transl Bioinform 2010 Mar 1;2010:46-50. PMID: 21347148. PMCID: PMC3041539.
9. Overby, Weng, Haerian, Perotte, Friedman, Hripcsak. Evaluation considerations for EHR-based phenotyping algorithms: A case study for drug-induced liver injury. AMIA Jt Summits Transl Sci Proc 2013;2013:130-4. PMID: 24303321. PMCID: PMC3814479.
10. Shivade, Raghavan, Fosler-Lussier, Embi, Elhadad, Johnson, Lai. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc 2014;21(2):221-30. doi: 10.1136/amiajnl-2013-001935. PMID: 24201027. PMCID: PMC3932460.
11. Khan, Ramsey, Ballard, Armstrong, Burchill, Menashe, Pantely, Broberg. Limited Accuracy of Administrative Data for the Identification and Classification of Adult Congenital Heart Disease. J Am Heart Assoc 2018 Jan 12;7(2):e007378. doi: 10.1161/JAHA.117.007378. PMID: 29330259. PMCID: PMC5850158.
12. Weng, Sim, Richesson. Formal representation of eligibility criteria: a literature review. J Biomed Inform 2010 Jun;43(3):451-67. doi: 10.1016/j.jbi.2009.12.004. PMID: 20034594. PMCID: PMC2878905.
13. Hruby, Matsoukas, Cimino, Weng. Facilitating biomedical researchers' interrogation of electronic health record data: Ideas from outside of biomedical informatics. J Biomed Inform 2016 Apr;60:376-84. doi: 10.1016/j.jbi.2016.03.004. PMID: 26972838. PMCID: PMC4837021.
14. Huang, Lin, Demner-Fushman. Evaluation of PICO as a knowledge representation for clinical questions. AMIA Annu Symp Proc 2006:359-63. PMID: 17238363. PMCID: PMC1839740.
15. Boland, Carini, Sim, Weng. EliXR-TIME: A Temporal Knowledge Representation for Clinical Research Eligibility Criteria. AMIA Jt Summits Transl Sci Proc 2012;2012:71-80. PMID: 22779055. PMCID: PMC3392056.
16. Bone, Balk, Cerra, Dellinger, Fein, Knaus, Schein, Sibbald. Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. The ACCP/SCCM Consensus Conference Committee. American College of Chest Physicians/Society of Critical Care Medicine. Chest 1992 Jun;101(6):1644-55. doi: 10.1378/chest.101.6.1644. PMID: 1303622.
17. Gall, Duftschmid, Dorda. Moving time window aggregates over patient histories. Int J Med Inform 2001 Oct;63(3):133-45. doi: 10.1016/s1386-5056(01)00164-2. PMID: 11502429.
18. Allen. Maintaining knowledge about temporal intervals. Commun. ACM 1983 Nov;26(11):832-843. doi: 10.1145/182.358434.
19. Goldstein, Giroir, Randolph; International Consensus Conference on Pediatric Sepsis. International pediatric sepsis consensus conference: definitions for sepsis and organ dysfunction in pediatrics. Pediatr Crit Care Med 2005 Jan;6(1):2-8. doi: 10.1097/01.PCC.0000149131.72248.E6. PMID: 15636651.
20. Shahar. A framework for knowledge-based temporal abstraction. Artificial Intelligence 1997 Feb;90(1-2):79-133. doi: 10.1016/s0004-3702(96)00025-2.
21. Post, Kurc, Willard, Rathod, Mansour, Pai, Torian, Agravat, Sturm, Saltz. Temporal abstraction-based clinical phenotyping with Eureka! AMIA Annu Symp Proc 2013;2013:1160-9. PMID: 24551400. PMCID: PMC3900137.
22. Mate, Bürkle, Kapsner, Toddenroth, Kampf, Sedlmayr, Castellanos, Prokosch, Kraus. A method for the graphical modeling of relative temporal constraints. J Biomed Inform 2019 Dec;100:103314. doi: 10.1016/j.jbi.2019.103314. PMID: 31629921.
23. Mo, Thompson, Rasmussen, Pacheco, Jiang, Kiefer, Zhu, Montague, Carrell, Lingren, Mentch, Wehbe, Peissig, Tromp, Larson, Chute, Pathak, Denny, Speltz, Kho, Jarvik, Bejan, Williams, Borthwick, Kitchner, Roden, Harris. Desiderata for computable representations of electronic health records-driven phenotype algorithms. J Am Med Inform Assoc 2015 Nov;22(6):1220-30. doi: 10.1093/jamia/ocv112. PMID: 26342218. PMCID: PMC4639716.
24. Hernandez-Boussard, Monda, Crespo, Riskin. Real world evidence in cardiovascular medicine: ensuring data validity in electronic health record-based studies. J Am Med Inform Assoc 2019 Nov 1;26(11):1189-1194. doi: 10.1093/jamia/ocz119. PMID: 31414700. PMCID: PMC6798570.