Abstract
Background: Electronic medical records (EMRs) contain large amounts of detailed clinical information. Using medical record review to identify conditions within large quantities of EMRs can be time-consuming and inefficient. EMR-based phenotyping using machine learning and natural language processing algorithms is a continually developing area of study that holds potential for numerous mental health disorders.
Objective: This review evaluates the current state of EMR-based case identification for depression and provides guidance on using current algorithms and constructing new ones.
Methods: A scoping review of EMR-based algorithms for phenotyping depression was completed. This research encompassed studies published from January 2000 to May 2023. The search involved 3 databases: Embase, MEDLINE, and APA PsycInfo. This was carried out using selected keywords that fell into 3 categories: terms connected with EMRs, terms connected to case identification, and terms pertaining to depression. This study adhered to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines.
Results: A total of 20 papers were assessed and summarized in the review. Most of these studies were undertaken in the United States, accounting for 75% (15/20). The United Kingdom and Spain followed, accounting for 15% (3/20) and 10% (2/20) of the studies, respectively. Both data-driven and clinical rule-based methodologies were identified. The EMR-based phenotypes and algorithms developed reflect the data accessibility permitted by each health system, which led to varying performance levels among the different algorithms.
Conclusions: Better use of structured and unstructured EMR components through techniques such as machine learning and natural language processing has the potential to improve depression phenotyping. However, more validation must be carried out to have confidence in depression case identification algorithms in general.
doi:10.2196/49781
Introduction
Background
Depression is a major contributor to the global burden of disease. It also contributes substantially to the cost of health care services, with depression treatment services costing an average of CAD $550 (US $406.12) per patient in Alberta, Canada, in the 2007/2008 fiscal year [ ]. Depression also carries a significantly higher mortality rate [ ]. Surveillance of depression in the population is necessary to understand the needs of patients and to allocate limited resources where they are most needed. This surveillance will ultimately allow health care professionals to make more targeted decisions when implementing population-level interventions.

Electronic medical records (EMRs) are a digitized collection of patient records documented by medical professionals. They contain various types of patient information, including test results, demographic data, and medication orders, recorded in structured data fields and in free-text data, such as discharge summaries and nurses' notes [ - ]. EMRs were designed to aid individual patient care but are increasingly used for other purposes, such as research and gathering data for precision public health efforts, as they are compiled in large data warehouses [ - ]. An area that will be instrumental in applying EMRs to public health is case phenotyping: developing case definitions to identify positive cases of a disorder in EMR data.

Accurate case identification in EMRs is an area where more research is needed. This is especially true for case identification of psychiatric disorders. Previous reviews of phenotyping algorithms for psychiatric disorders considered only primary care databases as their setting [ , ]. However, these are very different from inpatient EMR systems. For one, hospital inpatients are more likely to identify errors and omissions in their records than patients in outpatient or primary care [ ]. EMR data have been used in research for psychiatric patients in various inpatient use cases, including assessing patient safety events in psychiatric inpatient units [ ]. Research has also shown that hospitals with electronic psychiatric EMRs had lower readmission rates for psychiatric patients than hospitals without electronic records; similarly, hospitals where psychiatric records were accessible to nonpsychiatric physicians had lower 14- and 30-day readmission rates [ ]. In 2015, patients with a mental health diagnosis accounted for over 11% of hospital separations and 25% of hospital days [ ]. Accurate case identification of inpatient stays for this at-risk population can help identify which treatments have been most successful more efficiently than traditional research methods and could support personalization of care toward a more successful treatment plan.

Objectives
This study aims to provide an overview of existing algorithms for depression case identification in inpatient EMRs. It examines the performance of the algorithms and how they were constructed to provide guidance to those wishing to use an existing algorithm or to construct new ones.
Methods
Identifying Relevant Literature
This review followed the methodology outlined in the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) 2018 statement [ ]. First, we used the ICD-9-CM (International Classification of Diseases, Ninth Revision, Clinical Modification) codes for depression provided by Elixhauser et al [ ] to identify relevant terms and then developed a Boolean algorithm using these terms, together with terms related to EMRs and terms related to case identification. Finally, we searched the following 3 databases for peer-reviewed papers: Embase (1974 to May 2023), Ovid MEDLINE (1946 to May 2023), and APA PsycInfo (1806 to May 2023), and exported the results of the search to a reference manager program (Zotero; Corporation for Digital Scholarship and Roy Rosenzweig Center for History and New Media).
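The full list of developed search terms is provided with this paper. As a rough illustration of how the 3 keyword categories were combined, the sketch below assembles a Boolean query by OR-ing terms within each category and AND-ing the categories together; the terms shown are illustrative placeholders, not the exact terms used in our search.

```python
# Illustrative sketch only: builds a Boolean query by OR-ing terms within each
# category and AND-ing the three categories together. The terms below are
# placeholders, not the exact search terms used in this review.
emr_terms = ["electronic medical record*", "electronic health record*", "EMR", "EHR"]
case_id_terms = ["phenotyp*", "case definition*", "case identification", "algorithm*"]
depression_terms = ["depress*", "major depressive disorder", "dysthymi*"]

def or_group(terms):
    """Join terms with OR and wrap in parentheses, quoting multiword phrases."""
    quoted = [f'"{t}"' if " " in t else t for t in terms]
    return "(" + " OR ".join(quoted) + ")"

# AND the three OR-groups together to form the final query string
query = " AND ".join(or_group(g) for g in (emr_terms, case_id_terms, depression_terms))
print(query)
```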
Selecting Studies

Identified papers were screened in 2 stages. First, titles and abstracts were screened by 2 reviewers working independently to determine whether they met our established eligibility criteria. Papers were included if they were retrieved by the Boolean search and presented a case definition, involved depression and EMRs, were published between January 2000 and May 2023, and were written in English. We excluded papers that only used administrative databases, as this study focused on case phenotyping using EMR data. Next, full papers were reviewed for all abstracts that both reviewers identified as eligible. This review was also carried out by 2 reviewers working independently. To be included, studies had to use EMRs for phenotyping, had to use inpatient data, and the case definition developed had to be for depression. The inpatient data source requirement was added because of differences in coding standards between primary care and inpatient settings. Disagreements at either screening stage were resolved by consensus, and if necessary, a third reviewer was consulted. We searched the references of all included papers for additional eligible papers, which we then screened using the same criteria. The search was designed to include all papers that used an algorithm to phenotype depression with an EMR; the 2 most common methods, natural language processing (NLP) and machine learning, were included, but the search was not limited to them. The search terms were not specific to a type of algorithm or method of case identification, as the purpose was to include a broad range of variations in phenotyping methodology.

Extracting Data
We adapted an existing data extraction form (Lee et al [ ]) to collect the results of our review. Data were extracted by 1 reviewer and then confirmed by a second reviewer. Components we extracted included study characteristics (country, year, and inpatient or outpatient setting), the specific data source and details of the data, the validation methodology (eg, medical record review), and detailed descriptions of the phenotype developed, the methods used, and the purpose of the case definition. We recorded the performance of the developed algorithms as reported in each study. We recorded the elements of EMRs used, whether other databases or diagnostic codes were used, and whether AI techniques (machine learning and NLP) were used as binary variables. Finally, based on this study's primary objective, we classified each study into 1 of 3 categories (algorithm development, outcome analysis, and comorbidity analysis).
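As a rough sketch of how the extracted fields can be organized, each included paper can be represented as a single structured record like the one below; the field names are ours for illustration and do not reproduce the adapted form verbatim, and the example values are invented rather than extracted data.

```python
# Illustrative sketch of one row of the data extraction form.
# Field names and example values are illustrative only.
from dataclasses import dataclass
from typing import Optional

@dataclass
class ExtractionRecord:
    paper: str                   # first author and year
    country: str
    setting: str                 # inpatient, outpatient, or both
    data_source: str
    validation_method: str       # eg, medical record review
    uses_diagnostic_codes: bool  # binary variables recorded for each paper
    uses_structured_emr: bool
    uses_unstructured_emr: bool
    uses_machine_learning: bool
    uses_nlp: bool
    purpose: str                 # algorithm development, outcome analysis, or comorbidity analysis
    sensitivity: Optional[float] = None
    specificity: Optional[float] = None
    ppv: Optional[float] = None
    auc: Optional[float] = None

# Hypothetical example row (not data from any included study)
example = ExtractionRecord(
    paper="Example et al 2020", country="United States", setting="Inpatient",
    data_source="Hospital EMR", validation_method="Medical record review",
    uses_diagnostic_codes=True, uses_structured_emr=True, uses_unstructured_emr=False,
    uses_machine_learning=False, uses_nlp=False, purpose="Algorithm development",
    sensitivity=0.80, specificity=0.90,
)
```

Results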
Paper Screening
The database search returned a total of 854 papers. After 257 duplicates were removed, 597 abstracts remained. Then, 522 abstracts were excluded in the title and abstract screening, leaving 75 papers for full-paper review. Of these, 20 papers could not be retrieved, and 36 were excluded based on the exclusion criteria. The 19 remaining papers met all eligibility criteria and were included in the review. Further, 1 additional paper was identified from the references of the included papers, resulting in 20 papers for the review [ , - ]. A PRISMA flow diagram illustrates these steps.

Characterizing the Identified Literature
Of the 20 studies we identified, the majority occurred in the United States (15/20, 75%). The remaining studies were from the United Kingdom (3/20, 15%) and Spain (2/20, 10%). All the studies were published in 2005 or later.
Most studies looked at both inpatient and outpatient data (16/20, 80%), while fewer focused solely on inpatient data (4/20, 20%). A few studies (4/20, 20%) linked EMR data to administrative databases. These studies used structured fields of EMRs and diagnostic codes found in administrative databases. They occurred in 3 countries (United States, United Kingdom, and Spain) and were all published in 2020 or later. Another 3 studies (3/20, 15%) linked EMRs to genomic data (the Partners HealthCare Biobank, United States; the Michigan Genomics Initiative, United States; and the pediatric biorepository database of the Center for Applied Genomics at the Children's Hospital of Philadelphia, United States). In each case, the linkage was used for epidemiological analysis to find genetic associations between conditions. These characteristics are shown in the table below.
Paper reference | Country | EMR setting | Additional data sources used
Dashti et al [ ] | United States | Inpatient and outpatient | Genomic data
Dorr et al [ ] | United States | Inpatient and outpatient | None
Edgcomb et al [ ] | United States | Inpatient and outpatient | None
Estiri et al [ ] | United States | Inpatient and outpatient | None
Fang et al [ ] | United States | Inpatient and outpatient | Genomic data
Fernandes et al [ ] | United Kingdom | Inpatient and outpatient | None
Goulet et al [ ] | United States | Inpatient and outpatient | None
Hong et al [ ] | United States | Inpatient and outpatient | Administrative data
Ingram et al [ ] | United States | Inpatient and outpatient | None
Khapre et al [ ] | United Kingdom | Inpatient and outpatient | Administrative data
Mar et al [ ] | Spain | Inpatient and outpatient | Administrative data
Mason et al [ ] | United Kingdom | Inpatient and outpatient | None
Mayer et al [ ] | Spain | Inpatient and outpatient | None
McCoy et al [ ] | United States | Inpatient | None
Parthipan et al [ ] | United States | Inpatient | None
Perlis et al [ ] | United States | Inpatient and outpatient | None
Slaby et al [ ] | United States | Inpatient | Genomic data
Tvaryanas et al [ ] | United States | Inpatient and outpatient | None
Yusufov et al [ ] | United States | Inpatient and outpatient | Administrative data
Zhou et al [ ] | United States | Inpatient | None
EMR: electronic medical record.
Most of the identified studies (18/20, 90%) used diagnostic codes in their case definition for depression. The most common codes used were ICD-9 (International Classification of Diseases, Ninth Revision), followed by ICD-10 (International Classification of Diseases, Tenth Revision). In many studies, the diagnostic code case definitions were combined with structured data elements, such as patient demographics (sex, age, etc), laboratory results, medications, and procedures. For example, procedures were coded with Current Procedural Terminology codes and other types of classifications. Structured EMR data were used in 13/20 studies (65%). Fewer studies (8/20, 40%) incorporated unstructured data elements, such as clinical notes. To analyze these elements, some studies used standardized vocabularies, such as the Unified Medical Language System, to develop lists of keywords. Most studies using unstructured data used NLP techniques to analyze the free-text data in unstructured EMR fields (7/20, 35%). NLP is commonly used on free-text medical data to transform the data into a structured format that can be processed using statistical techniques and machine learning. A quarter of the identified studies (5/20, 25%) used machine learning to develop phenotyping algorithms. Machine learning models included logistic regression, random forest, and propositional rule learners.
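To make the general shape of these case definitions concrete, the following is a minimal rule-based sketch that flags a record as a depression case if it carries a depression diagnostic code, an antidepressant order, or a depression keyword in a clinical note. The specific codes, drug names, and keywords are illustrative placeholders, not any reviewed study's validated definition.

```python
import re

# Illustrative placeholders; not any reviewed study's validated case definition.
DEPRESSION_ICD9 = {"296.2", "296.3", "300.4", "311"}       # example ICD-9 codes
DEPRESSION_ICD10_PREFIXES = ("F32", "F33")                  # example ICD-10 prefixes
ANTIDEPRESSANT_EXAMPLES = {"sertraline", "fluoxetine", "citalopram"}
NOTE_KEYWORDS = re.compile(r"\b(depress\w*|major depressive disorder)\b", re.IGNORECASE)

def is_depression_case(diagnosis_codes, medications, notes):
    """Simplified rule-based phenotype: any one signal marks a positive case."""
    has_code = any(
        code in DEPRESSION_ICD9 or code.startswith(DEPRESSION_ICD10_PREFIXES)
        for code in diagnosis_codes
    )
    has_med = any(med.lower() in ANTIDEPRESSANT_EXAMPLES for med in medications)
    has_note_mention = any(NOTE_KEYWORDS.search(note) for note in notes)
    return has_code or has_med or has_note_mention

# Example with made-up record fragments
print(is_depression_case(["F32.1"], [], ["Patient reports low mood."]))  # True
```

Validated algorithms are typically more elaborate than this sketch, for example, handling negation in notes ("no evidence of depression") or requiring multiple corroborating signals.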
The table below contains details about the algorithms defined in each study.

Paper reference | Diagnostic codes? | EMR – structured? | EMR – unstructured? | ML? | NLP? | Validation methodology | Sensitivity | Specificity | PPV | AUC
Dashti et al [ ] | No | Yes | Yes | Yes | Yes | Medical record review | 0.81 | — | 0.90 | —
Dorr et al [ ] | Yes | Yes | No | No | No | Not specified | — | — | — | —
Edgcomb et al [ ] | Yes | No | No | No | No | Not specified | — | — | — | —
Estiri et al [ ] | Yes | No | No | No | No | Not specified | — | — | — | —
Fang et al [ ] | Yes | Yes | No | No | No | Not specified | — | — | — | —
Fernandes et al [ ] | Yes | Yes | No | No | No | Not specified | — | — | — | —
Goulet et al [ ] | Yes | No | No | No | No | Medical record review | 0.45 | 0.90 | — | —
Hong et al [ ] | Yes | Yes | No | Yes | No | Medical record review | — | — | — | 0.83
Ingram et al [ ] | Yes | Yes | No | No | No | Convergent validity | — | — | — | —
Khapre et al [ ] | Yes | Yes | Yes | No | Yes | Not specified | — | — | — | —
Mar et al [ ] | Yes | Yes | Yes | Yes | No | Medical record review | — | — | — | 0.80
Mason et al [ ] | Yes | No | No | No | No | Not specified | — | — | — | —
Mayer et al [ ] | Yes | Yes | No | No | No | Not specified | — | — | — | —
McCoy et al [ ] | Yes | No | No | No | No | Not specified | — | — | — | —
Parthipan et al [ ] | Yes | Yes | Yes | No | Yes | Medical record review | — | — | — | —
Perlis et al [ ] | Yes | Yes | Yes | No | No | Medical record review | 0.39 | 0.95 | 0.78 | 0.87
Slaby et al [ ] | Yes | Yes | Yes | No | Yes | Medical record review | — | — | 0.95 | —
Tvaryanas et al [ ] | Yes | No | No | No | No | Not specified | — | — | — | —
Yusufov et al [ ] | Yes | Yes | Yes | No | Yes | Medical record review | 0.85 | 0.95 | — | —
Zhou et al [ ] | No | No | Yes | Yes | Yes | Medical record review | 0.87 | 0.92 | — | —
EMR: electronic medical record.
ML: machine learning.
NLP: natural language processing.
PPV: positive predictive value.
AUC: area under the receiver operating characteristic curve.
—: not available.
Area under the precision-recall curve and F₁-score were only available for Hong et al [ ]; the best algorithm in that paper had an area under the precision-recall curve of 0.90 and an F₁-score of 0.81.

Only 9 studies (45%) conducted a medical record review to produce a reference standard against which to compare phenotyping results. Since most of the identified studies (14/20, 70%) were conducted with a larger goal, of which phenotyping depression was only a small part, many did not provide much information on their phenotyping methods. Most studies did not report any metrics measuring the diagnostic accuracy of the developed phenotyping algorithms; only 8 studies (40%) reported at least one performance metric. The 6 metrics reported were sensitivity, specificity, positive predictive value (PPV), area under the receiver operating characteristic curve, area under the precision-recall curve, and F₁-score. No studies reported negative predictive value. These metrics are displayed in the table above.

We classified each study into 1 of 3 general purposes: algorithm development, comorbidity analysis, and outcome analysis. A minority of the identified studies (6/20, 30%) were conducted for algorithm development. These studies did not look at applications of the phenotyping algorithms developed; instead, they focused on phenotyping methods and algorithm performance. The rest of the studies used a case definition for depression as a step toward a larger goal. For 9 of these studies (9/20, 45%), this goal was outcome analysis, that is, analyzing the effect of depression on patient outcomes, such as mortality, suicide attempts, and psychotherapy receipt. For the remaining studies (5/20, 25%), the goal was comorbidity analysis, examining the prevalence of depression as a comorbidity of other conditions. The comorbidities studied included HIV, hepatitis C, and cancer. Outcome analysis studies have become more prevalent in recent years: 6 were published between 2020 and 2022, up from 3 between 2000 and 2019. In addition, algorithms used for depression phenotyping in EMRs have become more prevalent since 2017.
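For reference, the diagnostic accuracy metrics reported above all derive from a 2x2 confusion matrix built against a reference standard such as medical record review. The short sketch below computes them from raw counts; this is standard arithmetic, not code taken from any of the reviewed studies.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy metrics from a 2x2 confusion matrix
    (algorithm result vs a reference standard such as medical record review)."""
    sensitivity = tp / (tp + fn)   # recall: true cases the algorithm finds
    specificity = tn / (tn + fp)   # noncases correctly ruled out
    ppv = tp / (tp + fp)           # precision: flagged records that are true cases
    npv = tn / (tn + fn)           # negative predictive value (not reported by any included study)
    f1 = 2 * ppv * sensitivity / (ppv + sensitivity)
    return {"sensitivity": sensitivity, "specificity": specificity,
            "ppv": ppv, "npv": npv, "f1": f1}

# Example with made-up counts from a hypothetical chart review of 200 records
print(diagnostic_metrics(tp=40, fp=10, fn=10, tn=140))
```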
Discussion
Principal Results
In this review, we found 20 papers describing phenotyping algorithms for depression in inpatient EMR data. Most of these algorithms were case definitions using diagnostic codes, specifically ICD-9. This reflects that ICD (International Classification of Diseases) codes are commonly used for billing purposes in the United States and are the most frequently used diagnostic codes in EMRs worldwide [ ]. ICD-coded data are thus widely available, making them a practical choice when developing a case definition. However, case definitions using diagnostic codes achieved worse sensitivity than algorithms that used only other fields of EMRs. Many algorithms also used structured EMR data [ , , , , , - , , , , ], but fewer used unstructured data [ , , , , , , , ]. NLP and machine learning techniques were used by a minority of algorithms (NLP [ , , , , , ] and machine learning [ , , , ]). These types of machine learning applications are relatively new and are receiving much attention from researchers [ ]. The algorithms that used machine learning performed well on all the metrics they reported (sensitivity 0.81-0.87, specificity 0.92, PPV 0.90, and area under the receiver operating characteristic curve 0.80-0.83). This suggests that the information in free-text EMR data is valuable for developing accurate phenotyping algorithms. It also supports the effectiveness of machine learning techniques for phenotyping of depression. This is likely an area that will be explored further in future research.
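As an illustration of the general approach (not a reproduction of any algorithm from the included studies), the sketch below shows one minimal way that free-text notes can feed a supervised classifier, using a TF-IDF bag-of-words representation and logistic regression in scikit-learn, under the assumption that chart-review labels are available for training.

```python
# Minimal sketch of NLP-plus-machine-learning phenotyping: represent clinical
# notes as TF-IDF features and train a logistic regression classifier against
# chart-review labels. Illustrative only; not any reviewed study's pipeline.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up toy notes and chart-review labels (1 = depression case, 0 = noncase)
notes = [
    "Patient endorses persistent low mood and anhedonia; started sertraline.",
    "Admitted for pneumonia; no psychiatric history documented.",
    "Follow-up for major depressive disorder, symptoms improving on therapy.",
    "Routine postoperative note; mood and affect within normal limits.",
]
labels = [1, 0, 1, 0]

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
model.fit(notes, labels)

# Predicted probability of depression for a new note
print(model.predict_proba(["Worsening depressive symptoms despite medication."])[0, 1])
```

In practice, such a classifier would be trained on a much larger labeled corpus and evaluated against a held-out reference standard before being used for phenotyping.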
Many of the papers we found did not include a medical record review. If algorithms are not validated against a reference standard, such as a medical record review, their accuracy remains unknown. Most papers also did not report metrics measuring the validity of the algorithms developed. This limits the potential of these algorithms for application in precision health care. Conducting validation studies on the algorithms presented in these papers would make them more rigorous. Of the papers that did report metrics, few reported sensitivity, specificity, and PPV together. This could result in skewed interpretations of phenotype performance, as, for instance, a high sensitivity may come at the cost of a low PPV (or vice versa).

Based on the validity reported in these papers, EMRs appear promising as a phenotyping tool for depression; however, few studies have reported metrics of the diagnostic accuracy of EMR algorithms, especially metrics comprehensive enough to fully assess performance. Future validation studies conducted on existing case definitions would be valuable in establishing their validity and in bringing these types of phenotyping algorithms to the attention of medical professionals and public health analysts. Machine learning and NLP are small but growing areas within phenotyping research. More work could be carried out using these techniques on the unstructured fields of EMRs, alone or in combination with other fields. Finally, as most of the studies we found were performed in the United States on US EMR data, it remains to be determined how generalizable the identified case definitions are to data recorded in other jurisdictions. Both the standards of care and the methods of reporting diagnoses vary widely between health care systems, which could result in an algorithm being valid only in the region in which it was developed. There is a need for further research validating existing case definitions across health care regions or creating new case definitions specific to the EMR systems of other countries.
Limitations
Some relevant papers may have been missed, as we only searched 3 databases. It is also possible that our search terms were not sufficiently broad to return every pertinent paper. We also only considered peer-reviewed papers, not gray literature. However, we developed our search strategy in consultation with librarians and experts in the field with experience performing scoping reviews. For these reasons, we believe our search was sufficient to find papers for the review.
Conclusions
We examined current algorithms for phenotyping depression in inpatient EMRs. This is an area in which more research needs to be performed. It is difficult to accurately identify cases of depression in EMR data because depression is inconsistently coded, as there is some subjectivity in its diagnosis. Diagnostic codes are primarily used in the algorithms we found, but machine learning on free-text data has recently achieved promising results. Most of the algorithms were developed in the United States; how well they will perform on data from other jurisdictions is yet to be known. In addition, many identified algorithms have yet to be validated against a reference standard, or their performance was not reported. To be useful for public health research, case definitions must be validated; this is an area in which future work is needed. From this study, we conclude that EMRs have the potential to provide valuable insight into the indicators of depression, as well as its prevalence, common comorbidities, and associated outcomes. Future research into applying machine learning and NLP techniques on unstructured EMR data and studies to ascertain the validity and generalizability of existing phenotyping algorithms will be valuable in establishing EMR-based case phenotyping as a reliable tool in precision public health.
Acknowledgments
We are grateful to Natalie Wiebe, MSc, for her help developing the search strategy; to Seungwon Lee, PhD, for creating the data extraction form; and to Oliver Slater-Kinghorn for helping to screen papers. This work is supported by a Foundation Grant, led by HQ, through the Canadian Institutes of Health Research.
Conflicts of Interest
None declared.
Developed search terms.
DOC File, 38 KB

Summary spreadsheet of identified papers.
XLS File, 54 KB

PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) checklist.
PDF File, 531 KB

References
- Slomp M, Jacobs P, Ohinmaa A, et al. The distribution of mental health service costs for depression in the Alberta population. Can J Psychiatry. Sep 2012;57(9):564-569. [CrossRef]
- Chiu M, Vigod S, Rahman F, Wilton AS, Lebenbaum M, Kurdyak P. Mortality risk associated with psychological distress and major depression: a population-based cohort study. J Affect Disord. Jul 2018;234:117-123. [CrossRef] [Medline]
- Offerman S, Rauchwerger A, Nishijima D, et al. Use of an electronic medical record “dotphrase” data template for a prospective head injury study. West JEM. Mar 1, 2013;14(2):109-113. [CrossRef]
- Cohen S, Jannot AS, Iserin L, Bonnet D, Burgun A, Escudié JB. Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records. Arch Cardiovasc Dis. Jan 2019;112(1):31-43. [CrossRef] [Medline]
- Greiver M, Barnsley J, Glazier RH, Harvey BJ, Moineddin R. Measuring data reliability for preventive services in electronic medical records. BMC Health Serv Res. May 14, 2012;12:116. [CrossRef] [Medline]
- Perlis RH, Iosifescu DV, Castro VM, et al. Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model. Psychol Med. Jan 2012;42(1):41-50. [CrossRef] [Medline]
- LaFleur J, McAdam-Marx C, Alder SS, et al. Clinical risk factors for fracture among postmenopausal patients at risk for fracture: a historical cohort study using electronic medical record data. J Bone Miner Metab. Mar 2011;29(2):193-200. [CrossRef] [Medline]
- Patel RC, Amorim G, Jakait B, et al. Pregnancies among women living with HIV using contraceptives and antiretroviral therapy in Western Kenya: a retrospective, cohort study. BMC Med. Aug 13, 2021;19(1):178. [CrossRef] [Medline]
- Canfell OJ, Kodiyattu Z, Eakin E, et al. Real-world data for precision public health of noncommunicable diseases: a scoping review. BMC Public Health. Nov 24, 2022;22(1):2166. [CrossRef] [Medline]
- Carreira H, Williams R, Strongman H, Bhaskaran K. Identification of mental health and quality of life outcomes in primary care databases in the UK: A systematic review. BMJ Open. Jul 2019;9(7):e029227. [CrossRef]
- Larvin H, Peckham E, Prady SL. Case-finding for common mental disorders in primary care using routinely collected data: a systematic review. Soc Psychiatry Psychiatr Epidemiol. Oct 2019;54(10):1161-1175. [CrossRef] [Medline]
- Wang B, Kristiansen E, Fagerlund AJ, et al. Users’ experiences with online access to electronic health records in mental and somatic health care: cross-sectional study. J Med Internet Res. Dec 25, 2023;25:e47840. [CrossRef] [Medline]
- Marcus SC, Hermann RC, Frankel MR, Cullen SW. Safety of psychiatric inpatients at the Veterans Health Administration. Psychiatr Serv. Feb 1, 2018;69(2):204-210. [CrossRef] [Medline]
- Kozubal DE, Samus QM, Bakare AA, et al. Separate may not be equal: a preliminary investigation of clinical correlates of electronic psychiatric record accessibility in academic medical centers. Int J Med Inform. Apr 2013;82(4):260-267. [CrossRef] [Medline]
- Johansen H, Finès P. Acute care hospital days and mental diagnoses. Health Rep. 2012;23(4):1-7. URL: https://www150.statcan.gc.ca/n1/pub/82-003-x/2012004/article/11761-eng.htm [Accessed 2024-10-08]
- Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. PLoS Med. Mar 2021;18(3):e1003583. [CrossRef] [Medline]
- Elixhauser A, Steiner C, Harris DR, Coffey RM. Comorbidity measures for use with administrative data. Med Care. Jan 1998;36(1):8-27. [CrossRef] [Medline]
- Lee S, Doktorchik C, Martin EA, et al. Electronic medical record–based case phenotyping for the Charlson conditions: scoping review. JMIR Med Inform. Feb 1, 2021;9(2):e23934. [CrossRef] [Medline]
- Dashti HS, Redline S, Saxena R. Polygenic risk score identifies associations between sleep duration and diseases determined from an electronic medical record biobank. Sleep. Mar 1, 2019;42(3):zsy247. [CrossRef]
- Dorr DA, Quiñones AR, King T, Wei MY, White K, Bejan CA. Prediction of future health care utilization through note-extracted psychosocial factors. Med Care. 2022;60(8):570-578. [CrossRef]
- Edgcomb JB, Thiruvalluru R, Pathak J, Brooks JO. Machine learning to differentiate risk of suicide attempt and self-harm after general medical hospitalization of women with mental illness. Med Care. Feb 1, 2021;59:S58-S64. [CrossRef] [Medline]
- Estiri H, Strasser ZH, Klann JG, Naseri P, Wagholikar KB, Murphy SN. Predicting COVID-19 mortality with electronic medical records. NPJ Digit Med. Feb 4, 2021;4(1):15. [CrossRef] [Medline]
- Fang Y, Fritsche LG, Mukherjee B, Sen S, Richmond-Rakerd LS. Polygenic liability to depression is associated with multiple medical conditions in the electronic health record: phenome-wide association study of 46,782 individuals. Biol Psychiatry Cogn Neurosci Neuroimaging. Dec 2022;92(12):923-931. [CrossRef]
- Fernandes AC, Chandran D, Khondoker M, et al. Demographic and clinical factors associated with different antidepressant treatments: a retrospective cohort study design in a UK psychiatric healthcare setting. BMJ Open. Sep 2018;8(9):e022170. [CrossRef]
- Goulet JL, Fultz SL, McGinnis KA, Justice AC. Relative prevalence of comorbidities and treatment contraindications in HIV-mono-infected and HIV/HCV-co-infected veterans. AIDS. Oct 2005;19 Suppl 3:S99-105. [CrossRef] [Medline]
- Hong C, Rush E, Liu M, et al. Clinical knowledge extraction via sparse embedding regression (KESER) with multi-center large scale electronic health record data. NPJ Digit Med. Oct 27, 2021;4(1):151. [CrossRef] [Medline]
- Ingram WM, Baker AM, Bauer CR, et al. Defining major depressive disorder cohorts using the EHR: multiple phenotypes based on ICD-9 codes and medication orders. Neurol Psychiatry Brain Res. Jun 2020;36:18-26. [CrossRef] [Medline]
- Khapre S, Stewart R, Taylor C. An evaluation of symptom domains in the 2 years before pregnancy as predictors of relapse in the perinatal period in women with severe mental illness. Eur Psychiatr. 2021;64(1):e26. [CrossRef]
- Mar J, Gorostiza A, Ibarrondo O, et al. Validation of random forest machine learning models to predict dementia-related neuropsychiatric symptoms in real-world data. J Alzheimers Dis. 2020;77(2):855-864. [CrossRef] [Medline]
- Mason A, Irving J, Pritchard M, et al. Association between depressive symptoms and cognitive-behavioural therapy receipt within a psychosis sample: a cross-sectional study. BMJ Open. May 10, 2022;12(5):e051873. [CrossRef] [Medline]
- Mayer MA, Gutierrez-Sacristan A, Leis A, De La Peña S, Sanz F, Furlong LI. Using electronic health records to assess depression and cancer comorbidities. In: Informatics for Health: Connected Citizen-Led Wellness and Population Health. IOS Press; 2017:236-240. [CrossRef]
- McCoy TH, Yu S, Hart KL, et al. High throughput phenotyping for dimensional psychopathology in electronic health records. Biol Psychiatry. Jun 15, 2018;83(12):997-1004. [CrossRef] [Medline]
- Parthipan A, Banerjee I, Humphreys K, et al. Predicting inadequate postoperative pain management in depressed patients: a machine learning approach. PLoS One. 2019;14(2):e0210575. [CrossRef] [Medline]
- Slaby I, Hain HS, Abrams D, et al. An electronic health record (EHR) phenotype algorithm to identify patients with attention deficit hyperactivity disorders (ADHD) and psychiatric comorbidities. J Neurodev Disord. Jun 11, 2022;14(1):37. [CrossRef] [Medline]
- Tvaryanas AP, Maupin GM. Risk of incident mental health conditions among critical care air transport team members. Aviat Space Environ Med. Jan 2014;85(1):30-38. [CrossRef] [Medline]
- Yusufov M, Pirl WF, Braun I, Tulsky JA, Lindvall C. Natural language processing for computer-assisted chart review to assess documentation of substance use and psychopathology in heart failure patients awaiting cardiac resynchronization therapy. J Pain Symptom Manage. Oct 2022;64(4):400-409. [CrossRef]
- Zhou L, Baughman AW, Lei VJ, et al. Identifying patients with depression using free-text clinical documents. Stud Health Technol Inform. 2015;216:629-633. [Medline]
- O’Malley KJ, Cook KF, Price MD, Wildes KR, Hurdle JF, Ashton CM. Measuring diagnoses: ICD code accuracy. Health Serv Res. Oct 2005;40(5 Pt 2):1620-1639. [CrossRef] [Medline]
- Le Glaz A, Haralambous Y, Kim-Dufor DH, et al. Machine learning and natural language processing in mental health: systematic review. J Med Internet Res. May 4, 2021;23(5):e15708. [CrossRef] [Medline]
Abbreviations
EMR: electronic medical record
ICD: International Classification of Diseases
ICD-10: International Classification of Diseases, Tenth Revision
ICD-9: International Classification of Diseases, Ninth Revision
ICD-9-CM: International Classification of Diseases, Ninth Revision, Clinical Modification
NLP: natural language processing
PPV: positive predictive value
PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews
Edited by Christian Lovis; submitted 14.06.23; peer-reviewed by Katie Allen, Liz Herrle; final revised version received 05.07.24; accepted 07.07.24; published 14.10.24.
Copyright© Allison Grothman, William J Ma, Kendra G Tickner, Elliot A Martin, Danielle A Southern, Hude Quan. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 14.10.2024.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.