%0 Journal Article %@ 2291-9694 %I JMIR Publications %V 4 %N 4 %P e39 %T A Predictive Model for Medical Events Based on Contextual Embedding of Temporal Sequences %A Farhan,Wael %A Wang,Zhimu %A Huang,Yingxiang %A Wang,Shuang %A Wang,Fei %A Jiang,Xiaoqian %+ Health Sciences, Department of Biomedical Informatics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, United States, 1 858 822 4931, wyfarhan@gmail.com %K clinical decision support %K early classification %K temporal phenotyping %K clinical event context embedding %D 2016 %7 25.11.2016 %9 Original Paper %J JMIR Med Inform %G English %X Background: Medical concepts are inherently ambiguous and error-prone due to human fallibility, which makes it hard for them to be fully used by classical machine learning methods (eg, for tasks like early stage disease prediction). Objective: Our work was to create a new machine-friendly representation that resembles the semantics of medical concepts. We then developed a sequential predictive model for medical events based on this new representation. Methods: We developed novel contextual embedding techniques to combine different medical events (eg, diagnoses, prescriptions, and labs tests). Each medical event is converted into a numerical vector that resembles its “semantics,” via which the similarity between medical events can be easily measured. We developed simple and effective predictive models based on these vectors to predict novel diagnoses. Results: We evaluated our sequential prediction model (and standard learning methods) in estimating the risk of potential diseases based on our contextual embedding representation. Our model achieved an area under the receiver operating characteristic (ROC) curve (AUC) of 0.79 on chronic systolic heart failure and an average AUC of 0.67 (over the 80 most common diagnoses) using the Medical Information Mart for Intensive Care III (MIMIC-III) dataset. Conclusions: We propose a general early prognosis predictor for 80 different diagnoses. Our method computes numeric representation for each medical event to uncover the potential meaning of those events. Our results demonstrate the efficiency of the proposed method, which will benefit patients and physicians by offering more accurate diagnosis. %M 27888170 %R 10.2196/medinform.5977 %U http://medinform.jmir.org/2016/4/e39/ %U https://doi.org/10.2196/medinform.5977 %U http://www.ncbi.nlm.nih.gov/pubmed/27888170