Published in Vol 8, No 2 (2020): February

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/16492.
Analyzing Medical Research Results Based on Synthetic Data and Their Relation to Real Data Results: Systematic Comparison From Five Observational Studies

Journals

  1. Rankin D, Black M, Bond R, Wallace J, Mulvenna M, Epelde G. Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing. JMIR Medical Informatics 2020;8(7):e18910
  2. Gillies C, Taylor D, Cummings B, Ansari S, Islim F, Kronick S, Medlin R, Ward K. Demonstrating the consequences of learning missingness patterns in early warning systems for preventative health care: A novel simulation and solution. Journal of Biomedical Informatics 2020;110:103528
  3. Azizi Z, Zheng C, Mosquera L, Pilote L, El Emam K. Can synthetic data be a proxy for real clinical trial data? A validation study. BMJ Open 2021;11(4):e043497
  4. Tayefi M, Ngo P, Chomutare T, Dalianis H, Salvi E, Budrionis A, Godtliebsen F. Challenges and opportunities beyond structured data in analysis of electronic health records. WIREs Computational Statistics 2021;13(6)
  5. Vourganas I, Stankovic V, Stankovic L. Individualised Responsible Artificial Intelligence for Home-Based Rehabilitation. Sensors 2020;21(1):2
  6. Jeon S, Seo J, Kim S, Lee J, Kim J, Sohn J, Moon J, Joo H. Proposal and Assessment of a De-Identification Strategy to Enhance Anonymity of the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) in a Public Cloud-Computing Environment: Anonymization of Medical Data Using Privacy Models. Journal of Medical Internet Research 2020;22(11):e19597
  7. Dankar F, Ibrahim M. Fake It Till You Make It: Guidelines for Effective Synthetic Data Generation. Applied Sciences 2021;11(5):2158
  8. Foraker R, Yu S, Gupta A, Michelson A, Pineda Soto J, Colvin R, Loh F, Kollef M, Maddox T, Evanoff B, Dror H, Zamstein N, Lai A, Payne P. Spot the difference: comparing results of analyses from real patient data and synthetic derivatives. JAMIA Open 2021;3(4):557
  9. Korytny A, Klein A, Marcusohn E, Freund Y, Neuberger A, Raz A, Miller A, Epstein D. Hypocalcemia is associated with adverse clinical course in patients with upper gastrointestinal bleeding. Internal and Emergency Medicine 2021;16(7):1813
  10. Kaur D, Sobiesk M, Patil S, Liu J, Bhagat P, Gupta A, Markuzon N. Application of Bayesian networks to generate synthetic health data. Journal of the American Medical Informatics Association 2021;28(4):801
  11. Epstein D, Solomon N, Korytny A, Marcusohn E, Freund Y, Avrahami R, Neuberger A, Raz A, Miller A. Association between ionised calcium and severity of postpartum haemorrhage: a retrospective cohort study. British Journal of Anaesthesia 2021;126(5):1022
  12. Maweu B, Shamsuddin R, Dakshit S, Prabhakaran B. Generating Healthcare Time Series Data for Improving Diagnostic Accuracy of Deep Neural Networks. IEEE Transactions on Instrumentation and Measurement 2021;70:1
  13. Pereira T, Morgado J, Silva F, Pelter M, Dias V, Barros R, Freitas C, Negrão E, Flor de Lima B, Correia da Silva M, Madureira A, Ramos I, Hespanhol V, Costa J, Cunha A, Oliveira H. Sharing Biomedical Data: Strengthening AI Development in Healthcare. Healthcare 2021;9(7):827
  14. Weber Y, Epstein D, Miller A, Segal G, Berger G. Association of Low Alanine Aminotransferase Values with Extubation Failure in Adult Critically Ill Patients: A Retrospective Cohort Study. Journal of Clinical Medicine 2021;10(15):3282
  15. Bahouth F, Elias A, Ghersin I, Khoury E, Bar O, Sholy H, Khoury J, Azzam Z. The prognostic value of heart rate at discharge in acute decompensation of heart failure with reduced ejection fraction. ESC Heart Failure 2022;9(1):585
  16. Borreda I, Zukermann R, Epstein D, Marcusohn E. IV Sodium Ferric Gluconate Complex in Patients Hospitalized Due to Acute Decompensated Heart Failure and Iron Deficiency. Journal of Cardiovascular Pharmacology and Therapeutics 2022;27
  17. Gorelik Y, Bloch-Isenberg N, Hashoul S, Heyman S, Khamaisi M. Hyperglycemia on Admission Predicts Acute Kidney Failure and Renal Functional Recovery among Inpatients. Journal of Clinical Medicine 2021;11(1):54
  18. Baumfeld Andre E, Carrington N, Siami F, Hiatt J, McWilliams C, Hiller C, Surinach A, Zamorano A, Pashos C, Schulz W. The Current Landscape and Emerging Applications for Real‐World Data in Diagnostics and Clinical Decision Support and its Impact on Regulatory Decision Making. Clinical Pharmacology & Therapeutics 2022;112(6):1172
  19. Marcusohn E, Gibory I, Miller A, Lipsky A, Neuberger A, Epstein D. The association between the degree of fever as measured in the emergency department and clinical outcomes of hospitalized adult patients. The American Journal of Emergency Medicine 2022;52:92
  20. Deniz-Garcia A, Fabelo H, Rodriguez-Almeida A, Zamora-Zamorano G, Castro-Fernandez M, Alberiche Ruano M, Solvoll T, Granja C, Schopf T, Callico G, Soguero-Ruiz C, Wägner A. Quality, Usability, and Effectiveness of mHealth Apps and the Role of Artificial Intelligence: Current Scenario and Challenges. Journal of Medical Internet Research 2023;25:e44030
  21. Kharya S, Soni S, Swarnkar T. Generation of synthetic datasets using weighted bayesian association rules in clinical world. International Journal of Information Technology 2022;14(6):3245
  22. Thomas J, Foraker R, Zamstein N, Morrow J, Payne P, Wilcox A, Haendel M, Chute C, Gersing K, Walden A, Bennett T, Eichmann D, Guinney J, Kibbe W, Liu H, Pfaff E, Robinson P, Saltz J, Spratt H, Starren J, Suver C, Williams A, Wu C, Gabriel D, Hong S, Kostka K, Lehmann H, Moffitt R, Morris M, Palchuk M, Zhang X, Zhu R, Amor B, Bissell M, Clark M, Girvin A, Lee A, Miller R, Walters K, Chae Y, Cook C, Dest A, Dietz R, Dillon T, Francis P, Fuentes R, Graves A, McMurry J, Neumann A, O'Neil S, Sheikh U, Volz A, Zampino E, Austin C, Bozzette S, Deacy M, Garbarini N, Kurilla M, Michael S, Rutter J, Temple-O'Connor M, Bradwell K, Manna A, Qureshi N, Saltz M, Bramante C, Harper J, Hernandez W, Koraishy F, Mariona F, Mattapally S, Saha A, Vedula S, Fu Y, Mathews N, Mendelevitch O. Demonstrating an approach for evaluating synthetic geospatial and temporal epidemiologic data utility: results from analyzing >1.8 million SARS-CoV-2 tests in the United States National COVID Cohort Collaborative (N3C). Journal of the American Medical Informatics Association 2022;29(8):1350
  23. Greenberg J, Landman J, Kelly M, Pennicooke B, Molina C, Foraker R, Ray W. Leveraging Artificial Intelligence and Synthetic Data Derivatives for Spine Surgery Research. Global Spine Journal 2023;13(8):2409
  24. Gonzales A, Guruswamy G, Smith S, Johnson A. Synthetic data in health care: A narrative review. PLOS Digital Health 2023;2(1):e0000082
  25. Foraker R, Guo A, Thomas J, Zamstein N, Payne P, Wilcox A. The National COVID Cohort Collaborative: Analyses of Original and Computationally Derived Electronic Health Record Data. Journal of Medical Internet Research 2021;23(10):e30697
  26. Nakhleh A, Saiegh L, Shehadeh N, Weintrob N, Sheikh-Ahmad M, Supino-Rosin L, Alboim S, Gendelman R, Zloczower M. Screening for non-classic congenital adrenal hyperplasia in women: New insights using different immunoassays. Frontiers in Endocrinology 2023;13
  27. Shi J, Wang D, Tesei G, Norgeot B. Generating high-fidelity privacy-conscious synthetic patient data for causal effect estimation with multiple treatments. Frontiers in Artificial Intelligence 2022;5
  28. Brzezinski R, Melloul A, Berliner S, Goldiner I, Stark M, Rogowski O, Banai S, Shenhar-Tsarfaty S, Shacham Y. Early Detection of Inflammation-Prone STEMI Patients Using the CRP Troponin Test (CTT). Journal of Clinical Medicine 2022;11(9):2453
  29. Shor R, Barak A, Loewenstein A, Shahar-Gonen M, Goldstein M, Gamzu R, Zur D. Is There a Dose-Response Relationship? Real-World Outcomes of Anti-Vascular Endothelial Growth Factor Treatment in Neovascular Age-Related Macular Degeneration. Ophthalmologica 2022;245(5):395
  30. Marcusohn E, Reiner Benaim A, Ronen S, Kerner A, Beyar R, Almog R. Door to balloon time in primary percutaneous coronary intervention in ST elevation myocardial infarction: every minute counts. Coronary Artery Disease 2022;33(5):341
  31. DEVECİ A, ESEN M. Medikal Sentetik Veri Üretimiyle Veri Dengelemesi [Data Balancing with Medical Synthetic Data Generation]. İstatistik ve Uygulamalı Bilimler Dergisi 2022;(5):17
  32. Elias A, Korytny A, Klein A, Khoury Y, Ben Hur D, Braun E, Azzam Z, Ghersin I. The Association Between Opioid Use and Opioid Type and the Clinical Course and Outcomes of Acute Pancreatitis. Pancreas 2022;51(5):523
  33. Brzezinski R, Rabin N, Lewis N, Peled R, Kerpel A, Tsur A, Gendelman O, Naftali-Shani N, Gringauz I, Amital H, Leibowitz A, Mayan H, Ben-Zvi I, Heller E, Shechtman L, Rogowski O, Shenhar-Tsarfaty S, Konen E, Marom E, Ironi A, Rahav G, Zimmer Y, Grossman E, Ovadia-Blechman Z, Leor J, Hoffer O. Automated processing of thermal imaging to detect COVID-19. Scientific Reports 2021;11(1)
  34. Boedihardjo M, Strohmer T, Vershynin R. Private Sampling: A Noiseless Approach for Generating Differentially Private Synthetic Data. SIAM Journal on Mathematics of Data Science 2022;4(3):1082
  35. Walonoski J, Hall D, Bates K, Farris M, Dagher J, Downs M, Sivek R, Wellner B, Gregorowicz A, Hadley M, Campion F, Levine L, Wacome K, Emmer G, Kemmer A, Malik M, Hughes J, Granger E, Russell S. The “Coherent Data Set”: Combining Patient Data and Imaging in a Comprehensive, Synthetic Health Record. Electronics 2022;11(8):1199
  36. Braddon A, Robinson S, Alati R, Betts K. Exploring the utility of synthetic data to extract more value from sensitive health data assets: A focused example in perinatal epidemiology. Paediatric and Perinatal Epidemiology 2023;37(4):292
  37. El Emam K, Mosquera L, Fang X. Validating a membership disclosure metric for synthetic health data. JAMIA Open 2022;5(4)
  38. El Emam K, Mosquera L, Fang X, El-Hussuna A. Utility Metrics for Evaluating Synthetic Health Data Generation Methods: Validation Study. JMIR Medical Informatics 2022;10(4):e35734
  39. La Salvia M, Torti E, Leon R, Fabelo H, Ortega S, Martinez-Vega B, Callico G, Leporati F. Deep Convolutional Generative Adversarial Networks to Enhance Artificial Intelligence in Healthcare: A Skin Cancer Application. Sensors 2022;22(16):6145
  40. Arora A, Arora A, V E S. Machine learning models trained on synthetic datasets of multiple sample sizes for the use of predicting blood pressure from clinical data in a national dataset. PLOS ONE 2023;18(3):e0283094
  41. Mosquera L, El Emam K, Ding L, Sharma V, Zhang X, Kababji S, Carvalho C, Hamilton B, Palfrey D, Kong L, Jiang B, Eurich D. A method for generating synthetic longitudinal health data. BMC Medical Research Methodology 2023;23(1)
  42. García-Vicente C, Chushig-Muzo D, Mora-Jiménez I, Fabelo H, Gram I, Løchen M, Granja C, Soguero-Ruiz C. Evaluation of Synthetic Categorical Data Generation Techniques for Predicting Cardiovascular Diseases and Post-Hoc Interpretability of the Risk Factors. Applied Sciences 2023;13(7):4119
  43. Segerstrom S, Diefenbach M, Hamilton K, O’Connor D, Tomiyama A. Open Science in Health Psychology and Behavioral Medicine: A Statement From the Behavioral Medicine Research Council. Psychosomatic Medicine 2023;85(4):298
  44. Davis S, Ssemaganda H, Koola J, Mao J, Westerman D, Speroff T, Govindarajulu U, Ramsay C, Sedrakyan A, Ohno-Machado L, Resnic F, Matheny M. Simulating complex patient populations with hierarchical learning effects to support methods development for post-market surveillance. BMC Medical Research Methodology 2023;23(1)
  45. Segerstrom S, Diefenbach M, Hamilton K, O’Connor D, Tomiyama A, Bacon S, Bennett G, Brondolo E, Czajkowski S, Davidson K, Epel E, Revenson T, Ruiz J. Open Science in Health Psychology and Behavioral Medicine: A Statement From the Behavioral Medicine Research Council. Annals of Behavioral Medicine 2023;57(5):357
  46. Ganguli R, Lad R, Lin A, Yu X. Novel Generative Recurrent Neural Network Framework to Produce Accurate, Applicable, and Deidentified Synthetic Medical Data for Patients With Metastatic Cancer. JCO Clinical Cancer Informatics 2023;(7)
  47. Fonseca J, Bacao F. Tabular and latent space synthetic data generation: a literature review. Journal of Big Data 2023;10(1)
  48. Azizi Z, Lindner S, Shiba Y, Raparelli V, Norris C, Kublickiene K, Herrero M, Kautzky-Willer A, Klimek P, Gisinger T, Pilote L, El Emam K. A comparison of synthetic data generation and federated analysis for enabling international evaluations of cardiovascular health. Scientific Reports 2023;13(1)
  49. Giuffrè M, Shung D. Harnessing the power of synthetic data in healthcare: innovation, application, and privacy. npj Digital Medicine 2023;6(1)
  50. Zuber S, Bechtiger L, Bodelet J, Golin M, Heumann J, Kim J, Klee M, Mur J, Noll J, Voll S, O’Keefe P, Steinhoff A, Zölitz U, Muniz-Terrera G, Shanahan L, Shanahan M, Hofer S. An integrative approach for the analysis of risk and health across the life course: challenges, innovations, and opportunities for life course research. Discover Social Science and Health 2023;3(1)
  51. Kapp A, Hansmeyer J, Mihaljević H. Generative Models for Synthetic Urban Mobility Data: A Systematic Literature Review. ACM Computing Surveys 2024;56(4):1
  52. Ang C, Chiew Y, Wang X, Ooi E, Nor M, Cove M, Chase J. Virtual patient with temporal evolution for mechanical ventilation trial studies: A stochastic model approach. Computer Methods and Programs in Biomedicine 2023;240:107728
  53. Dankar F, Ibrahim M, Ismail L. A Multi-Dimensional Evaluation of Synthetic Data Generators. IEEE Access 2022;10:11147
  54. Mok H, Ostendorf E, Ganninger A, Adler A, Hazan G, Haspel J. Circadian immunity from bench to bedside: a practical guide. Journal of Clinical Investigation 2024;134(3)
  55. Lun R, Siegal D, Ramsay T, Stotts G, Dowlatshahi D, de Carvalho L. Synthetic data in cancer and cerebrovascular disease research: A novel approach to big data. PLOS ONE 2024;19(2):e0295921
  56. Caballero P, Gonzalez-Abril L, Ortega J, Simon-Soro Á. Data Mining Techniques for Endometriosis Detection in a Data-Scarce Medical Dataset. Algorithms 2024;17(3):108
  57. Nasir M, Summerfield N, Carreiro S, Berlowitz D, Oztekin A. A machine learning approach for diagnostic and prognostic predictions, key risk factors and interactions. Health Services and Outcomes Research Methodology 2024
  58. El Emam K, Mosquera L, Fang X, El-Hussuna A. An evaluation of the replicability of analyses using synthetic health data. Scientific Reports 2024;14(1)
  59. Brzezinski R, Banai S, Katz Shalhav M, Stark M, Goldiner I, Rogowski O, Shapira I, Zeltser D, Sasson N, Berliner S, Shacham Y. The CRP troponin test (CTT) stratifies mortality risk in patients with non‐ST elevation myocardial infarction (NSTEMI). Clinical Cardiology 2024;47(4)
  60. Meiser M, Zinnikus I. A Survey on the Use of Synthetic Data for Enhancing Key Aspects of Trustworthy AI in the Energy Domain: Challenges and Opportunities. Energies 2024;17(9):1992
  61. Brzezinski R, Wasserman A, Sasson N, Stark M, Goldiner I, Rogowski O, Berliner S, Argov O. An Exploratory Analysis of Routine Ferritin Measurement Upon Admission and the Prognostic Implications of Low-Grade Ferritinemia During Inflammation. The American Journal of Medicine 2024;137(9):865
  62. Ramgopal S, Belanger T, Lorenz D, Lipsett S, Neuman M, Liebovitz D, Florin T. Preferences for Management of Pediatric Pneumonia. Pediatric Emergency Care 2024
  63. Akpan I, Kobara Y, Owolabi J, Akpan A, Offodile O. Conversational and generative artificial intelligence and human–chatbot interaction in education and research. International Transactions in Operational Research 2025;32(3):1251
  64. Ben Yehuda O, Itelman E, Vaisman A, Segal G, Lerner B. Early Detection of Pulmonary Embolism in a General Patient Population Immediately Upon Hospital Admission Using Machine Learning to Identify New, Unidentified Risk Factors: Model Development Study. Journal of Medical Internet Research 2024;26:e48595
  65. Itair M, Shahrour I, El Meouche R, Hattab N. Enhancing Building Services in Higher Education Campuses through Participatory Science. Buildings 2024;14(9):2784
  66. Wang E, Mott K, Zhang H, Gazit S, Chodick G, Burcu M. Validation Assessment of Privacy‐Preserving Synthetic Electronic Health Record Data: Comparison of Original Versus Synthetic Data on Real‐World COVID‐19 Vaccine Effectiveness. Pharmacoepidemiology and Drug Safety 2024;33(10)
  67. Rashidi H, Albahra S, Rubin B, Hu B. A novel and fully automated platform for synthetic tabular data generation and validation. Scientific Reports 2024;14(1)
  68. Smolyak D, Bjarnadóttir M, Crowley K, Agarwal R. Large language models and synthetic health data: progress and prospects. JAMIA Open 2024;7(4)
  69. Pitkämäki T, Pahikkala T, Perez I, Movahedi P, Nieminen V, Southerington T, Vaiste J, Jafaritadi M, Khan M, Kontio E, Ranttila P, Pajula J, Pölönen H, Degerli A, Plomp J, Airola A. Finnish perspective on using synthetic health data to protect privacy: the PRIVASA project. Applied Computing and Intelligence 2024;4(2):138
  70. Greenberg S, Cohen N, Shopen N, Mordechai R, Zeltser D, Werthein J. Outcomes of ED chest pain visits: the prognostic value of negative but measurable high-sensitivity cardiac troponin (hs-cTn) levels. BMC Emergency Medicine 2024;24(1)
  71. Hogenboom J, Lobo Gomes A, Dekker A, Van Der Graaf W, Husson O, Wee L. Actionability of Synthetic Data in a Heterogeneous and Rare Health Care Demographic: Adolescents and Young Adults With Cancer. JCO Clinical Cancer Informatics 2024;(8)

Books/Policy Documents

  1. Sánchez M, Urquiza-Aguiar L. Doctoral Symposium on Information and Communication Technologies.
  2. Sun S, Wang F, Rashidian S, Kurc T, Abell-Hart K, Hajagos J, Zhu W, Saltz M, Saltz J. Heterogeneous Data Management, Polystores, and Analytics for Healthcare.
  3. Kvak D, Březinová E, Biroš M, Hrubý R. Medical Imaging and Computer-Aided Diagnosis.
  4. Zamstein N, Nanyonga S, Morel E, Wayne R, Nottebaum S, Kozlakidis Z. Digitalization of Medicine in Low- and Middle-Income Countries.