Published on in Vol 8, No 7 (2020): July

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/18910, first published .
Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing

Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing

Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing

Journals

  1. Pita Costa J, Grobelnik M, Fuart F, Stopar L, Epelde G, Fischaber S, Poliwoda P, Rankin D, Wallace J, Black M, Bond R, Mulvenna M, Weston D, Carlin P, Bilbao R, Nikolic G, Shi X, De Moor B, Pikkarainen M, Paakkonen J, Staines A, Connolly R, Davis P. Meaningful Big Data Integration for a Global COVID-19 Strategy. IEEE Computational Intelligence Magazine 2020;15(4):51 View
  2. Dankar F, Ibrahim M. Fake It Till You Make It: Guidelines for Effective Synthetic Data Generation. Applied Sciences 2021;11(5):2158 View
  3. Aggarwal R, Sounderajah V, Martin G, Ting D, Karthikesalingam A, King D, Ashrafian H, Darzi A. Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis. npj Digital Medicine 2021;4(1) View
  4. El Emam K, Mosquera L, Jonker E, Sood H. Evaluating the utility of synthetic COVID-19 case data. JAMIA Open 2021;4(1) View
  5. Libbi C, Trienes J, Trieschnigg D, Seifert C. Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records. Future Internet 2021;13(5):136 View
  6. Pereira T, Morgado J, Silva F, Pelter M, Dias V, Barros R, Freitas C, Negrão E, Flor de Lima B, Correia da Silva M, Madureira A, Ramos I, Hespanhol V, Costa J, Cunha A, Oliveira H. Sharing Biomedical Data: Strengthening AI Development in Healthcare. Healthcare 2021;9(7):827 View
  7. Papyshev G, Yarime M. Exploring city digital twins as policy tools: A task-based approach to generating synthetic data on urban mobility. Data & Policy 2021;3 View
  8. An R, Shen J, Xiao Y. Applications of Artificial Intelligence to Obesity Research: Scoping Review of Methodologies. Journal of Medical Internet Research 2022;24(12):e40589 View
  9. Hernandez M, Epelde G, Beristain A, Álvarez R, Molina C, Larrea X, Alberdi A, Timoleon M, Bamidis P, Konstantinidis E. Incorporation of Synthetic Data Generation Techniques within a Controlled Data Processing Workflow in the Health and Wellbeing Domain. Electronics 2022;11(5):812 View
  10. El Emam K, Mosquera L, Fang X, El-Hussuna A. Utility Metrics for Evaluating Synthetic Health Data Generation Methods: Validation Study. JMIR Medical Informatics 2022;10(4):e35734 View
  11. Desmet C, Cook D. Recent Developments in Privacy-preserving Mining of Clinical Data. ACM/IMS Transactions on Data Science 2021;2(4):1 View
  12. Singh A, Amutha J, Nagar J, Sharma S, Lee C. AutoML-ID: automated machine learning model for intrusion detection using wireless sensor network. Scientific Reports 2022;12(1) View
  13. ATASEVER S, AZGINOGLU N, TERZI D, TERZI R. A comprehensive survey of deep learning research on medical image analysis with focus on transfer learning. Clinical Imaging 2023;94:18 View
  14. Dankar F, Ibrahim M, Ismail L. A Multi-Dimensional Evaluation of Synthetic Data Generators. IEEE Access 2022;10:11147 View
  15. Wimmer S, Finger R. A note on synthetic data for replication purposes in agricultural economics. Journal of Agricultural Economics 2023;74(1):316 View
  16. Marquez Chavez J, Tang W. A Vision-Based System for Stage Classification of Parkinsonian Gait Using Machine Learning and Synthetic Data. Sensors 2022;22(12):4463 View
  17. Braddon A, Robinson S, Alati R, Betts K. Exploring the utility of synthetic data to extract more value from sensitive health data assets: A focused example in perinatal epidemiology. Paediatric and Perinatal Epidemiology 2023;37(4):292 View
  18. Brown C, Moore J, Tilford J. Rates Of Preterm Birth And Low Birthweight: An Analysis Of Racial And Ethnic Populations. Health Affairs 2023;42(2):261 View
  19. Singh A, Amutha J, Nagar J, Sharma S, Lee C. LT-FS-ID: Log-Transformed Feature Learning and Feature-Scaling-Based Machine Learning Algorithms to Predict the k-Barriers for Intrusion Detection Using Wireless Sensor Network. Sensors 2022;22(3):1070 View
  20. Solanke A, Biasiotti M. Digital Forensics AI: Evaluating, Standardizing and Optimizing Digital Evidence Mining Techniques. KI - Künstliche Intelligenz 2022;36(2):143 View
  21. Kuo N, Polizzotto M, Finfer S, Garcia F, Sönnerborg A, Zazzi M, Böhm M, Kaiser R, Jorm L, Barbieri S. The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms. Scientific Data 2022;9(1) View
  22. Liventsev V, Härmä A, Petković M. Towards Effective Patient Simulators. Frontiers in Artificial Intelligence 2021;4 View
  23. Monarca I, Cibrian F, Chavez E, Tentori M. Using a small dataset to classify strength-interactions with an elastic display: a case study for the screening of autism spectrum disorder. International Journal of Machine Learning and Cybernetics 2023;14(1):151 View
  24. Abdelmigid H, Baz M, AlZain M, Al-Amri J, Zaini H, Morsi M, Abualnaja M, Alhuthal N. A Novel Generative Adversarial Network Model Based on GC-MS Analysis for the Classification of Taif Rose. Applied Sciences 2023;13(5):3052 View
  25. Evrimler S, Ali Gedik M, Ahmet Serel T, Ertunc O, Alperen Ozturk S, Soyupek S. Bladder Urothelial Carcinoma: Machine Learning-based Computed Tomography Radiomics for Prediction of Histological Variant. Academic Radiology 2022;29(11):1682 View
  26. Deniz-Garcia A, Fabelo H, Rodriguez-Almeida A, Zamora-Zamorano G, Castro-Fernandez M, Alberiche Ruano M, Solvoll T, Granja C, Schopf T, Callico G, Soguero-Ruiz C, Wägner A. Quality, Usability, and Effectiveness of mHealth Apps and the Role of Artificial Intelligence: Current Scenario and Challenges. Journal of Medical Internet Research 2023;25:e44030 View
  27. Hernadez M, Epelde G, Alberdi A, Cilla R, Rankin D. Synthetic Tabular Data Evaluation in the Health Domain Covering Resemblance, Utility, and Privacy Dimensions. Methods of Information in Medicine 2023;62(S 01):e19 View
  28. Reich T, Budka M, Hulbert D. Bus journey simulation to develop public transport predictive algorithms. Soft Computing Letters 2021;3:100029 View
  29. Yan C, Yan Y, Wan Z, Zhang Z, Omberg L, Guinney J, Mooney S, Malin B. A Multifaceted benchmarking of synthetic electronic health record generation models. Nature Communications 2022;13(1) View
  30. Majeed A, Zhang X. On the Adoption of Modern Technologies to Fight the COVID-19 Pandemic: A Technical Synthesis of Latest Developments. COVID 2023;3(1):90 View
  31. Abedi M, Hempel L, Sadeghi S, Kirsten T. GAN-Based Approaches for Generating Structured Data in the Medical Domain. Applied Sciences 2022;12(14):7075 View
  32. Ahmed G, Malick R, Akhunzada A, Zahid S, Sagri M, Gani A. An Approach towards IoT-Based Predictive Service for Early Detection of Diseases in Poultry Chickens. Sustainability 2021;13(23):13396 View
  33. Sueur C, Bousquet C, Espinosa R, Deneubourg J. Improving human collective decision-making through animal and artificial intelligence. Peer Community Journal 2021;1 View
  34. Steinhoff J. Toward a political economy of synthetic data: A data-intensive capitalism that is not a surveillance capitalism?. New Media & Society 2024;26(6):3290 View
  35. Carvalho T, Moniz N, Faria P, Antunes L. Survey on Privacy-Preserving Techniques for Microdata Publication. ACM Computing Surveys 2023;55(14s):1 View
  36. Restrepo J, Rivera J, Laniado H, Osorio P, Becerra O. Nonparametric Generation of Synthetic Data Using Copulas. Electronics 2023;12(7):1601 View
  37. Ganguli R, Lad R, Lin A, Yu X. Novel Generative Recurrent Neural Network Framework to Produce Accurate, Applicable, and Deidentified Synthetic Medical Data for Patients With Metastatic Cancer. JCO Clinical Cancer Informatics 2023;(7) View
  38. Pathare A, Mangrulkar R, Suvarna K, Parekh A, Thakur G, Gawade A. Comparison of tabular synthetic data generation techniques using propensity and cluster log metric. International Journal of Information Management Data Insights 2023;3(2):100177 View
  39. Sliman H, Megdiche I, Alajramy L, Taweel A, Yangui S, Drira A, Lamine E. MedWGAN based synthetic dataset generation for Uveitis pathology. Intelligent Systems with Applications 2023;18:200223 View
  40. Zhang J, Lim J, Kim M, Hur S, Chung T. WM–STGCN: A Novel Spatiotemporal Modeling Method for Parkinsonian Gait Recognition. Sensors 2023;23(10):4980 View
  41. ji X, Suehiro D, Uchida S. Paired contrastive feature for highly reliable offline signature verification. Pattern Recognition 2023;144:109816 View
  42. Majeed A. Attribute-Centric and Synthetic Data Based Privacy Preserving Methods: A Systematic Review. Journal of Cybersecurity and Privacy 2023;3(3):638 View
  43. El Kababji S, Mitsakakis N, Fang X, Beltran-Bless A, Pond G, Vandermeer L, Radhakrishnan D, Mosquera L, Paterson A, Shepherd L, Chen B, Barlow W, Gralow J, Savard M, Clemons M, El Emam K. Evaluating the Utility and Privacy of Synthetic Breast Cancer Clinical Trial Data Sets. JCO Clinical Cancer Informatics 2023;(7) View
  44. Kim K, Kwak J. PVS-GEN: Systematic Approach for Universal Synthetic Data Generation Involving Parameterization, Verification, and Segmentation. Sensors 2024;24(1):266 View
  45. Usman Akbar M, Larsson M, Blystad I, Eklund A. Brain tumor segmentation using synthetic MR images - A comparison of GANs and diffusion models. Scientific Data 2024;11(1) View
  46. Taylor K, Sheikh W. Automated hearing impairment diagnosis using machine‐learning: An open‐source software development undergraduate research project. Computer Applications in Engineering Education 2024;32(3) View
  47. Ling X, Menzies T, Hazard C, Shu J, Beel J. Trading Off Scalability, Privacy, and Performance in Data Synthesis. IEEE Access 2024;12:26642 View
  48. Pronello C, Anbarasan D, Spoturno F, Terzolo G. A low-cost automatic people-counting system at bus stops using Wi-Fi probe requests and deep learning. Public Transport 2024 View
  49. Akiya I, Ishihara T, Yamamoto K. Comparison of Synthetic Data Generation Techniques for Control Group Survival Data in Oncology Clinical Trials: Simulation Study. JMIR Medical Informatics 2024;12:e55118 View
  50. Hernandez M, Konstantinidis E, Epelde G, Londoño F, Petsani D, Timoleon M, Fiska V, Mpaltadoros L, Maga-Nteve C, Machairas I, Bamidis P. A Secure Data Publishing and Access Service for Sensitive Data from Living Labs: Enabling Collaboration with External Researchers via Shareable Data. Big Data and Cognitive Computing 2024;8(6):55 View
  51. Stanton I, Munir K, Ikram A, El‐Bakry M. Data augmentation for predictive maintenance: Synthesising aircraft landing gear datasets. Engineering Reports 2024 View
  52. Kühnel L, Schneider J, Perrar I, Adams T, Moazemi S, Prasser F, Nöthlings U, Fröhlich H, Fluck J. Synthetic data generation for a longitudinal cohort study – evaluation, method extension and reproduction of published data analysis results. Scientific Reports 2024;14(1) View
  53. Osorio-Marulanda P, Epelde G, Hernandez M, Isasa I, Reyes N, Iraola A. Privacy Mechanisms and Evaluation Metrics for Synthetic Data Generation: A Systematic Review. IEEE Access 2024;12:88048 View
  54. Scroggins J, Topaz M, Song J, Zolnoori M. Does synthetic data augmentation improve the performances of machine learning classifiers for identifying health problems in patient–nurse verbal communications in home healthcare settings?. Journal of Nursing Scholarship 2024 View
  55. Chakraborty A, Gao S, Miry R, Ramazi P, Greiner R, Lewis M, Wang H. An early warning indicator trained on stochastic disease-spreading models with different noises. Journal of The Royal Society Interface 2024;21(217) View
  56. Little C, Allmendinger R, Elliot M. Synthetic Census Microdata Generation: A Comparative Study of Synthesis Methods Examining the Trade-Off Between Disclosure Risk and Utility. Journal of Official Statistics 2024 View
  57. Qavi I, Tan G. Harnessing interpretable and ensemble machine learning techniques for precision fabrication of aligned micro-fibers. Manufacturing Letters 2024;41:364 View
  58. Hadley A, Pulliam C. Enhancing Activity Recognition After Stroke: Generative Adversarial Networks for Kinematic Data Augmentation. Sensors 2024;24(21):6861 View
  59. Jo H, Ahn S, Ohn J, Shin C, Ji E, Kim D, Jung S, Lee J. Insulin Resistance and Impaired Insulin Secretion Predict Incident Diabetes: A Statistical Matching Application to the Two Korean Nationwide, Population-Representative Cohorts. Endocrinology and Metabolism 2024;39(5):711 View
  60. Türkmen İ, Söyler A, Aliyev S, Semiz T. Bibliometric and Content Analysis of Articles on Artificial Intelligence in Healthcare. Journal of International Health Sciences and Management 2024;10(20):137 View
  61. Qi X, Meng H, Xu N, Mei G, Peng J. A knowledge-data dually driven paradigm for accurate identification of key blocks in complex rock slopes. Journal of Rock Mechanics and Geotechnical Engineering 2024 View
  62. Ștefănigă S, Cordoș A, Ivascu T, Feier C, Muntean C, Stupinean C, Călinici T, Aluaș M, Bolboacă S. Advancing Precision Oncology with Digital and Virtual Twins: A Scoping Review. Cancers 2024;16(22):3817 View
  63. Lautrup A, Hyrup T, Zimek A, Schneider-Kamp P. Systematic Review of Generative Modelling Tools and Utility Metrics for Fully Synthetic Tabular Data. ACM Computing Surveys 2024 View

Books/Policy Documents

  1. Llugiqi M, Mayer R. Machine Learning and Knowledge Extraction. View
  2. Sari M, Berawi M, Zagloel T, Amatkasmin L, Susantono B. Innovations in Digital Economy. View
  3. Antoniou J, Tringides O. Effects of Data Overload on User Quality of Experience. View
  4. Aldraimli M, Nazyrova N, Djumanov A, Sobirov I, Chaussalet T. Contemporary Methods in Bioinformatics and Biomedicine and Their Applications. View
  5. Little C, Elliot M, Allmendinger R. Privacy in Statistical Databases. View
  6. Huang J, Yin J, Wang S, Kong D. Computational and Experimental Simulations in Engineering. View
  7. Keller A, Martins Pereira C, Pires M. Multidisciplinary Perspectives on Artificial Intelligence and the Law. View
  8. Marwala T. Mechanism Design, Behavioral Science and Artificial Intelligence in International Relations. View
  9. Marwala T. Mechanism Design, Behavioral Science and Artificial Intelligence in International Relations. View