Published on in Vol 12 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/57674, first published .
Data Set and Benchmark (MedGPTEval) to Evaluate Responses From Large Language Models in Medicine: Evaluation Development and Validation

Data Set and Benchmark (MedGPTEval) to Evaluate Responses From Large Language Models in Medicine: Evaluation Development and Validation

Data Set and Benchmark (MedGPTEval) to Evaluate Responses From Large Language Models in Medicine: Evaluation Development and Validation

Journals

  1. Rewthamrongsris P, Burapacheep J, Trachoo V, Porntaveetus T. Accuracy of Large Language Models for Infective Endocarditis Prophylaxis in Dental Procedures. International Dental Journal 2024 View
  2. Andrew A, Tizzard E. Large language models for improving cancer diagnosis and management in primary health care settings. Journal of Medicine, Surgery, and Public Health 2024:100157 View

Books/Policy Documents

  1. Xu H, Xue T, Liu D, Zhang F, Westin C, Kikinis R, O’Donnell L, Cai W. Foundation Models for General Medical AI. View