Published on in Vol 13 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/67552, first published .
Regulatory Insights From 27 Years of Artificial Intelligence/Machine Learning–Enabled Medical Device Recalls in the United States: Implications for Future Governance

Regulatory Insights From 27 Years of Artificial Intelligence/Machine Learning–Enabled Medical Device Recalls in the United States: Implications for Future Governance

Regulatory Insights From 27 Years of Artificial Intelligence/Machine Learning–Enabled Medical Device Recalls in the United States: Implications for Future Governance

1Department of Biomedical Engineering, National Cheng Kung University, No.138, Shengli Rd, North District, Tainan, Taiwan

2Department of Engineering Science, National Cheng Kung University, Tainan, Taiwan

3Department of Regulatory and Quality Sciences, University of Southern California, Los Angeles, CA, United States

4Medical Device Innovation Center, National Cheng Kung University, Tainan, Taiwan

Corresponding Author:

Peng-Ting Chen, PhD


Background: Artificial intelligence/machine learning (AI/ML) has revolutionized the health care industry, particularly in the development and use of medical devices. The US Food and Drug Administration (FDA) has authorized over 878 AI/ML–enabled medical devices, reflecting a growing trend in both quantity and application scope. Understanding the distinct challenges these devices present in terms of FDA regulation violations is crucial for effectively avoiding recalls. This is particularly pertinent for proactive measures regarding medical devices.

Objective: This study explores the impact of AI/ML on medical device recalls, focusing on the distinct causes associated with AI/ML–enabled devices compared with other device types. Recall information associated with 510(k)-cleared devices was obtained from openFDA. Three recall cohorts were established: “All 510(k) devices recall,” “software-related devices recall,” and “AI/ML devices recall.”

Methods: Recall information for 510(k)-cleared devices was obtained from openFDA. AI/ML-enabled medical devices were identified based on FDA listings. Three cohorts were established: “All 510(k) devices recall,” “software-related devices recall,” and “AI/ML devices recall.” Root cause analysis was conducted for each recall event.

Results: The results indicate that while the top 5 recall root causes are relatively similar across the 3 control groups, the proportions vary, with AI/ML devices showing a higher impact for 87% of all recalls. Design and development–related factors play a significant role in recalls of AI/ML devices with root causes related to device design and software design accounting for 50% of recalls, emphasizing the importance of thorough planning, user feedback incorporation, and validation during the development process to reduce the probability of recalls. In addition, changes in software, including design changes and control changes, also contribute substantially to recalls in AI/ML devices.

Conclusions: In conclusion, this study provides valuable insights into the unique challenges and considerations associated with AI/ML–enabled medical device recalls, offering guidance for manufacturers to enhance verification plans and mitigate risks in this rapidly evolving technological landscape.

JMIR Med Inform 2025;13:e67552

doi:10.2196/67552

Keywords



Background

Artificial intelligence/machine learning (AI/ML) has significantly impacted the health care industry, particularly in the development and use of medical devices. Key contributions include enhancing diagnostic accuracy through the analysis of medical imaging data, enabling predictive analytics for disease outcomes, personalizing treatment plans based on patient characteristics, facilitating remote patient monitoring, expediting drug discovery, and improving surgical procedures through robotics [1,2]. In addition, AI aids in optimizing health care management processes, enhancing cybersecurity for patient data protection, and supporting continuous learning and improvement in medical devices.

From 1997 to March 2024, the US Food and Drug Administration (FDA) has authorized over 878 Class II AI/ML–enabled medical devices (refer to (Multimedia Appendix 1) ). The quantity may not be large, but from the annual registration records, it can be observed that such products are increasing year by year (refer to Figure 1).

At the same time, the scope of application is also expanding. This trend is reflected in the classification product codes covered by the registered products [2]. Benjamens et al [1] and Zhu et al [2] have successively pointed out in their research that the number of registered AI/ML products has been continuously increasing since 2010. Zhu et al [2] also mentioned that 82% of the registered AI/ML products are concentrated in the two medical specialties of radiology and cardiology, indicating a significant use of digital medical data (such as digital medical imaging data and electrocardiograms) and relevant applications of pattern recognition in diagnosis.

As the FDA approves more and more AI/ML–enabled medical devices, it is critical to understand the unique challenges they pose when it comes to recalls, which are for violations of FDA regulations. This study examines recall data from the FDA, identifying key causes for recalls specifically associated with AI/ML–enabled medical devices. Proactive measures taken for medical devices, Wallace and Kuhn [3] analyzed system failures caused by software faults using recall information and proposed insights for improving software development. Although their analysis was not specifically focused on AI/ML products, FDA data show that one-third of medical devices operated using software are recalled due to software failures. This noteworthy phenomenon has inspired our research thinking.

Recall is a proactive measure aimed at removing or rectifying products that violate laws enforced by the FDA. It is a voluntary action taken by manufacturers and distributors who recognize their responsibility to safeguard public health from products posing risks of injury, significant deception, or other defects [4-7]. Recalls are initiated when a medical device is found to be defective, poses a risk to health, or exhibits both defects and health risks [8]. In rare cases where a manufacturer or importer neglects to voluntarily recall a device jeopardizing health, the FDA may issue a recall order to the manufacturer under 21 Code of Federal Regulations (CFR) 810.

FDA 21 CFR 7 defines the recall as “a firm’s removal or correction of a marketed product that the FDA considers to be in violation of the laws it administers and against which the agency would initiate legal action, for example, seizure. Recall does not include a market withdrawal or a stock recovery” [9] (refer to Figure 2). Although recalls are intended to ensure patient safety, Zipp [8] points out that the system for initiating recalls often takes a long time to start the recall process [10]. Zipp [8] also uses the example of the recall of Philips sleep apnea ventilator devices to illustrate this situation and proposes using a Unique Device Identifier as a solution to track product flow and expedite the recall process.

Figure 1. Number of artificial intelligence/machine learning–enabled 510(k) medical devices approved per year from 1997 to March 2024. AI: artificial intelligence; ML: machine learning.
Figure 2. Recall activities with responsible organizations. FDA: Food and Drug Administration.

Objectives

This study delves into the recalls linked to AI/ML–enabled medical devices, aiming to discern prevalent recall types unique to these products through comparative analysis with recalls from diverse device types. By scrutinizing recall data, the research endeavors to equip manufacturers with insightful information, enabling them to undertake a more comprehensive evaluation before product introduction. This proactive approach entails implementing measures to preempt root causes associated with recalls, thus mitigating the likelihood of recall incidents. FDA medical device recalls serve as a critical barometer reflecting the intricate landscape of medical device innovation challenges. Each recall unveils potential areas of vulnerability within the innovation process, shedding light on shortcomings in design, development, and regulatory oversight. Recalls highlight the delicate balance between pushing boundaries for technological advancement and ensuring safety and efficacy standards are met. They underscore the complexities of integrating cutting-edge technologies, such as AI/ML, into medical devices while maintaining regulatory compliance. Moreover, these recalls signal the need for continuous vigilance and adaptation in response to evolving risks and emerging technologies, emphasizing the dynamic nature of medical device innovation. As such, FDA recalls provide invaluable lessons for industry stakeholders, guiding them in navigating the intricate terrain of innovation while prioritizing patient safety and regulatory adherence.


Overview

Recall information associated with 510(k)-cleared devices was downloaded from openFDA, a publicly accessible data platform provided by the US FDA to facilitate access to structured regulatory datasets, including medical device recalls [11,12]. AI/ML–enabled medical devices were identified based on the list published by the FDA [13]. These devices were manually screened by the FDA and do not have a specific code to track them. This list comprises medical devices that incorporate AI and ML, based primarily on the information provided within the summary descriptions of their marketing authorizations. The root cause description associated with each recall event was used to characterize the particular recall. Some recall events may be associated with more than one root cause.

In 21 CFR 7.46 (firm-initiated recall), when the initiating firm decides to remove or correct distributed products, reasons for removal or correction must be provided. The Guidance of Initiation of Voluntary Recalls Under 21 CFR Part 7, Subpart C mentions referencing 21 CFR 820.100(a)(2), which requires procedures for implementing corrections or preventive actions to include an investigation of the causes for nonconformities related to the product, processes, and quality systems. Furthermore, in the FDA regulatory procedures manual, chapter 7 (recall procedures) states that when recalls are classified as Class I or significant Class II recalls, the need for an establishment inspection should be assessed to determine the root cause of the problem and document any potential regulatory actions (7-5-1, 3 Establishment Inspection) [9,14]. Investigation and analysis of the root cause, as outlined in the aforementioned FDA requirements, are a crucial part and one of the important factors for the firm’s decision to remove or correct products. In practice, most companies’ root cause statements align with FDA-recognized categories because they rely on the same industry-accepted terminology. However, the actual analysis and conclusion come primarily from the manufacturer’s quality management system. The FDA’s role is to verify and ensure that the manufacturer’s reasoning makes sense and adequately addresses public health concerns.

Three cohorts were established for comparison and analysis including “All 510(k) devices recall” cohort, “software-related devices recall” cohort and “AI/ML devices recall” cohort (refer to Table 1).

Table 1. Characteristics of three recall cohorts.
Root cause description of recallAll 510(k) devices, 167,864 devicesSoftware-related devices, 3071 devicesAI/MLa devices, 878 devices
RecallRank%RecallRank%RecallRank%
Component change control31622179181b
Component design or selection147273305731151
Device design620721414052151338
Employee error943102108141
Environmental control23926128260
Equipment maintenance59014142240
Error in labeling54315160191
Finished device change control264105360
Incorrect or no expiration date104350
Labeling change control451171472301131
Labeling design7791221031511161
Labeling false and misleading30423118280
Labeling mix-ups5301617340
Manufacturing material removal713907350
Material or component contamination35121115320
Mixed-up of materials or components39420150221281
No marketing application10136017290
Nonconforming material or component4730311775482111
Other669511611153122121
Package design or selection4411815380
Packaging86611217300
Packaging change control25525110330
Packaging process control10319219270
Pending17029040250
PMAc144205370
Process change control60113155201291
Process control4717411507651147
Process design107082171821157
Radiation Control for Health and Safety Act2072701401011141
Release of material or component before receiving test results454004390
Reprocessing controls733804410
Software change control126320126111372
Software design310367310313268142
Software design manufacturing process125330125121
Software design change15731015792462
Software in the use environment9337093161
Software manufacturing or software deployment111340111131
Storage1643004400
Under investigation by firm4011596085637223
Unknown or undetermined by firm41119151211
Use error273241821712101
Vendor change control190280163101171
Total amount43,1001009639100162100

aAI/ML: artificial intelligence/machine learning.

bZero.

cPMA: premarket approval.

Data Processing

Each cohort required a specific analytical approach to accurately extract and rank recall events based on their causes. The following numbered list outlines the processes used to collect, filter, and analyze data to derive insights into the prevalence and nature of recalls within each group (refer to Figure 3):

  1. All 510(k) devices recall: the “All 510(k) Devices Recall” cohort begins by gathering all 510(k) medical device recall events from the openFDA database [11]. We specifically exclude recalls related to AI/ML devices and De Novo devices to focus on traditional 510(k) recalls. Post exclusion, the recalls are ranked by the frequency of each root cause, helping identify the most common issues within this group.
  2. Software-related devices recall: this cohort targets recalls specifically linked to software issues in medical devices. Due to the absence of a direct filter for software-related issues in openFDA, we apply two distinct criteria to accurately capture relevant recalls:
    • Criterion 1: we search within the “device_name” and “openfda_device_name” fields for entries that include the term “Software.” Recalls identified under this criterion were then sequenced by the prevalence of each root cause.
    • Criterion 2: we further refine our search by examining the “root cause description,” “product description,” “reason for recall,” and “code-info” fields for mentions of “Software.” Similar to the first criterion, these recalls were then sequenced by root cause prevalence.
    After identifying and ranking recalls based on both criteria, we merge the results to form a comprehensive sequence of software-related device recalls.
  1. AI/ML devices recall: the “AI/ML Devices Recall” cohort consists of recalls involving AI/ML–enabled devices as identified by the FDA [13]. Like the first cohort, we exclude any recalls pertaining to De Novo and premarket approval devices to solely focus on AI/ML 510(k) recalls. These are then ranked by root cause based on occurrence frequency.
Figure 3. The process for cohort distinction. AI: artificial intelligence; FDA: US Food and Drug Administration; ML: machine learning; PMA: premarket approval.

Ethical Considerations

All analyses were conducted using public data available on the FDA medical device databases and from openFDA. Therefore, ethics approval was unnecessary. This study does not include human subject information, primary data collection, or any form of experimentation involving individuals.


The three cohorts presented in this study are as follows: the “All 510(k) devices recall” cohort includes 167,864 medical devices covering 4473 product codes with a total of 43,100 recalls and 42 root causes. The “software-related devices recall” cohort includes 3071 medical devices, covering 438 product codes with 9639 recalls and 42 root causes. The “AI/ML devices recall” cohort comprises 878 class II medical devices, covering 110 product codes with 162 recalls and 17 root causes (refer to Table 1).

Root Causes of Recalls

Recalls in both “All 510(k) devices recall” and “software-related devices recall” cohorts that are directly related to software have six common root causes: software design, software design change, software design manufacturing process, software change control, software manufacturing/software deployment, and software in the use environment. The number of recalls for each of these root causes is the same, indicating that the filtering criteria did not miss software-related 510(k) medical devices. In the “AI/ML devices recall” recalls related to software, there are only three root causes: software design, software change control, and software design change. When summing up all software-related root causes, “All 510(k) devices recall” has 3715 recalls, accounting for 9%, “software-related devices recall” has 3,715 recalls, accounting for 39%, and “AI/ML devices recall” has 75 recalls, accounting for 46% (refer to Table 2 and Figures 4-6).

Looking at the number of recalls for each root cause, the top five root causes for “All 510(k) devices recall” are other/6695/16% (recall root cause/number of recall/% of all 510(k) devices recall), device design/6207/14%, nonconforming material-component/4730/11%, process control/4717/11%, and under investigation by firm/4011/9%, totaling 26,360 recalls or 61%. These do not include software-related root causes. For “software-related devices recall,” the top 5 root causes are software design/3103/32%, device design/1405/15%, other/1115/12%, nonconforming material-component/775/8%, and under investigation by firm/608/6%, totaling 7006 recalls or 73%. In “AI/ML devices recall,” the top five root causes are software design/68/42%, under investigation by firm/37/23%, device design/13/8%, process design/11/7%, and process control/11/7%, totaling 140 recalls or 86% (refer to (Multimedia Appendix 2) ).

Table 2. Recall events ranked by the number of occurrences for all cohorts.
Root causes description of recallAll 510(k) devices, 167,864 devicesSoftware-related devices, 3071 devicesAI/ML devices, 878 devices
RecallRank%RecallRank%RecallRank%
Other6695a1a16a1115b3b12b2121
Device design6,207c2c14c1405c2c15c13b3b8b
Nonconforming material or component4730b3b11b775d4d8d2111
Process control4717d4d11d5076511d4d7d
Under investigation by firm4011e5e9e608e5e6e37c2c23c
Software design3103673103a1a32a68a1a42a
Component design or selection147273305731151
Process design1070821718211e5e7e
Packaging process control10319219270f
Employee error943102108141
Packaging86611217300
Labeling design7791221031511161
Process change control60113155201291
Equipment maintenance59014142240
Error in labeling54315160191
Labeling mix-ups5301617340
Labeling change control451171472301131
Package design or selection4411815380
Unknown or undetermined by firm41119151211
Mixed-up of materials or components39420150221281
Material or component contamination35121115320
Component change control31622179181
Labeling false and misleading30423118280
Use error273241821712101
Packaging change control25525110330
Environmental control23926128260
Radiation Control for Health and Safety Act2072701401011141
Vendor change control190280163101171
Pending17029040250
Storage1643004400
Software design change15731015792462
Software change control126320126111372
Software design manufacturing process125330125121
Software manufacturing or software deployment111340111131
Incorrect or no expiration date104350
No marketing application10136017290
Software in the use environment9337093161
Reprocessing controls733804410
Manufacturing material removal713907350
Release of material or component before receiving test results454004390
Finished device change control264105360
PMAg144205370
Total amount43,1001009639100162100

aRanked number 1 root cause of recall for its cohort.

bRanked number 2 root cause of recall for its cohort.

cRanked number 3 root cause of recall for its cohort.

dRanked number 4 root cause of recall for its cohort.

eRanked number 5 root cause of recall for its cohort.

fZero.

gPMA: premarket approval.

Figure 4. Number of recalls based on the root causes description for “all 510(k) devices”. PMA: premarket approval.
Figure 5. Number of recalls based on the root causes description for “software-related devices.” PMA: premarket approval.
Figure 6. Number of recalls based on the root causes description for “artificial Intelligence/machine learning devices.”

Specific Root Causes

If the recalls with software-related root causes are added to the top five root causes, the total recalls for “All 510(k) devices recall” reach 30,075, accounting for 70%, “software-related devices recall” reach 7618, accounting for 79%, and “AI/ML devices recall” reach 147, accounting for 91%. Excluding recalls with software-related root causes, the top five root causes for “All 510(k) devices recall” have 26,360 recalls, accounting for 61%, “software-related devices recall” have 3903 recalls, accounting for 40%, and “AI/ML devices recall” have 72 recalls, accounting for 44% (refer to Table 3 and Multimedia Appendix 3 ).

Furthermore, the first root cause for both “software-related devices recall” and “AI/ML devices recall” is software design with extremely high percentages of 32% and 42%, respectively. In “All 510(k) devices recall,” software design still accounts for 6% of total recalls, ranking sixth. After excluding root causes with unclear definitions, such as other and under investigation by firm, and adding the recall quantities for the second-ranking root cause with significance in the three lists, which is device design, the combined percentage of software design and device design increases to 47% for “software-related devices recall” and 50% for “AI/ML devices recall” reaching half of the total recalls. For “All 510(k) devices recall,” it also reaches 22%, surpassing one-fifth of the total recalls (refer to Table 3).

In terms of product categories, recalls of AI/ML–enabled devices and software-related devices are concentrated in Radiology with another significant category being Cardiovascular. This corresponds to the number of product registrations. Recalls of all 510(k) devices are distributed across a greater number of product categories. However, Radiology and Cardiovascular remain among the categories with higher recall numbers. In addition to these, Orthopedic, General Hospital, and Gastroenterology/Urology are categories with relatively higher recall numbers.

Table 3. Statistical comparison of specific root cause recall occurences among different recall cohorts.
Recall with specific root causesAll 510(K) devicesSoftware-related devicesAI/ML devices
Recall%Recall%Recall%
Recall amount of all root causes43,100100%9369100%162100%
Recall amount of all software related root cause37159%371539%7546%
Recall amount of top 5 recall root causes26,36061%700673%14086%
Top 5+ all software related root causes30,07570%761879%14791%
Top 5 - any software related root causes26,36061%390340%7244%
Device design + software design931022%450847%9150%
Top 5 - (other + under investigation by firm)15,65436%528355%10364%
Recall amount of change related root causes21225%4955%117%

Principal Findings

This study investigated the underlying reasons behind recalls linked to AI/ML–enabled medical devices, pinpointing distinct causes compared with other device types. The first control group encompasses all recalls linked to either a 510(k) device. The second control group encompasses recalls tied to software.

Comparing the root causes and their corresponding recall quantities among the “All 510(k) devices recall,” “software-related devices recall,” and “AI/ML devices recall” groups, it’s evident that the combined recall occurrences for the top five root causes accounted for over 60% across all cohorts [15]. The recall total for “software-related devices recall” even surpasses 73%, while “AI/ML devices recall” reaches 86%. Despite the inclusion of unclearly defined root causes such as “other” and “under investigation by firm” in the top 5, removing these 2 root causes still results in proportions of 36%, 55%, and 64% for “All 510(k) devices recall,” “software-related devices recall,” and “AI/ML devices recall,” respectively. Although the definitions of products in the three control groups differ, it is interesting to note that if the top seven recall root causes are considered, the three groups are not far apart. The ranking of recalls varies slightly, and in the case of “AI/ML devices recall,” process design replaces component design/selection [14]. Improving the top seven recall root causes can have a significant positive impact on recalls and is a topic worthy of further investigation (refer to Table 3).

Looking at recalls related to software, it is clear they constitute a high proportion in “software-related devices recall” and “AI/ML devices recall” accounting for 39% and 46%, respectively. The software design root cause alone accounts for 32% and 42%, respectively with a combined percentage of 82% and 91% for all software-related recall root causes in “software-related devices recall” and “AI/ML devices recall.” Software design can be considered the primary recall root cause for both groups. When adding the device design factor to recalls, the total reaches close to half for both “software-related devices recall” and “AI/ML devices recall,” especially in “AI/ML devices recall,” where it surpasses half, reaching 50%. Including recalls related to process design and labeling design further increases the proportion to 58% [14]. These design and development-related factors leading to recalls likely represent inherent risks present during the product planning or design phase. The impact of these factors on “AI/ML devices recall” is notably higher than in the other 2 control groups. Considering experienced design and development engineers, incorporating user feedback, and implementing thorough validation during the development process could effectively reduce the probability of recalls.

Another group of root causes affecting recalls in “AI/ML devices recall” is related to changes, including software design change, software change control, process change control, vendor change control, and labeling change control. Although the quantities are relatively fewer with 4 root causes, they account for over a quarter (29.4%) of the total root causes, causing 7% of recalls. Compared with the 5% for “All 510(k) devices recall” and 5% for “software-related devices recall,” this is relatively higher. Particularly in “AI/ML devices recall,” where changes in corresponding software are relatively frequent, it is crucial for manufacturers to have comprehensive verification plans after any alterations.

The primary reasons for recalls in AI/ML medical devices include software design errors, nonconforming materials, inadequate process controls, and software implementation issues. Among these, software design errors are the most common, potentially leading to device malfunction, incorrect dosing, and compromised patient safety. Unlike traditional medical devices, AI/ML systems involve dynamic algorithms that evolve through data-driven learning, necessitating specialized validation processes beyond conventional static software testing [1,2]. Moreover, many AI/ML medical device recalls result from implicit assumptions made during algorithm design, such as dosage calculations or diagnostic thresholds. These assumptions must be explicitly identified, documented, and integrated into rigorous validation processes [4]. As a result, AI/ML–enabled medical devices present unique validation and quality assurance challenges, requiring continuous monitoring and adaptive regulatory frameworks to ensure their reliability and safety.

Furthermore, as AI/ML devices adapt postmarket based on new data, continuous market surveillance and real-time feedback loops are imperative to identify risks and respond proactively. Regulatory guidance increasingly emphasizes the necessity of robust monitoring systems, postmarket validation, and rapid recall readiness to address emerging problems [5,9]. These elements should be systematically incorporated into development lifecycle management plans for AI/ML–enabled medical devices.

Specifically, our analysis indicates that the most frequent root causes for AI/ML–enabled device recalls are software design errors, inappropriate software changes, inadequate process controls, and issues in software implementation. These factors can lead to critical patient safety impacts, including incorrect dosing, system malfunction, diagnostic errors, or device stoppage, with the potential for severe adverse health outcomes.

Based on the author’s over 20 years of experience in testing, inspection, and certification of medical devices within the testing, inspection and certification industry, including work with authorized third-party organizations such as US FDA Accredited Person and EU (European Union) Notified Body, the author recommends that manufacturers adopt the following strategies to ensure the long-term safety and reliability of AI/ML–enabled medical devices:

  1. Implement specialized validation frameworks, where AI/ML models undergo rigorous adversarial testing, real-world scenario validation, and continual performance monitoring, moving beyond reliance on premarket testing alone,
  2. Establish ongoing market surveillance mechanisms, shifting regulatory focus toward total product lifecycle oversight to ensure AI systems remain effective as they evolve with new clinical data, and
  3. Integrate user-centric risk mitigation practices, such as proactive feedback loops, to enable rapid adjustments and software updates that address emerging risks in clinical settings.

These strategies will better align regulatory oversight with the evolving nature of AI/ML technologies, ensuring both patient safety and long-term technology sustainability.

Limitation

This study has several limitations. First, the FDA currently does not have specific product codes or identification methods for Software as a Medical Device or software-related medical devices. Therefore, the data used in this study were derived from existing FDA data related to software to identify and analyze recall data for software-related medical devices as comprehensively as possible. Second, AI/ML–enabled devices also lack specific product codes or identification methods. Although the FDA manually updates its registry of AI/ML–enabled medical devices, this process is irregular, potentially leading to omissions in the data. These limitations restrict the ability to fully capture and analyze the unique challenges of AI/ML–enabled medical devices, such as specialized algorithm validation, handling of implicit assumptions, and algorithm performance variability. In addition, the publicly available FDA recall databases lack comprehensive information on software-specific faults, manufacturer-specific internal verification processes, and the complexity inherent in AI/ML technologies, including algorithm adaptability over time, real-world performance drifts, and user interaction dynamics.

Future Work

To address these limitations, future research should incorporate direct interactions with device manufacturers, internal company data, qualitative insights from developers, and longitudinal analyses to better explore these challenges and validate effective risk management strategies [10,14,16,17]. Based on the results of this study, we also plan to further investigate each recall of AI/ML–enabled medical devices and the reported reasons behind them to conduct a more in-depth analysis.

We hope to systematically deduce the real factors leading to recalls and compare them with current regulatory requirements to see if there is a need to adjust or strengthen regulatory requirements to reduce the probability of recalls and enhance public safety.

As AI/ML continues to reshape medical practices, a proactive and informed approach to regulatory oversight and device development becomes imperative for ensuring the safety and efficacy of these transformative technologies.

Conclusion

As the FDA authorizes an increasing number of AI/ML–enabled medical devices, it becomes crucial to understand the distinct challenges they pose in terms of recalls. This study delves into the intricate landscape of medical device recalls, focusing specifically on those associated with AI and ML. Comparisons across three control groups, namely “All 510(k) devices,” “Software-related devices,” and “AI/ML devices,” unveil similarities in the top 7 recall root causes with variations in proportions. Notably, AI/ML devices exhibit a higher impact, especially concerning design and development-related factors. This study underscores the importance of robust planning, user feedback incorporation, and thorough validation during the development process to mitigate the inherent risks associated with AI/ML–enabled medical devices. Recalls related to software, particularly software design, constitute a significant proportion in both “software devices” and “AI/ML devices.” Changes in software also emerge as a notable contributor to recalls in AI/ML devices, underscoring the need for comprehensive verification plans after alterations.

In addition to standard regulatory compliance, AI/ML–enabled medical devices face unique validation challenges due to their dynamic nature and reliance on continuous learning algorithms. Such devices require specialized validation strategies that accommodate algorithm adaptability and evolving real-world performance [4]. Continuous market analysis and proactive user feedback integration are also particularly crucial for these devices, as they help detect and address issues promptly, thereby preventing recalls related to software logic errors, data-driven failures, or algorithmic inaccuracies [3,4].

Acknowledgments

The authors extend their gratitude to the BioMe Lab (Medical Device Innovation Department) at National Cheng Kung University of Taiwan for the help in structuring the article and editing the content. This research was supported by the BioMed Commercialization Center and the Ministry of Science and Technology under grant number NSTC 112-2628-E-006-008-MY3, as well as the Medical Device Innovation Center (MDIC), National Cheng Kung University (NCKU), through the Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MoE) in Taiwan.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Food and Drug Administration artificial intelligence/machine learning–enabled medical devices list.

XLSX File, 74 KB

Multimedia Appendix 2

Number of recalls from the top 5 root case for (a) all 510(k) devices recall, (b) software-related devices recall, and (c) artificial intelligence/machine learning devices recall.

PNG File, 188 KB

Multimedia Appendix 3

Number of recall with specific root causes for (a) all 510(k) devices, (b) software-related devices, and (c) artificial intelligence/machine learning devices.

PNG File, 181 KB

  1. Benjamens S, Dhunnoo P, Meskó B. The state of artificial intelligence-based FDA-approved medical devices and algorithms: an online database. NPJ Digit Med. 2020;3. [CrossRef] [Medline]
  2. Zhu S, Gilbert M, Chetty I, et al. The 2021 landscape of FDA-approved artificial intelligence/machine learning-enabled medical devices: an analysis of the characteristics and intended use. Int J Med Inform. Sep 2022;165:104828. [CrossRef] [Medline]
  3. Wallace DR, Kuhn DR. Failure modes in medical device software: an analysis of 15 years of recall data. Int J Rel Qual Saf Eng. Dec 2001;08(04):351-371. [CrossRef]
  4. Fu Z, Guo C, Zhang Z, et al. Study of software-related causes in the FDA medical device recalls. Presented at: 2017 22nd International Conference on Engineering of Complex Computer Systems (ICECCS); Nov 5-8, 2017; Fukuoka, Japan. [CrossRef]
  5. Anderson HR. FDA Finalizes Voluntary Recall Guidance Imploring Companies to Be 'Recall Ready'. Faegre Drinker Biddle & Reath LLP. 2022. URL: https:/​/www.​faegredrinker.com/​en/​insights/​publications/​2022/​3/​fda-finalizes-voluntary-recall-guidance-imploring-companies-to-be-recall-ready [Accessed 2024-02-29]
  6. Dubin JR, Simon SD, Norrell K, et al. Risk of recall among medical devices undergoing US Food and Drug Administration 510(k) clearance and premarket approval, 2008-2017. JAMA Netw Open. May 3, 2021;4(5):e217274. [CrossRef]
  7. Kadakia KT, Dhruva SS, Caraballo C, et al. Use of recalled devices in new device authorizations under the US Food and Drug Administration’s 510(k) pathway and risk of subsequent recalls. JAMA. Jan 10, 2023;329(2):136-143. [CrossRef] [Medline]
  8. Zipp R. Anatomy of a medical device recall: how defective products can slip through an outdated system. MedTech Dive. 2021. URL: https://www.medtechdive.com/news/medical-device-recall-process-fda-philips-medtronic/608205/ [Accessed 2024-09-02]
  9. Subhan A. Medical Device Recalls. J Clin Eng. 2022;47(2):61-62. [CrossRef]
  10. Pace T. Analysis of FDA medical device recalls time between recall initation date and termination date. Faculty of California State University; 2020. URL: https://scholarworks.calstate.edu/downloads/gm80j116d [Accessed 2025-06-25]
  11. Open-source APIs. openFDA. URL: https://open.fda.gov/ [Accessed 2024-01-22]
  12. Muehlematter UJ, Bluethgen C, Vokinger KN. FDA-cleared artificial intelligence and machine learning-based medical devices and their 510(k) predicate networks. Lancet Digit Health. Sep 2023;5(9):e618-e626. URL: www.thelancet.com/ [CrossRef] [Medline]
  13. Artificial intelligence and machine learning (AI/ML)-enabled medical devices. FDA. URL: https:/​/www.​fda.gov/​medical-devices/​software-medical-device-samd/​artificial-intelligence-and-machine-learning-aiml-enabled-medical-devices [Accessed 2024-08-02]
  14. Thompson BM. Unpacking averages: common root causes driving medical device recalls. Epstein Becker & Green. 2022. URL: https:/​/www.​healthlawadvisor.com/​unpacking-averages-common-root-causes-driving-medical-device-recalls [Accessed 2024-03-03]
  15. May C. Medical device recalls. Online GMP Training. URL: https://www.onlinegmptraining.com/medical-device-recalls/ [Accessed 2024-03-07]
  16. Zuckerman DM, Brown P, Nissen SE. Medical device recalls and the FDA approval process. Arch Intern Med. Jun 13, 2011;171(11):1006-1011. [CrossRef] [Medline]
  17. Stern AD, Gordon WJ, Landman AB, et al. Cybersecurity features of digital medical devices: an analysis of FDA product summaries. BMJ Open. Jun 28, 2019;9(6):e025374. [CrossRef] [Medline]


AI: artificial intelligence
CFR: Code of Federal Regulations
EU: European Union
FDA: Food and Drug Administration
ML: machine learning


Edited by Christian Lovis; submitted 15.10.24; peer-reviewed by Danni Li, Dignesh Khunt; final revised version received 25.03.25; accepted 27.03.25; published 11.07.25.

Copyright

© Wei-Pin Chen, Wei-Guang Teng, C Benson Kuo, Yu-Jui Yen, Jian-Yu Lian, Matthew Sing, Peng-Ting Chen. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.7.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.