Original Paper
Abstract
Background: Data standards are not only key to making data processing efficient but also fundamental to ensuring data interoperability. When clinical trial data are structured according to international standards, they become significantly easier to analyze, reducing the efforts required for data cleaning, preprocessing, and secondary use. A common language and a shared set of expectations facilitate interoperability between systems and devices.
Objective: The main objectives of this study were to identify commonalities and differences in clinical trial metadata, protocols, and data collection systems/items within the VACCELERATE project.
Methods: To assess the degree of interoperability achieved in the project and suggest methodological improvements, interoperable points were identified based on the core outcome areas—immunogenicity, safety, and efficacy (clinical/physiological). These points were emphasized in the development of the master protocol template and were manually compared in the following ways: (1) summaries, objectives, and end points in the protocols of 3 VACCELERATE clinical trials (EU-COVAT-1_AGED, EU-COVAT-2_BOOSTAVAC, and EU-COVPT-1_CoVacc) against the master protocol template; (2) metadata of all 3 clinical trials; and (3) evaluations from a questionnaire survey regarding differences in data management systems and structures that enabled data exchange within the VACCELERATE network.
Results: The noncommonalities identified in the protocols and metadata were attributed to differences in populations, variations in protocol design, and vaccination patterns. The detailed metadata released for all 3 vaccine trials were clearly structured using internal standards, terminology, and the general approach of Clinical Data Acquisition Standards Harmonisation (CDASH) for data collection (eg, on electronic case report forms). VACCELERATE benefited significantly from the selection of the Clinical Trials Centre Cologne as the sole data management provider. With system database development coordinated by a single individual and no need for coordination among different trial units, a high degree of uniformity was achieved automatically. The harmonized transfer of data to all sites, using well-established methods, enabled quick exchanges and provided a relatively secure means of data transfer.
Conclusions: This study demonstrated that using master protocols can significantly enhance trial operational efficiency and data interoperability, provided that similar infrastructure and data management procedures are adopted across multiple trials. To further improve data interoperability and facilitate interpretation and analysis, shared data should be structured, described, formatted, and stored using widely recognized data and metadata standards.
Trial Registration: EudraCT 2021-004526-29; https://www.clinicaltrialsregister.eu/ctr-search/trial/2021-004526-29/DE/; 2021-004889-35; https://www.clinicaltrialsregister.eu/ctr-search/search?query=eudract_number:2021-004889-35; and 2021-004526-29; https://www.clinicaltrialsregister.eu/ctr-search/search?query=eudract_number:2021-004526-29
doi:10.2196/65590
Keywords
Introduction
Interoperability is defined as the ability of different information systems, devices, or applications to connect in a coordinated manner within and across organizational boundaries, enabling stakeholders to access, exchange, and cooperatively use data with the goal of optimizing the health of individuals and populations [
].Interoperability standards provide a common language and set of expectations that enable interoperability between systems and devices. These standards allow clinicians, laboratories, hospitals, pharmacies, and patients to seamlessly share data, regardless of the application or market supplier, thereby improving the coordination and delivery of health care [
].The health interoperability ecosystem comprises individuals, systems, and processes that aim to share, exchange, and access all forms of health information, including discrete, narrative, and multimedia data. Potential stakeholders in this ecosystem include individuals, patients, providers, hospitals/health systems, researchers, payers, suppliers, and systems, all of whom contribute to the creation, exchange, and use of health information and data. When data are structured according to international standards, they become much easier to analyze, reducing the efforts required for data cleaning and preprocessing. Data standards are not only essential for efficient data processing but are also fundamental to ensuring the interoperability of the data itself [
]. The greater use of data standards is crucial to the success of data sharing. Without such standards, shared data are difficult to interpret with certainty, significantly more time-consuming to process, and therefore more expensive to collect. Standards can be applied to data item definitions and codes, controlled vocabularies for categories, and the structuring and exchange of data. File formats used for storing and transferring data should also be standardized to simplify data processing [ ]. Clinical trial datasets should always include metadata describing the characteristics of each data item (eg, type, code, name, and possibly an ontology reference), along with details of the trial’s schedule and design. Datasets should be made available for sharing in 1 or more standardized file formats that are compatible with a wide variety of systems [ ].In January 2021, the European Commission launched VACCELERATE—the European Corona Vaccine Trial Accelerator Platform [
] (GA 101037867)—to coordinate phases II and III of COVID-19 vaccine trials across Europe. VACCELERATE aimed to serve as a single-entry point for multicountry vaccine trials in Europe and to address the need for greater inclusion of underrepresented populations in vaccine trials. One of the fundamental objectives of VACCELERATE was to promote the sharing, interoperability, and FAIRness (Findable, Accessible, Interoperable, and Reusable) of data generated through the VACCELERATE network. This included ensuring that data sharing and future reuse were considered by investigators, the scientific community, and study sponsors. The objective of this study was to assess the degree of commonalities and differences among clinical trial metadata, protocols, and data collection systems/items in VACCELERATE consortium projects [ - ]. Additionally, the study aimed to explore the reasons why interoperability was sometimes limited or failed to occur. It also sought to identify areas where interoperability was relatively straightforward and to propose structures and systems that could enhance interoperability in the future, including for data generated outside the trials funded under the VACCELERATE platform [ - ].EU-COVAT-1_AGED (EudraCT: 2021-004526-29) is a multinational, phase II, randomized, adaptive protocol designed to evaluate the immunogenicity and reactogenicity of different COVID-19 vaccines in adults aged ≥75 years who have already been vaccinated against SARS-CoV-2. It is the first of 3 VACCELERATE phase II COVID-19 vaccine trials and focuses on the immunogenicity and reactogenicity of vaccines in older adults (≥75 years), addressing a critical demographic that is often underrepresented in vaccine studies [
].EU-COVAT-2_BOOSTAVAC (EudraCT: 2021-004889-35) is an international, multicenter, phase II, randomized, adaptive protocol designed to evaluate the need for, optimal timing of, and immunogenicity of administering a booster messenger RNA (mRNA) vaccination dose against SARS-CoV-2 in the general population (aged 18+ years) already vaccinated against SARS-CoV-2. This trial investigates booster vaccination strategies in the general adult population, providing insights into optimal timing and immunological outcomes for sustaining immunity [
].EU-COVPT-1_COVACC (EudraCT: 2021-004526-29) is a phase II, comparative, randomized trial designed to evaluate the impact of a reduced COVID-19 mRNA vaccination regimen on immunological responses and reactogenicity in pediatric patients with prior SARS-CoV-2 immunity. This trial investigates reduced-dose regimens in pediatric populations, contributing to the understanding of tailored vaccination approaches for children [
].Together, these trials represent a comprehensive investigation across key demographic groups—older adults, the general population, and children—using adaptive and multinational protocols that address the challenges of rapid vaccine development and deployment during pandemics.
Methods
Overview
The degree of commonality and interoperability among data was assessed across 3 levels: clinical trial protocols, metadata, and data systems and management.
Interoperability Among the VACCELERATE Clinical Trial Protocols and Master Protocol Template
The process began with generating a list of the main points and parameters of general interest (eg, type of trial, randomization, primary and secondary aims, and safety) across the 3 specific clinical trial protocols: EU-COVAT-1_AGED, EU-COVAT-2_BOOSTAVAC, and EU-COVPT-1_COVACC [
, ]. The second step involved comparing the 3 trial protocols to identify interoperable data points based on this list of key parameters. In step 3, the outcomes derived from the adult trial protocols were compared against the VACCELERATE adult master protocol ( ). This comparison focused on 4 core areas of the master protocol template’s core outcome set (COS): (1) immunogenicity, (2) safety, (3) efficacy/clinical/physiological, and (4) other outcomes of interest. Differences and similarities among the protocols were identified manually.
Metadata Interoperability of the VACCELERATE Clinical Trials
All data management (DM) processes, including the design of study databases for all 3 VACCELERATE clinical trials, were carried out by the Clinical Trials Centre Cologne (CTCC). The DM team at the CTCC developed the Study Database Specification (DB Spec) using an Excel (Microsoft Corporation) workbook. This workbook included metadata for defining the study database, case report forms (CRFs), edit checks, and a data dictionary. For each CRF study, the workbook contained an individual worksheet specifying database details such as field names, field response types, and formats. Additionally, the detailed metadata for all 3 vaccine trials were systematically developed using internal standards, terminology, and the general approach of Clinical Data Acquisition Standards Harmonization (CDASH) [
] for data collection, such as on electronic CRFs (eCRFs; ). For instance, questions were categorized into relevant Clinical Data Interchange Standards Consortium (CDISC) [ ] domains, along with prefixes for adverse events (AEs), concomitant medications, vital signs, protocol deviation, etc, while standard CDISC suffixes were applied to indicate question types.Caption | Variable | Length/format | Type | Explanation/code list |
Sample collected? | LBPERF | N/Ab | Category | YES=yes, NO=no |
Reason for sample not collected | LBREASND | 200 | Character | N/A |
Date of collection | LBDTC | N/A | Date | N/A |
Name of Lab | LBNAME | N/A | Combo box | N/A |
Test name | LBTEST | 30 | Label | Laboratory test names: blood urea nitrogen, creatinine, aspartate transaminase, alanine transaminase, total bilirubin, and alkaline phosphatase |
Result | LBORRES | 99999.99 | Float | Laboratory test result |
Unit | LBORRESU | N/A | Combo box | Laboratory test unit |
Reference range lower limit | LBORNRLO | 10 | Character | Laboratory test min |
Reference range upper limit | LBORNRHI | 10 | Character | Laboratory test max |
Export unit | LBEXUNIT | 10 | Character | Laboratory test export unit |
Clinically significant | LBCLSIG | N/A | Category | YES=yes, NO=no |
aCDASH: Clinical Data Acquisition Standards Harmonization.
bN/A: not applicable.
Although the databases were generated by a single center and coordinated by the same individual, it was valuable to evaluate the degree of standardization, commonality, and data interoperability among the DB Spec of all 3 trials. To achieve this, a manual comparison of the Excel DB Spec from all 3 VACCELERATE clinical trials was conducted (
). This comparison focused on the main core areas—immunogenicity, safety, efficacy/clinical/physiological—emphasizing the development of the master protocol template.The other variables were not considered due to differences in demographic details and treatment approaches among the 3 groups (older adults, adult, and pediatric). Metadata items such as data format, data dictionary name, response type, item text, required field indicator, and review groups were compared for consistency and discrepancies. Once identified, the next step was to analyze the rationale behind the observed consistencies and discrepancies among the metadata items.
Interoperability in Data Management and Systems
Interoperability in DM and systems was assessed through a questionnaire (
) distributed to the central laboratories: the University College Dublin (UCD) Centre for Experimental Pathogen Host Research (CEPHR, Dublin), Instituto de Salud Carlos III (ISCIII, Madrid), and the central biobank (UZA/UAntwerpen Biobank, Antwerp University Hospital). The questionnaire aimed to gather their experiences working with the VACCELERATE DM systems ( ). UZA/UAntwerpen provided an overview of samples received from local sites (shipment manifest) and transferred to the research laboratories (transport manifest) and the clinical trial center. CEPHR and ISCIII, as research laboratories, performed the specified analyses on samples received from the central biobank. After completing the analyses, the research laboratories sent the results to the CTCC.The questionnaire was designed to maximize accessibility and ease of use for site staff by categorizing the questions into broad areas while allowing flexibility through free-text response options: (1) laboratory procedures and documentation, (2) sample handling and storage data, (3) data transfer, and (4) general laboratory DM. Based on the information gathered through the questionnaire, a descriptive analysis was conducted to examine how each site handled, stored, and transferred data, as well as how their processes evolved within the VACCELERATE network. Furthermore, the analysis evaluated potential factors and new concepts required to harmonize and standardize the VACCELERATE DM systems.
The DM documents—external DM plans, edit check plans, CRF completion requirements, and DB Specs—developed for the clinical trials by the CTCC were reviewed. Guided by the questionnaire, a semistructured interview was conducted with the CTCC data manager.
Ethical Considerations
The 3 VACCELERATE COVID-19 vaccine trials (EU-COVAT-1_AGED, EU-COVAT-2_BOOSTAVAC, and EU-COVPT-1_COVACC) received favorable ethics opinions and approval from the national competent authorities in the participating countries (State Medicines Control Agency, Norwegian Medicines Agency, Spanish Agency of Medicines and Health Products, Health Products Regulatory Authority Ireland, and Paul-Ehrlich-Institut for the EU-COVAT-1 AGED trial; Health Products Regulatory Authority Ireland, Norwegian Medicines Agency, Swedish Medical Products Agency, The Spanish Agency of Medicines and Medical Devices, Federal Agency for Medicines and Health Products, and Paul-Ehrlich-Institut for the EU-COVAT-2 BOOSTAVAC trial; and Netherlands – Competent Authority, Norwegian Medicines Agency, Swedish Medical Products Agency, Hellenic National Organization for Medicines, and Paul-Ehrlich-Institut for the EU-COVPT-1 COVACC trial). Written informed consent was obtained from all participants before any trial procedures. These trials were conducted in accordance with the International Conference on Harmonization for Good Clinical Practice (ICH-GCP) guidelines and complied with the Declaration of Helsinki at all times. Furthermore, the provisions of the EU General Data Protection Regulation (GDPR) 2016/679 were strictly observed to ensure data protection. All investigational materials and data were pseudonymized in compliance with data protection regulations before undergoing scientific processing.
Results
Interoperability Among the Three Clinical Trial Protocols and Master Protocol
Our harmonization and standardization efforts began with a comparison of the 3 clinical trial protocols against the COS of the master protocol. The COS includes all appropriate measurements necessary to ensure standardization, efficiency, and uniformity in the conduct of clinical trials. Four main COS areas were adopted for phase II and III studies: (1) immunogenicity, (2) safety, (3) efficacy/clinical/physiological, and (4) other outcomes of interest.
The harmonization process resulted in the creation of
- , which illustrate the interoperability between the master protocol and the 3 clinical trial protocols based on the COS areas. A summary of the outcomes is presented in (see Table S1 in ). Humoral and cellular responses to COVID-19 vaccines were assessed in a similar manner across all trials, considering both vaccine and host characteristics ( ), although minor adjustments to time points were incorporated into the pediatric protocol.
Objective | Outcome | Method of measurement |
To evaluate humoral response to vaccination against wild-type SARS-CoV-2 and variants of interest—(1) short term: 7, 14, and 28 days after randomization and (2) long term: 3, 6, 9, and 12 months after randomization |
|
|
To evaluate the cellular immune response to vaccination against wild-type SARS-CoV-2 and variants of interest—(1) short term: 7, 14, and 28 days after randomization); and (2) long term: 3, 6, 9, and 12 months after randomization) |
|
|
aIg: immunoglobulin.
bTh: helper T cell.
cCD: cluster of differentiation.
The early detection of serious AEs (SAEs) can be vital for the protection of the participants and via the interoperability of safety data, an overall better general view of the patient’s clinical data could be achieved, resulting in the reduction of any potential risk of harms or adverse impacts and higher clinical safety. The developing safety profile in the trials is in a sufficiently similar motive and includes all the required information about reporting solicited, unsolicited, serious, and medically attended AEs, as shown in
. By contrast, the definitions of the COVID-19 disease (eg, asymptomatic infection, confirmed and reinfection cases, the severity of signs and symptoms) are not reported in the protocols, even though it is referred to as part of the trial dataset outcomes ( ). Finally, the profile of immunophenotyping of immune responses (at the cellular level) and genotypic testing of breakthrough infections (vaccine immunogenicity) were also determined in the protocols, enhancing interoperability among them ( ). It should be noted that we did not present herein the comparison of pediatric protocol versus pediatric master protocol because the former is currently under review process.Outcome | Method of measurement |
Solicited local and systemic AEsa for 7 days after each vaccination. | AEs are collected in a diary and will be entered into the electronic case report form (see all potential adverse reactions in Multimedia Appendix 2). |
Unsolicited AEs after each vaccination | AEs are collected during visits upon an open question by the investigator up to the end of the trial. |
SAEsb | SAEs collected throughout the study (from first vaccination until the end of the study) |
Medically attended adverse events | Medically attended adverse events from the day of vaccination until the end of the study. |
aAE: adverse event.
bSAE: serious adverse event.
Several parameters were considered to enable and enhance interoperability across the developed COVID-19 vaccine trials and master protocols (
and ). It is well-known that the lack of standardization in data processing exacerbates inhomogeneity in raw data. This often results in errors and significant time delays, reducing the reusability and interoperability of data within protocols. To address this, the various protocol formats were homogenized by manually identifying discrepancies and commonalities among selected parameters (see Tables S2-S13 in ). The selection of parameters was based on their relevance to the trials. Parameters present in only 1 protocol were excluded, while efforts were made to promote the consistent use of the same parameters across adult trials.Concealment (blinding) of treatment group allocation is not anticipated for the current trials, while both adult trials support vaccine booster strategies. All clinical trials assess reactogenicity symptoms and the immunogenicity (efficacy) profile of COVID-19 vaccines (following a fourth dose) to inform potential optimized vaccine strategies. The secondary objectives include evaluating immune response for up to 12 months (see Tables S2-S4 in
).The safety aims and the increase in antibody titer were found to be similar across all trials, as were the primary end points in the adult population protocols, due to the nature of the BOOSTAVAC trial (see Table S5 in
). By contrast, stratification/subpopulation, inclusion, and exclusion criteria among volunteers are directly influenced by the participants’ age and other trial-specific factors (eg, vaccine scheme, gender, immune status; see Table S6 in ). Additionally, patients’ insurance and consent forms are addressed in all protocols, and the sample size calculation was reported based on the specifics of each protocol (see Table S7 in ).Set of data and definition | Explanation | ||
Confirmed COVID-19 disease | |||
COVID-19 disease | Molecularly confirmed COVID-19 is defined as a positive SARS-CoV-2 viral RNA result using a PCRa-based or other molecular diagnostic test or positive antigen test. (See Multimedia Appendix 2 for the definition of confirmed COVID-19 disease.) | ||
Asymptomatic SARS-CoV-2 infection | |||
Asymptomatic SARS-CoV-2 infectionb | The same as above but without symptoms typical for/suspicious of COVID-19 | ||
Severity of COVID-19 disease | |||
Asymptomatic SARS-CoV-2 infection or COVID-19 disease (presence and severity of signs and symptoms of COVID-19 disease) | Signs and symptoms as measured by symptoms of infection with COVID-19c at first evaluation and at maximum severity: mild, moderate, and severe COVID-19 | ||
Severe COVID-19 disease | |||
COVID-19 disease fulfilling criteria for intensive care unit admittance | Clinical records | ||
All-cause mortality | |||
All-cause mortality: (1) at day 30 from randomization and (2) time from randomization | Clinical records | ||
Disease (or infection) duration | |||
Time to 2019-nCoV RT-PCRd negativity (days) | Negativity: 2 consecutive negative results (sampling interval of at least 24 hours) of the 2019-nCoV nucleic acids tests of respiratory pathogens | ||
Length (days) of hospital stay | Discharge standards: (1) normal body temperature for more than 3 days; (2) significant recovery from respiratory symptoms; (3) lung imaging showing obvious absorption and recovery of the acute exudative lesion; and (4) negativity of nucleic acids tests performed twice. | ||
Length of intensive care unit stay | N/Ae | ||
Reinfection | |||
Confirmed COVID-19 reinfection | Molecularly confirmed COVID-19 is defined as a positive SARS-CoV-2 viral RNA result using a PCR-based or another molecular diagnostic test or positive antigen test. (See Multimedia Appendix 2 for the definition of confirmed COVID-19 disease.) | ||
Time to reinfection | |||
Long COVID syndrome | WHO definitionb [ | , ]||
Multisystem inflammatory syndrome in children | MIS-Cf WHOg definition |
aPCR: polymerase chain reaction.
bSee [
].cPer case definition for moderate to severe/critical COVID-19.
dRT-PCR: reverse transcription polymerase chain reaction.
eN/A: not applicable.
fMIS-C: multisystem inflammatory syndrome in children associated with COVID-19.
gWHO: World Health Organization.
Set of data | Outcome and methods |
Immunophenotyping of immune responses to vaccination | Detailed immunophenotyping of immune responses, including metabolomics, proteomics, and transcriptomics |
Genotyping of breakthrough infections | Whole genome sequencing of isolated SARS-CoV-2 viral strains detected in breakthrough infections |
aPer the CoVacc trial protocol, Participants will be informed that leftover blood samples, remaining after all study-related testing is completed, may be stored indefinitely for potential future research purposes (eg, exploratory immunology). This may include genotypic testing of genetic polymorphisms relevant to vaccine immunogenicity. Participants will have the option to decide whether they permit the future use of any leftover samples. With the participants’ informed consent, any remaining cells and plasma will be frozen indefinitely for future analyses related to COVID-19 and other coronavirus-related diseases or vaccine-related responses. If a participant chooses not to allow this, all leftover samples belonging to that participant will be discarded at the end of the trial.

We also conducted a comparison of the 2 adult trials with the master protocol (see Tables S8-S13 in
). The primary aim of the adult trials is to evaluate the immunogenicity and reactogenicity (safety) of vaccine booster strategies and homologous mRNA vaccination for COVID-19 at different time intervals (see Tables S8 and S9 in ).Both the adult and master protocols include the relevant subprotocols, cohorts, and descriptions of the randomization process, while blinding is not anticipated in the adult trials. All protocols support phase II trials and share similar primary objectives. The primary and secondary objectives/end points (including exploratory ones) constitute a nearly identical core dataset. The secondary trial objectives for immunogenicity and reactogenicity focus on the immune response and associated signs and symptoms over up to 12 months (see Tables S9 and S10 in
).The reporting periods for unsolicited and solicited AEs, as well as the rates of SAEs, are identical across the adult and master protocols. However, discrepancies in inclusion and exclusion criteria are directly influenced by the trial participants (see Tables S11 and S12 in
). The treatment duration and follow-up period for the studies are identical in both the adult trials and the master protocol, extending up to 12 months. All clinical trials are conducted in accordance with the ICH-GCP, including the use of appropriate consent forms and patient insurance, based on the allocated treatment group (see Table S13 in ).
Metadata Interoperability
The points of consistency and discrepancies across the main core areas (eg, immunogenicity, safety, efficacy, clinical, and physiological aspects) among the metadata items of the 3 VACCELERATE clinical trials are presented in
- and . The noncommonalities identified within the CRFs were attributed to differences in the 3 populations (aged 75+, adult, and pediatric), variations in protocol design, and differing vaccination patterns ( and ). Additionally, differences in the coding and labeling of variables, as well as in the item groups, were identified. These discrepancies required clarification to enhance interoperability.Case report form | EU-COVAT-1_AGED | EU-COVAT-2_BOOSTAVAC | EU-COVPT-1_CoVacc | Commonality and noncommonality |
Saliva sampling (fsaliva) | Not detected | Collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up | Collection status with time point and aliquot label before vaccination and on follow-up |
|
Serum sampling (fserum) | Collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up | Collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up | Not detected |
|
LiHepa WBb sampling (flihep) | Collection status with time point and aliquot label before vaccination and on immune response evaluation | Collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up | Collection status with time point and aliquot label before vaccination and on follow-up |
|
EDTAc and PBMCsd sampling (fedta) | EDTA collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up. PBMC only before vaccination | EDTA and PBMC collection status with time point and aliquot label before vaccination, on immune response evaluation, and on follow-up | EDTA collection status with time point and aliquot label before vaccination and on follow-up. PBMC for selected sites only on follow-up |
|
aLiHep: lithium heparin.
bWB: whole blood.
cEDTA: ethylenediaminetetraacetic acid.
dPBMC: peripheral blood mononuclear cell.
Case report form | EU-COVAT-1_AGED | EU-COVAT-2_BOOSTAVAC | EU-COVPT-1_CoVacc | Commonality and noncommonality |
Adverse event (faesae) | Documentation forms available throughout the enclosure complemented by an optional unscheduled visit | Documentation forms available throughout the enclosure complemented by an optional unscheduled visit | Documentation forms available throughout the enclosure complemented by an optional unscheduled visit |
|
MHb (fmedhist) | MH documented before vaccination | MH documented before vaccination | MH documented before vaccination |
|
Concomitant medication (fcomed) | Documentation forms available throughout the enclosure | Documentation forms available throughout the enclosure | Documentation forms available throughout the enclosure |
|
PDd (fpd) | PD documented on completion | PD documented on completion | PD documented on completion |
|
Inclusion/exclusion (finex) | Inclusion/exclusion documented before enclosure | Inclusion/exclusion documented before enclosure | Inclusion/exclusion documented before enclosure |
|
Demography (fdemo) | Demography documented before vaccination | Demography documented before vaccination | Demography documented before vaccination |
|
Urine pregnancy test (fpregt) | Not detected | Collection status documented before vaccination and on an unscheduled visit | Not detected |
|
aMedDRA: Medical Dictionary for Regulatory Activities.
bMH: medical history.
cATC: Anatomical Therapeutic Chemical.
dPD: protocol deviation.
eN/A: not applicable.
Case report form | EU-COVAT-1_AGED | EU-COVAT-2_BOOSTAVAC | EU-COVPT-1_CoVacc | Commonality and noncommonality |
Patient diary (fdiary) | Diary collected on immune response evaluation | Diary collected on immune response evaluation | Diary administration on the second vaccination and first follow-up |
|
Vaccination (fvacc) | Vaccination documented on baseline visit | Vaccination documented on time points based on the trial arm | Vaccination documented on time points based on the trial arm |
|
Nasopharyngeal swab collection (fswab) | Not detected | Not detected | Collection status documented before vaccination |
|
PEb (fexam) | PE documented on screening, baseline, and immune response evaluation (in case of severe adverse event) | PE documented on screening, baseline, follow-up, and immune response evaluation | PE documented on screening, baseline, and follow-up |
|
VSc (fvitals) | VS documented on screening, baseline, immune response evaluation, and in case of unscheduled visit | VS documented on baseline, follow-up, and in case of unscheduled visit | VS documented on screening and on any pre- and postvaccination visits |
|
Randomization (frando) | Randomization documented on enrollment | Randomization documented on enrollment | Randomization documented on enrollment |
|
aN/A: not applicable.
bPE: physical examination.
cVS: vital sign

Interoperability in Data Management and Systems
VACCELERATE received a significant boost through the selection of the CTCC as the sole DM provider, with system database development coordinated by a single individual. This eliminated the need for coordination across multiple trial units. For the CTCC, the standardized format adopted within the VACCELERATE project utilized a data system featuring an installation of the Anju [
] (formerly OmniComm) TrialMaster clinical database management system, accompanied by metadata.Data transfer using tried-and-tested methods common to all sites (spreadsheet format) enabled a quick and relatively secure exchange, which proved highly effective. This process was further facilitated by the implementation of the TrialMaster system, which generated templates that could be adapted for use with spreadsheets. The responses received from the laboratories and CTCC are shown in
and , while the questionnaire results are presented in Table S14 in .
Discussion
Principal Findings
To the best of our knowledge, this is the first report to assess the degree of data commonality and interoperability between vaccine clinical trial protocols, the master protocol template, metadata, and data systems and management. In this study, we considered multiple parameters for the first time to enhance and improve interoperability across the developed COVID-19 vaccine clinical trial and master protocols, based on the selected COS areas: (1) immunogenicity, (2) safety, (3) efficacy/clinical/physiological outcomes, and (4) other outcomes of interest. This is significant because the lack of standardization in raw data processing is a well-known issue that exacerbates data inhomogeneity. This often results in errors and significant time delays, reducing the reusability and interoperability of data within protocols. To address this, the various protocol formats were homogenized by manually identifying discrepancies and commonalities among the selected parameters.
The database systems prepared by the CTCC already demonstrate substantial use of CDASH, a key enabler for the implementation of other CDISC standards. CDASH plays a crucial role in supporting the Study Data Tabulation Model (SDTM) [
]. Transforming study data to SDTM could also prove beneficial in the context of data sharing, especially as data interoperability becomes increasingly important. The adoption of CDASH standards [ ] within the VACCELERATE DM systems establishes an environment conducive to generating FAIR data throughout the life cycle of the 3 trials. This approach ensures that the data will be available for collaboration and future use.Pooling the metadata from the VACCELERATE studies would create a valuable metadata resource, independent of the underlying data. This resource could potentially be made accessible (eg, via the VACCELERATE official website and the ECRIN data-sharing repository for COVID-19 trials, EOSC-Life) [
], offering a unique opportunity for other sponsors to design related studies, particularly those utilizing the VACCELERATE infrastructure. The VACCELERATE metadata, as a shared resource, could play a pivotal role in standardizing data items and the controlled terminologies used within categorized questions, thus contributing to greater harmonization and interoperability in future research.Education and training are critical for the successful implementation of FAIR DM in clinical data. Therefore, additional CDISC training for DM staff could be beneficial, both within VACCELERATE and more broadly, to support the adoption of data standards. Metadata harmonization should be completed before database development.
Although the DM staff at the CTCC are highly experienced and have already implemented international standards, there is a lack of planning for long-term data storage and reuse. This occurred because it was not anticipated during the database setup phase and due to the lack of a long-term data sharing budget, either linked to VACCELERATE itself or allocated to a data-sharing repository. Our report suggests that metadata harmonization planning should begin before database development and be supported by requirements (eg, CDASH) and oversight throughout the entire database setup phase, as well as any subsequent amendments. All subcontractors should be informed and included in this planning process. Furthermore, generating metadata and SDTM data for secondary use can be done under less time pressure compared with other aspects of standards implementation, such as the initial development of eCRFs. Conversely, while the final application of both is likely to occur at the end of the study, this does not mean the work must be confined to that time frame. Transformations can be planned and tested at any stage, using dummy data if necessary, as can the generation of metadata.
VACCELERATE highlights some of the challenges in making data standards a practical reality. At the same time, it provides opportunities to address these challenges, developing methods that can help embed data standards within noncommercial clinical research.
VACCELERATE received a substantial boost from the selection of the CTCC as the sole DM provider, with system database development coordinated by a single individual. This approach eliminated the need for coordination among different trial units, resulting in a high degree of uniformity within the system. The DM systems used in VACCELERATE are relatively new and have the potential to perform well with the limited number of sites currently using them. To effectively test these DM systems, it is necessary to have multiple sites utilizing them, with additional trials planned for later stages of the rollout. While the central Electronic Data Capture (EDC) system (TrialMaster) was used to manage sample processing data, there were concerns from the laboratory sites and the central biobank regarding the time required to enter sample data, resolve errors, and reconcile site datasets with the EDC. Transferring data using tried-and-tested methods common to all sites (spreadsheet format) enabled a harmonized, rapid, and relatively secure exchange. Before a scanning system was adopted, allowing sites to upload processing data directly into the EDC could significantly reduce the time constraints associated with manual data entry.
As only 3 clinical studies were funded under the VACCELERATE project, these results were not applied to any newly registered studies. However, as the objective of the VACCELERATE project has now been extended to other vaccine-preventable infectious diseases with pandemic potential, there will be a broader opportunity in the near future to implement the results presented here. Nevertheless, these trials provide valuable insights and a scalable framework for future vaccine studies, especially in the context of epidemics and pandemic preparedness. The methodology and protocols of these trials address critical questions regarding vaccine immunogenicity, reactogenicity, and optimal dosing strategies across diverse populations, including older adults (≥75 years), the general adult population, and pediatric patients. By using adaptive, multinational protocols, these trials simulate the real-world conditions of pandemic response, where speed, adaptability, and inclusivity are critical.
The methodologies from these trials are highly generalizable and provide a robust blueprint for future vaccine clinical studies within research networks. The inclusion of underrepresented populations, such as older adults and children, ensures that future vaccine trials will generate comprehensive, inclusive data applicable to all demographics. Additionally, the adaptive design of the protocols used in these studies allows for future modifications in response to emerging data and infectious diseases—a critical feature for addressing the dynamic nature of pandemics and evolving infectious threats.
Beyond the outcomes of individual trials, these studies contribute to the establishment and strengthening of international vaccine research networks. The collaborative, multicentric nature of the VACCELERATE network demonstrates the feasibility of harmonized vaccine research across borders, promoting shared protocols, centralized data analysis, and the dissemination of findings. This approach not only accelerates the vaccine development process but also strengthens global preparedness by creating investigator-initiated clinical research infrastructure and expertise that can be rapidly mobilized for emerging epidemics.
Moreover, the lessons learned from these trials can inform the design and implementation of studies for vaccines targeting other infectious diseases with pandemic potential, such as influenza or novel zoonotic pathogens. The emphasis on immunogenicity, dosing strategies, and booster efficacy can be adapted to various pathogens, ensuring that the evidence generated remains both relevant and actionable.
In summary, the VACCELERATE trials serve as a cornerstone for advancing and standardizing vaccine research methodologies. They underscore the importance of inclusivity, adaptability, and international collaboration, setting a precedent for future studies aimed at mitigating the impact of pandemics and safeguarding global public health.
Limitations
This paper presents a novel approach to evaluating the degree of data commonality and interoperability between vaccine clinical trial protocols, the study master protocol template, metadata, and data systems and management. It did not benefit from preexisting methodologies, and its scope may therefore be limited by the specific cases examined. Nonetheless, it is anticipated to provide valuable insights that contribute to the improvement of clinical research data sharing and reuse.
Conclusions
This study demonstrates that using master protocols can significantly enhance trial operational efficiency and data interoperability when similar infrastructure and DM procedures are consistently adopted across multiple trials. To achieve data interoperability while preserving meaning in interpretation and analysis, shared data should be structured, described, formatted, and stored using widely recognized data and metadata standards.
Acknowledgments
The VACCELERATE project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme (grant 101037867). For more information, please refer to the official website of the VACCELERATE network [
]. The services of the “BioBank Antwerp,” Antwerp, Belgium (ID: BE 71030031000, Biobank Antwerp [BB190007], BBMR-ERIC, Belgian [BIORESOURCE]) are used for the storage of generated samples and aliquots. Special thanks to ECRIN’s Communication Officer, Alexandra KUSTER, for proofreading the article.Generative artificial intelligence is not used by the authors to generate any content (text, tables, and figures) for this manuscript.
Data Availability
The datasets used and analyzed during this study are available from the corresponding author upon reasonable request.
Conflicts of Interest
OAC reports grants or contracts from iMi, iHi, DFG, BMBF, Cidara, DZIF, EU-DG RTD, F2G, Gilead, MedPace, MSD, Mundipharma, Octapharma, Pfizer, Scynexis; consulting fees from Abbvie, AiCuris, Basilea, Biocon, Boston Strategic Partners, Cidara, Elion Therapeutics, Gilead, GSK, IQVIA, Janssen, Matinas, MedPace, Menarini, Melinta, Molecular Partners, MSG-ERC, Mundipharma, Noxxon, Octapharma, Pardes, Partner Therapeutics, Pfizer, PSI, Scynexis, Seres, Seqirus, Shionogi, The Prime Meridian Group; speaker and lecture honoraria from Abbott, Abbvie, Akademie für Infektionsmedizin, Al-Jazeera Pharmaceuticals/Hikma, amedes, AstraZeneca, Deutscher Ärzteverlag, Gilead, GSK, Grupo Biotoscana/United Medical/Knight, Ipsen Pharma, Medscape/WebMD, MedUpdate, MSD, Moderna, Mundipharma, Noscendo, Paul-Martini-Stiftung, Pfizer, Sandoz, Seqirus, Shionogi, streamedup!, Touch Independent, Vitis; payment for expert testimony Cidara; and participation on a DRC, DSMB, DMC, Advisory Board for AstraZeneca, Cidara, IQVIA, Janssen, MedPace, Melinta, PSI, Pulmocide, Vedanta Biosciences. All other authors report no conflicting interests.
Multimedia Appendix 2
VACCELERATE central laboratories: Data Management Questionnaire.
DOCX File , 51 KBReferences
- Interoperability in healthcare. Healthcare Information and Management Systems Society. URL: https://www.himss.org/resources/interoperability-healthcare#Part1 [accessed 2023-06-05]
- Wilkinson MD, Dumontier M, Aalbersberg IJJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. Mar 15, 2016;3:160018. [CrossRef] [Medline]
- Ohmann C, Banzi R, Canham S, Battaglia S, Matei M, Ariyo C, et al. Sharing and reuse of individual participant data from clinical trials: principles and recommendations. BMJ Open. Dec 14, 2017;7(12):e018647. [FREE Full text] [CrossRef] [Medline]
- VACCELERATE - European Corona Vaccine Trial Accelerator platform. VACCELERATE. URL: https://vaccelerate.eu/ [accessed 2023-10-26]
- Salmanton-García J, Stewart FA, Heringer S, Koniordou M, Álvarez-Barco E, Argyropoulos CD, et al. VACCELERATE volunteer registry: a European study participant database to facilitate clinical trial enrolment. Vaccine. Jul 29, 2022;40(31):4090-4097. [FREE Full text] [CrossRef] [Medline]
- Salmanton-García J, Wipfler P, Valle-Simón P, Merakou C, Kopsidas I, Bethe U, et al. VACCELERATE Site Network: Real-time definition of clinical study capacity in Europe. Vaccine. Jun 2023;41(26):3915-3922. [CrossRef]
- Argyropoulos CD, Leckler J, Salmanton-García J, Constantinou M, Alexandrou A, Themistocleous S, et al. Enhancing public health communication regarding vaccine trials: design and development of the Pan-European VACCELERATE toolkit. JMIR Public Health Surveill. Apr 03, 2023;9:e44491. [CrossRef] [Medline]
- Poulimeneas D, Koniordou M, Kousi D, Merakou C, Kopsidas I, Tsopela GC, et al. VACCELERATE Consortium. The challenges of vaccine trial participation among underserved and hard-to-reach communities: an internal expert consultation of the VACCELERATE consortium. Vaccines (Basel). Nov 29, 2023;11(12):1784. [FREE Full text] [CrossRef] [Medline]
- Clinical studies. VACCELERATE. URL: https://vaccelerate.eu/clinical-studies-2/ [accessed 2023-10-26]
- Neuhann JM, Stemler J, Carcas A, Frías-Iniesta J, Bethe U, Heringer S, et al. A multinational, phase 2, randomised, adaptive protocol to evaluate immunogenicity and reactogenicity of different COVID-19 vaccines in adults ≥75 already vaccinated against SARS-CoV-2 (EU-COVAT-1-AGED): a trial conducted within the VACCELERATE network. Trials. Oct 08, 2022;23(1):865. [CrossRef] [Medline]
- Salmanton-García J, Wipfler P, Leckler J, Nauclér P, Mallon PW, Bruijning-Verhagen PC, et al. VACCELERATE Consortium. Predicting the next pandemic: VACCELERATE ranking of the WorldHealth Organization's Blueprint forAction toPreventEpidemics. Travel Med Infect Dis. Jan 2024;57:102676. [FREE Full text] [CrossRef] [Medline]
- Clinical Data Interchange Standards Consortium - Clinical Data Acquisition Standards Harmonisation (CDISC CDASH). Clinical Data Interchange Standards Consortium. 2023. URL: https://www.cdisc.org/standards/foundational/cdash [accessed 2023-10-26]
- Clear data. Clear impact. Clinical Data Interchange Standards Consortium (CDISC). URL: https://www.cdisc.org/ [accessed 2023-10-26]
- Soriano JB, Murthy S, Marshall JC, Relan P, Diaz JV, WHO Clinical Case Definition Working Group on Post-COVID-19 Condition. A clinical case definition of post-COVID-19 condition by a Delphi consensus. Lancet Infect Dis. Apr 2022;22(4):e102-e107. [FREE Full text] [CrossRef] [Medline]
- World Health Organization (WHO). A clinical case definition of post COVID-19 condition by a Delphi consensus. WHO. Oct 6, 2021. URL: https://www.who.int/publications/i/item/WHO-2019-nCoV-Post_COVID-19_condition-Clinical_case_definition-2021.1 [accessed 2024-10-28]
- Anju Software. 2022. URL: https://www.anjusoftware.com/ [accessed 2023-10-26]
- Clinical Data Interchange Standards Consortium - Study Data Tabulation Model (CDISC SDTM). Clinical Data Interchange Standards Consortium (CDISC). 2021. URL: https://www.cdisc.org/standards/foundational/sdtm [accessed 2023-10-26]
- Standards. Clinical Data Interchange Standards Consortium. URL: https://www.cdisc.org/standards [accessed 2023-10-26]
- European Clinical Research Infrastructure Network. URL: https://crdsr.ecrin.org/ [accessed 2024-10-26]
Abbreviations
AE: adverse event |
CDASH: Clinical Data Acquisition Standards Harmonization |
CDISC: Clinical Data Interchange Standards Consortium |
CEPHR: Centre for Experimental Pathogen Host Research |
COS: core outcome set |
CRF: case report form |
CTCC: Clinical Trials Centre Cologne |
DB spec: Study Database Specification |
DM: data management |
eCRF: electronic case report form |
EDC: Electronic Data Capture |
FAIR: Findable, Accessible, Interoperable, and Reusable |
GDPR: General Data Protection Regulation |
ICH-GCP: International Conference on Harmonization for Good Clinical Practice |
ISCIII: Instituto de Salud Carlos III |
mRNA: messenger RNA |
SAE: serious adverse event |
SDTM: Study Data Tabulation Model |
UCD: University College Dublin |
Edited by C Lovis; submitted 21.08.24; peer-reviewed by T Milotic, M Mantri, S Sung; comments to author 06.10.24; revised version received 29.10.24; accepted 23.01.25; published 07.03.25.
Copyright©Salma Malik, Zoi Pana Dorothea, Christos D Argyropoulos, Sophia Themistocleous, Alan J Macken, Olena Valdenmaiier, Frank Scheckenbach, Elena Bardach, Andrea Pfeiffer, Katherine Loens, Jordi Cano Ochando, Oliver A Cornely, Jacques Demotes-Mainard, Sergio Contrino, Gerd Felder. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 07.03.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.