<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.0 20040830//EN" "http://dtd.nlm.nih.gov/publishing/2.0/journalpublishing.dtd">
<?covid-19-tdm?>
<article xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="2.0">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JMI</journal-id>
      <journal-id journal-id-type="nlm-ta">JMIR Med Inform</journal-id>
      <journal-title>JMIR Medical Informatics</journal-title>
      <issn pub-type="epub">2291-9694</issn>
      <publisher>
        <publisher-name>JMIR Publications</publisher-name>
        <publisher-loc>Toronto, Canada</publisher-loc>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">v9i11e31510</article-id>
      <article-id pub-id-type="pmid">34596570</article-id>
      <article-id pub-id-type="doi">10.2196/31510</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Original Paper</subject>
        </subj-group>
        <subj-group subj-group-type="article-type">
          <subject>Original Paper</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Global Research on Coronaviruses: Metadata-Based Analysis for Public Health Policies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="editor">
          <name>
            <surname>Eysenbach</surname>
            <given-names>Gunther</given-names>
          </name>
        </contrib>
      </contrib-group>
      <contrib-group>
        <contrib contrib-type="reviewer">
          <name>
            <surname>Blatch-Jones</surname>
            <given-names>Amanda</given-names>
          </name>
        </contrib>
      </contrib-group>
      <contrib-group>
        <contrib id="contrib1" contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Warin</surname>
            <given-names>Thierry</given-names>
          </name>
          <degrees>DPhil</degrees>
          <xref rid="aff1" ref-type="aff">1</xref>
          <address>
            <institution>HEC Montréal</institution>
            <addr-line>3000, chemin de la Côte-Sainte-Catherine</addr-line>
            <addr-line>Montréal, QC, H3T 2A7</addr-line>
            <country>Canada</country>
            <phone>1 5146082106</phone>
            <email>thierry.warin@hec.ca</email>
          </address>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0002-5921-3428</ext-link>
        </contrib>
      </contrib-group>
      <aff id="aff1">
        <label>1</label>
        <institution>HEC Montréal</institution>
        <addr-line>Montréal, QC</addr-line>
        <country>Canada</country>
      </aff>
      <author-notes>
        <corresp>Corresponding Author: Thierry Warin <email>thierry.warin@hec.ca</email></corresp>
      </author-notes>
      <pub-date pub-type="collection">
        <month>11</month>
        <year>2021</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>30</day>
        <month>11</month>
        <year>2021</year>
      </pub-date>
      <volume>9</volume>
      <issue>11</issue>
      <elocation-id>e31510</elocation-id>
      <history>
        <date date-type="received">
          <day>23</day>
          <month>6</month>
          <year>2021</year>
        </date>
        <date date-type="rev-request">
          <day>16</day>
          <month>7</month>
          <year>2021</year>
        </date>
        <date date-type="rev-recd">
          <day>13</day>
          <month>8</month>
          <year>2021</year>
        </date>
        <date date-type="accepted">
          <day>27</day>
          <month>9</month>
          <year>2021</year>
        </date>
      </history>
      <copyright-statement>©Thierry Warin. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 30.11.2021.</copyright-statement>
      <copyright-year>2021</copyright-year>
      <license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
        <p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.</p>
      </license>
      <self-uri xlink:href="https://medinform.jmir.org/2021/11/e31510" xlink:type="simple"/>
      <abstract>
        <sec sec-type="background">
          <title>Background</title>
          <p>Within the context of the COVID-19 pandemic, this paper suggests a data science strategy for analyzing global research on coronaviruses. The application of reproducible research principles founded on text-as-data information, open science, the dissemination of scientific data, and easy access to scientific production may aid public health in the fight against the virus.</p>
        </sec>
        <sec sec-type="objective">
          <title>Objective</title>
          <p>The primary goal of this paper was to use global research on coronaviruses to identify critical elements that can help inform public health policy decisions. We present a data science framework to assist policy makers in implementing cutting-edge data science techniques for the purpose of developing evidence-based public health policies.</p>
        </sec>
        <sec sec-type="methods">
          <title>Methods</title>
          <p>We used the EpiBibR (epidemiology-based bibliography for R) package to gain access to coronavirus research documents worldwide (N=121,231) and their associated metadata. To analyze these data, we first employed a theoretical framework to group the findings into three categories: conceptual, intellectual, and social. Second, we mapped the results of our analysis in these three dimensions using machine learning techniques (ie, natural language processing) and social network analysis.</p>
        </sec>
        <sec sec-type="results">
          <title>Results</title>
          <p>Our findings, firstly, were methodological in nature. They demonstrated the potential for the proposed data science framework to be applied to public health policies. Additionally, our findings indicated that the United States and China were the primary contributors to global coronavirus research during the study period. They also demonstrated that India and Europe were significant contributors, albeit in a secondary position. University collaborations in this domain were strong between the United States, Canada, and the United Kingdom, confirming the country-level findings.</p>
        </sec>
        <sec sec-type="conclusions">
          <title>Conclusions</title>
          <p>Our findings argue for a data-driven approach to public health policy, particularly when efficient and relevant research is required. Text mining techniques can assist policy makers in calculating evidence-based indices and informing their decision-making process regarding specific actions necessary for effective health responses.</p>
        </sec>
      </abstract>
      <kwd-group>
        <kwd>COVID-19</kwd>
        <kwd>SARS-CoV-2</kwd>
        <kwd>natural language processing</kwd>
        <kwd>coronavirus</kwd>
        <kwd>unstructured data</kwd>
        <kwd>data science</kwd>
        <kwd>health 4.0</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="introduction">
      <title>Introduction</title>
      <p>Vaccines against the original SARS-CoV-2 strain have been developed. Public health policies are currently engaged in a battle against new waves of contamination and variants. The political logic is straightforward: the larger the population that has been immunized, the lower the probability of variants. Among their tools, they now have access to new data science tools (eg, machine learning–based analyses and big data, some of which are unstructured) and technological resources, such as high-performance computing platforms. Data science approaches are advantageous, not only for vaccine discovery but also for public health policies.</p>
      <p>In this action research–type paper, we use data science techniques to collect and analyze real-time global scientific data. The objective is to examine how data science can be used to improve public health policies. Indeed, with these new tools and data sources, policy makers can (1) conduct the most accurate diagnosis of the current state of knowledge regarding SARS-CoV-2 and (2) act by assisting leading collaborative teams. As a result, decision-making processes at the national and international levels must be optimized. We propose a data science protocol in this paper that could be quickly implemented, for example, with the support of the World Health Organization (WHO), in order to optimize research collaboration across countries, universities, and researchers.</p>
      <p>To our knowledge, this is the first paper describing a data science approach for better informing health policy decisions about coronaviruses based on global research.</p>
      <p>One of the lessons learned from the SARS-CoV-2 outbreak is the critical nature of public policy responses. Health policy makers must be aware of global research activity. They can, for example, use this information to support some research groups that are closer to developing a vaccine. Another critical feature is that they have real-time access to information, which improves response efficiency. The COVID-19 outbreak exemplifies the critical need for more accurate and timely information. COVID-19 was first identified in late 2019 in Wuhan, China, and some studies were already using data science as a methodology [<xref ref-type="bibr" rid="ref1">1</xref>]. On January 7, 2020, a novel coronavirus (2019-nCoV) was isolated. Since 2000, two coronavirus outbreaks have occurred: one caused by SARS-CoV and another by the Middle East respiratory syndrome coronavirus (MERS-CoV) [<xref ref-type="bibr" rid="ref2">2</xref>]. Thus, time is critical.</p>
      <p>Another critical factor is having access to the appropriate information. Governments have information about their research groups and their performance based on traditional data collection methods, such as annual reports. However, very few of the world’s close to 200 countries possess this information. Primary sources, on the other hand, are available in the form of research publications. It would first require leveraging all of the metadata contained in these publications. Nowadays, this is possible through the use of natural language processing (NLP) techniques. Second, it would necessitate the development of algorithms to visualize the researchers, countries, and concept networks extracted from these publications. This paper illustrates the use of NLP and social network analysis (SNA) to map the aforementioned networks.</p>
      <p>Therefore, our primary contribution is about the utility of a data science–based analysis of global coronavirus research for public health policies. We believe that a detailed map of global research on all coronaviruses is critical. Health care organizations may benefit from such a map. With today’s technologies, this comprehensive mapping can be performed in real time, thanks to a code-based pipeline as illustrated in this paper, allowing for the detection of potential outbreaks of new variants and providing the information necessary to develop subsequent vaccines.</p>
      <p>Secondly, a methodological contribution is made. Indeed, we employ metadata in order to conduct an algorithmic review of pertinent literature. In the Methods section, we go into detail about the methodology. It is, in our opinion, a necessary methodological complement to qualitative reviews and meta-analyses.</p>
      <p>In short, the primary objective of this paper is to use global research on coronaviruses to identify critical elements that can help inform public health policy decisions. By its very nature, our research question is inscribed in action research. It is methodological and exploratory: in the context of COVID-19 and our technological development stage, how can public health policy makers benefit from machine learning techniques (ie, NLP and SNA) to assist them in their decision making?</p>
    </sec>
    <sec sec-type="methods">
      <title>Methods</title>
      <sec>
        <title>Overview</title>
        <p>A metadata analysis entails accumulating more articles than a traditional systematic literature review (SLR) and using algorithms to filter and sort the initial data set. We approach this problem in two ways: first, by extracting text-as-data information via NLP techniques, and second, by visualizing potential collaboration networks via SNA.</p>
        <p>Combining these two methodological approaches is consistent with Cochrane Reviews’ principle of generating new knowledge through primary research. The primary objective of Cochrane Reviews is to provide information to individuals making health or health care decisions. New research should be designed or commissioned only if it does not duplicate previously conducted research in an unnecessary manner [<xref ref-type="bibr" rid="ref3">3</xref>]. As a result, an SLR is advantageous prior to initiating any new research, for example, by highlighting specific knowledge gaps or biases [<xref ref-type="bibr" rid="ref4">4</xref>].</p>
        <p>We were inspired by the guidelines for systematic reviews because we used a large data set of research documents. However, our distinction is that our objective was not to contribute to the development of a theoretical framework by identifying distinct research streams (ie, an academic objective) but to propose an example of applied research, more precisely action research.</p>
        <p>All of these considerations were particularly pertinent during the COVID-19 period. Thus, the methodology presented in this paper was focused on using the largest data set possible and highlighting some of the mappings that were technologically possible via NLP and SNA.</p>
        <p>We formulated two hypotheses about public health policies. First, policies require information about coronavirus research findings. This can assist governments and their various industrial partners in developing pandemic-related solutions. Second, they must be capable of supporting the ecosystems that generate these groundbreaking research findings. During a pandemic—but not exclusively—decision-making processes must be optimized to expedite the production of solutions based on research findings. This means that policy makers must be aware of the characteristics that contribute to the production of these research findings. Individuals (ie, single authors), groups of researchers (ie, multiauthored documents), interuniversity collaborations, or global collaborations are all examples of these characteristics.</p>
        <p>The years 2020 and 2021 logically demonstrate exponential growth in research output (<xref rid="figure1" ref-type="fig">Figure 1</xref>).</p>
        <fig id="figure1" position="float">
          <label>Figure 1</label>
          <caption>
            <p>Document count over time. The 2021 document count ended on May 4.</p>
          </caption>
          <graphic xlink:href="medinform_v9i11e31510_fig1.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
        </fig>
      </sec>
      <sec>
        <title>Protocol Development</title>
        <p>As previously stated, our research question is methodological in nature and exploratory in scope. It is about whether and how public health policy makers can benefit from machine learning techniques to inform their decision-making process in the COVID-19 context and at our technological development stage.</p>
        <p>We proposed a four-stage protocol: (1) the first stage required access to global research on coronaviruses, (2) the second stage used NLP techniques to convert the text from published research documents into data, (3) the third stage employed conventional statistical techniques, and (4) the fourth stage used SNA to identify key concepts and collaborators or universities. Interest in SNA has grown in recent years, despite the fact that it is a mathematical field that dates all the way back to the mid-1930s. SNA is predicated on the premise that the social contexts of actions matter [<xref ref-type="bibr" rid="ref5">5</xref>]. When applied to epidemiology, this means that social contexts matter in coronavirus research, which policy makers should consider.</p>
        <p>Each of these four stages would be computer intensive for a researcher but not for a national or international organization. We compiled the algorithms on a dedicated server built with an AMD Ryzen Threadripper processor (Advanced Micro Devices) with 32 cores (64 threads) at 3.2 GHz clock speed, with 128 GB memory.</p>
        <p>The first stage involved the collection of data on coronavirus research conducted globally. In the fall of 2019, precisely zero scientists were investigating COVID-19, which was unknown at the time. SARS-CoV-2, the coronavirus that causes the disease, had not yet been identified or named. By the end of March 2020, the disease had spread to over 170 countries and sickened over 750,000 people, and thousands of researchers had shifted their focus away from whatever intellectual challenges had previously piqued their interest and toward the pandemic [<xref ref-type="bibr" rid="ref6">6</xref>].</p>
        <p>In this context, our data collection relied on the EpiBibR (epidemiology-based bibliography for R) package available on GitHub [<xref ref-type="bibr" rid="ref7">7</xref>]. EpiBibR is a free resource based on open science principles (ie, reproducible research, open data, and open code). The package proposes 22 embedded metadata features and provides access to more than 120,000 references (N=121,231) from July 1, 1949, to May 4, 2021. Being a data package, it provides easy access to the data in order to be integrated efficiently in almost any researcher’s pipeline through the R language [<xref ref-type="bibr" rid="ref8">8</xref>]. The references were collected via PubMed, a free resource that is developed and maintained by the National Center for Biotechnology Information at the US National Library of Medicine, located at the National Institutes of Health. PubMed includes over 30 million citations from biomedical literature. More specifically, the EpiBibR package adopted the procedure used by the Allen Institute for AI (artificial intelligence) for their COVID-19 Open Research Dataset (CORD-19) project. EpiBibR applies a similar query on PubMed with the following keywords: “COVID-19” OR “coronavirus” OR “corona virus” OR “2019-nCoV” OR “SARS-CoV” OR “MERS-CoV” OR “severe acute respiratory syndrome” OR “Middle East respiratory syndrome” [<xref ref-type="bibr" rid="ref9">9</xref>]. To the best of our knowledge, the EpiBibR package is the only data package in R providing access to the global research on coronaviruses. This package is updated daily allowing us to build a real-time analysis. It is also the only one of this size. We were able to generate a data set of research documents as of May 4, 2021 (N=121,231). All of these references are accessible through the package [<xref ref-type="bibr" rid="ref7">7</xref>]. We used the already-available metadata from the package and then, through NLP techniques, we also generated new metadata as explained further below.</p>
        <p>For the second and fourth stages, we used the Bibliometrix package in R (version 3.1.4; The R Foundation) on top of our own algorithms, notably to perform disambiguation of authors’ names or to build the SNA [<xref ref-type="bibr" rid="ref10">10</xref>]. We also created new metadata from the title, the abstract, the keywords, and the references. The latter was particularly computing intensive. Indeed, the algorithm scanned all the references in the references section of each paper. Metadata were generated using NLP techniques. To begin, we prepared the data set by choosing tokens and n-grams [<xref ref-type="bibr" rid="ref10">10</xref>].</p>
        <p>These attributes were required for conducting quantitative analysis on the sample. We were able to create a synthesis of research by using these machine learning tools in conjunction with other techniques, such as SNA. Additionally, the dynamics of research contributions, collaborations, idea generation, and dissemination were examined.</p>
      </sec>
      <sec>
        <title>Study Design</title>
        <p>The publishing landscape has shifted due to the introduction of new vehicles and practices, such as preprint servers and open data [<xref ref-type="bibr" rid="ref11">11</xref>]. Technological advances have also provided access to new methods, such as NLP and machine learning, to complement more conventional SLRs or to present findings when a meta-analysis is not possible [<xref ref-type="bibr" rid="ref12">12</xref>].</p>
        <p>The SLR process is one that enables the collection of pertinent evidence on a given topic that meets predefined eligibility criteria and provides an answer to the formulated research questions. Meta-analyses employ descriptive and/or inferential statistical methods to pool data from multiple studies on a single subject. Thus, the techniques enable knowledge to be generated from a variety of qualitative and quantitative studies. The conventional method entails four basic steps: (1) search (define the search string and database types), (2) appraisal (use predefined criteria for literature inclusion and exclusion, as well as quality-assessment criteria), (3) synthesis (extract and categorize the data), and (4) analysis (narrate the results and, finally, reach a conclusion) [<xref ref-type="bibr" rid="ref13">13</xref>].</p>
        <p>The SLR process is defined as a “systematic, explicit, and reproducible method for identifying, evaluating, and synthesizing the existing body of completed and recorded work” [<xref ref-type="bibr" rid="ref14">14</xref>]. According to Lasserson et al (page 1) [<xref ref-type="bibr" rid="ref15">15</xref>], “A systematic review attempts to collate all the empirical evidence that fits pre-specified eligibility criteria in order to answer a specific research question.”</p>
        <p>SLRs are not intended to be exhaustive or to be performed in real time. As a result, to complement SLRs, we proposed mapping the entire global research on coronaviruses, given the field’s rapid advancement. The large data set allowed us to analyze the metadata associated with the documents, such as the authors’ affiliations, universities, and references.</p>
        <p>Another significant contribution of this new methodology is the computational treatment based on NLP techniques to convert the text to data. As such, NLP in systematic reviews is not new, and some articles have reflected on the interests of NLP techniques [<xref ref-type="bibr" rid="ref16">16</xref>-<xref ref-type="bibr" rid="ref18">18</xref>]. In particular, a first set of papers were about information extraction using NLP toolkits like scispaCy [<xref ref-type="bibr" rid="ref19">19</xref>] or language-based models like BioBERT (bidirectional encoder representations from transformers for biomedical text mining) [<xref ref-type="bibr" rid="ref20">20</xref>,<xref ref-type="bibr" rid="ref21">21</xref>]. Another set of papers was about text classification and sentence extraction using BERT [<xref ref-type="bibr" rid="ref22">22</xref>,<xref ref-type="bibr" rid="ref23">23</xref>]. Using the CORD-19 data set from the Allen Institute for AI, some other papers have used paper titles and abstracts to build word pairs and co-occurrences to build knowledge graphs highlighting the existence of networks [<xref ref-type="bibr" rid="ref24">24</xref>,<xref ref-type="bibr" rid="ref25">25</xref>].</p>
        <p>In this paper, we extended these NLP techniques by constructing a series of SNAs using the metadata. We were able to uncover research patterns, research history, and the actual research vehicles, as well as connect discoveries to institutions, to name a few examples. Co-occurrences in the titles and abstracts of each paper were used to highlight the findings from our SNAs.</p>
        <p>Finally, another critical dimension was more specific and pertains to the use of each document’s references section. By concentrating on the metrics, researchers can decipher patterns of knowledge transmission. Due to the sheer volume of data being analyzed, this information can only be accessed via an algorithmic approach.</p>
        <p>Additionally, we were cognizant of the exploratory nature of our research, employing tools and techniques whose validity had yet to be established. O’Mara-Eves et al [<xref ref-type="bibr" rid="ref16">16</xref>] documented the biases introduced by machine learning techniques used in systematic reviews. Hopefully, this paper contributes like many others to this healthy and necessary trial and error exercise in terms of scientific validity [<xref ref-type="bibr" rid="ref17">17</xref>]. Indeed, these new techniques may be used to save time by automating certain tasks, to act as a secondary screener, and to provide new analytical options, such as SNA. This latter point is precisely why this paper exists, particularly in the context of public health policies.</p>
        <p>We organized the presentation of the results of these computations using the following theoretical framework. Aria and Cuccurullo [<xref ref-type="bibr" rid="ref10">10</xref>] suggested examining three distinct structures in their study design—conceptual, intellectual, and social structures—which we did as follows:</p>
        <list list-type="order">
          <list-item>
            <p>The conceptual structures were concerned with leveraging the metadata to understand better which concepts and topics are used and how they have evolved in academic discourse.</p>
          </list-item>
          <list-item>
            <p>The intellectual structures helped us in determining who originated these concepts, which journals aided in the establishment of this nascent literature, and which articles were most frequently cited in the establishment of this literature.</p>
          </list-item>
          <list-item>
            <p>Finally, the social structures enabled us to investigate authors’ collaborations and the knowledge support provided by universities and countries due to those collaborations.</p>
          </list-item>
        </list>
      </sec>
      <sec>
        <title>Data Extraction and Quality Assessment</title>
        <p>The relevant “universe” of the literature consists of references from EpiBibR (<xref ref-type="table" rid="table1">Table 1</xref>)<bold>,</bold> totaling 121,231 documents, most of which have been published in refereed journals (<xref ref-type="table" rid="table2">Table 2</xref>). The literature review covered the period between January 1, 2020, and May 2021.</p>
        <p>The year 2020 has seen an exponential growth of papers on coronaviruses, and 2021 seems to be a replication of 2020. The average citations per document were 0.04 with the information we had. It is a low number, probably explained by the fact that these publications were published in the last few months. As a reference point, the total citations per paper in clinical medicine for the highly cited papers were 5.78 for the 2017-2021 period (Clarivate Analytics, 2021). As seen in <xref ref-type="table" rid="table1">Table 1</xref>, the documents were published within 7160 different sources, a diverse set of publication vehicles.</p>
        <p><xref ref-type="table" rid="table2">Table 2</xref> summarizes the documents’ classifications. The results may be conservative, as some references in the original data set may not contain all of the necessary information. Taking this limitation into account, articles dominated the sample for the entire period (<xref ref-type="table" rid="table2">Table 2</xref>), accounting for 88,374 occurrences, followed by 16,405 preprints and letters. There have been 120 SLRs published. To summarize, brief contributions (ie, articles and preprints) served as a proxy for the final product.</p>
        <p>Consider the metadata generated from the authors’ names and the keywords chosen by the authors of the documents. Coronavirus research on a global scale encompassed 5118 keywords during the overall period (<xref ref-type="table" rid="table3">Table 3</xref>). It is also worth noting for policy makers that this is a research agenda that interests 377,405 authors. There are a plethora of potential questions raised by these data in the context of public health policy. Additionally, the majority of publications were multiauthored, indicating the increasingly collaborative nature of domain research.</p>
        <p>Additionally, the descriptive statistics analysis revealed an average of 3.11 authors and 7.15 coauthors for each publication (<xref ref-type="table" rid="table4">Table 4</xref>). The vast majority of documents were collaboratively written. Only 13,794 documents were written by a single individual (<xref ref-type="table" rid="table4">Table 4</xref> [<xref ref-type="bibr" rid="ref26">26</xref>]).</p>
        <p>Now consider the three distinct structural components: conceptual, intellectual, and social. The first two are required to complete the descriptive statistics aspect.</p>
        <table-wrap position="float" id="table1">
          <label>Table 1</label>
          <caption>
            <p>Preliminary information about data during the overall period and per year.</p>
          </caption>
          <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
            <col width="500"/>
            <col width="300"/>
            <col width="100"/>
            <col width="100"/>
            <thead>
              <tr valign="top">
                <td>Information</td>
                <td>Overall time period: 2020-2021</td>
                <td>2020</td>
                <td>2021</td>
              </tr>
            </thead>
            <tbody>
              <tr valign="top">
                <td>Sources (journals, books, etc), n</td>
                <td>7160</td>
                <td>6142</td>
                <td>4982</td>
              </tr>
              <tr valign="top">
                <td>Documents, n</td>
                <td>121,231</td>
                <td>83,090</td>
                <td>38,141</td>
              </tr>
              <tr valign="top">
                <td>Average years from publication</td>
                <td>0.685</td>
                <td>1</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Average citations per document</td>
                <td>0.04664</td>
                <td>0.06746</td>
                <td>0.001285</td>
              </tr>
              <tr valign="top">
                <td>Average citations per year per document</td>
                <td>0.02352</td>
                <td>0.03373</td>
                <td>0.001285</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <table-wrap position="float" id="table2">
          <label>Table 2</label>
          <caption>
            <p>Document type during the overall period and per year.</p>
          </caption>
          <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
            <col width="500"/>
            <col width="300"/>
            <col width="100"/>
            <col width="100"/>
            <thead>
              <tr valign="top">
                <td>Type of document</td>
                <td>Overall time period: 2020-2021, n</td>
                <td>2020, n</td>
                <td>2021, n</td>
              </tr>
            </thead>
            <tbody>
              <tr valign="top">
                <td>Case report</td>
                <td>3294</td>
                <td>2211</td>
                <td>1083</td>
              </tr>
              <tr valign="top">
                <td>Classical article</td>
                <td>2</td>
                <td>0</td>
                <td>2</td>
              </tr>
              <tr valign="top">
                <td>Clinical conference</td>
                <td>7</td>
                <td>5</td>
                <td>2</td>
              </tr>
              <tr valign="top">
                <td>Clinical study</td>
                <td>2</td>
                <td>2</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Clinical trial</td>
                <td>13</td>
                <td>7</td>
                <td>6</td>
              </tr>
              <tr valign="top">
                <td>Clinical trial protocol</td>
                <td>41</td>
                <td>39</td>
                <td>2</td>
              </tr>
              <tr valign="top">
                <td>Clinical trial, phase II</td>
                <td>1</td>
                <td>1</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Comparative study</td>
                <td>69</td>
                <td>58</td>
                <td>11</td>
              </tr>
              <tr valign="top">
                <td>Congress</td>
                <td>8</td>
                <td>5</td>
                <td>3</td>
              </tr>
              <tr valign="top">
                <td>Consensus development conference</td>
                <td>5</td>
                <td>4</td>
                <td>1</td>
              </tr>
              <tr valign="top">
                <td>Editorial</td>
                <td>5766</td>
                <td>4622</td>
                <td>1144</td>
              </tr>
              <tr valign="top">
                <td>English abstract</td>
                <td>1664</td>
                <td>1174</td>
                <td>490</td>
              </tr>
              <tr valign="top">
                <td>Equivalence trial</td>
                <td>1</td>
                <td>0</td>
                <td>1</td>
              </tr>
              <tr valign="top">
                <td>Evaluation study</td>
                <td>14</td>
                <td>11</td>
                <td>3</td>
              </tr>
              <tr valign="top">
                <td>Guideline</td>
                <td>15</td>
                <td>15</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Historical article</td>
                <td>22</td>
                <td>21</td>
                <td>1</td>
              </tr>
              <tr valign="top">
                <td>Interview</td>
                <td>32</td>
                <td>27</td>
                <td>5</td>
              </tr>
              <tr valign="top">
                <td>Introductory journal article</td>
                <td>6</td>
                <td>6</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Journal article</td>
                <td>88,374</td>
                <td>58,601</td>
                <td>29,773</td>
              </tr>
              <tr valign="top">
                <td>Lecture</td>
                <td>2</td>
                <td>2</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Preprint or letter</td>
                <td>16,405</td>
                <td>13,068</td>
                <td>3337</td>
              </tr>
              <tr valign="top">
                <td>Meta-analysis</td>
                <td>9</td>
                <td>5</td>
                <td>4</td>
              </tr>
              <tr valign="top">
                <td>Published erratum</td>
                <td>492</td>
                <td>270</td>
                <td>222</td>
              </tr>
              <tr valign="top">
                <td>Retraction of publication</td>
                <td>15</td>
                <td>7</td>
                <td>8</td>
              </tr>
              <tr valign="top">
                <td>Review</td>
                <td>1</td>
                <td>1</td>
                <td>0</td>
              </tr>
              <tr valign="top">
                <td>Systematic review</td>
                <td>120</td>
                <td>65</td>
                <td>55</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <table-wrap position="float" id="table3">
          <label>Table 3</label>
          <caption>
            <p>Document content and authors during the overall period and per year.</p>
          </caption>
          <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
            <col width="500"/>
            <col width="300"/>
            <col width="100"/>
            <col width="100"/>
            <thead>
              <tr valign="top">
                <td>Document content</td>
                <td>Overall time period: 2020-2021, n</td>
                <td>2020, n</td>
                <td>2021, n</td>
              </tr>
            </thead>
            <tbody>
              <tr valign="top">
                <td>Authors’ keywords</td>
                <td>5118</td>
                <td>4699</td>
                <td>2044</td>
              </tr>
              <tr valign="top">
                <td>Authors</td>
                <td>377,405</td>
                <td>266,579</td>
                <td>188,900</td>
              </tr>
              <tr valign="top">
                <td>Author appearances</td>
                <td>866,589</td>
                <td>569,924</td>
                <td>296,665</td>
              </tr>
              <tr valign="top">
                <td>Authors of single-authored documents</td>
                <td>8819</td>
                <td>6835</td>
                <td>2580</td>
              </tr>
              <tr valign="top">
                <td>Authors of multiauthored documents</td>
                <td>368,586</td>
                <td>259,744</td>
                <td>186,320</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <table-wrap position="float" id="table4">
          <label>Table 4</label>
          <caption>
            <p>Details about authors’ collaborations.</p>
          </caption>
          <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
            <col width="500"/>
            <col width="300"/>
            <col width="100"/>
            <col width="100"/>
            <thead>
              <tr valign="top">
                <td>Collaboration measure</td>
                <td>Overall time period: 2020-2021</td>
                <td>2020</td>
                <td>2021</td>
              </tr>
            </thead>
            <tbody>
              <tr valign="top">
                <td>Single-authored documents, n</td>
                <td>13,794</td>
                <td>10,324</td>
                <td>3470</td>
              </tr>
              <tr valign="top">
                <td>Documents per author, n</td>
                <td>0.321</td>
                <td>0.312</td>
                <td>0.202</td>
              </tr>
              <tr valign="top">
                <td>Authors-per-document index<sup>a</sup></td>
                <td>3.11</td>
                <td>3.21</td>
                <td>4.95</td>
              </tr>
              <tr valign="top">
                <td>Coauthors per document, n</td>
                <td>7.15</td>
                <td>6.86</td>
                <td>7.78</td>
              </tr>
              <tr valign="top">
                <td>Collaboration index<sup>b</sup></td>
                <td>3.43</td>
                <td>3.57</td>
                <td>5.37</td>
              </tr>
            </tbody>
          </table>
          <table-wrap-foot>
            <fn id="table4fn1">
              <p><sup>a</sup>The authors-per-document index was calculated by dividing the total number of authors by the total number of articles.</p>
            </fn>
            <fn id="table4fn2">
              <p><sup>b</sup>The collaboration index was calculated by multiplying the total number of authors on multiauthored documents by the total number of multiauthored documents [<xref ref-type="bibr" rid="ref26">26</xref>].</p>
            </fn>
          </table-wrap-foot>
        </table-wrap>
      </sec>
    </sec>
    <sec sec-type="results">
      <title>Results</title>
      <sec>
        <title>Overview</title>
        <p>As mentioned in the Methods section, we used Aria and Cuccurullo’s [<xref ref-type="bibr" rid="ref10">10</xref>] theoretical framework to present our findings. We present, respectively, the conceptual, intellectual, and social structures. For each structure, we present the relevant metrics that are available.</p>
        <p>Additionally, as a proof of concept, we generated the necessary metadata and metrics based on the 121,231 total documents. We would encourage future researchers to filter the data set to address their own research questions, for example, by limiting their search to randomized controlled trial documents or even by content, such as proteins. Due to the fact that text is data, a new set of options becomes available.</p>
      </sec>
      <sec>
        <title>Conceptual Structures of the Global Research on Coronaviruses</title>
        <sec>
          <title>Overview</title>
          <p>In the following subsections, we examined the conceptual structures of our sample by analyzing the keywords, their co-occurrences, and the evolution of the topics using a topic modeling technique. To create this conceptual framework, we created a matrix of the keywords and titles of the 121,231 documents.</p>
        </sec>
        <sec>
          <title>Keyword-Based Metrics</title>
          <p>The keyword section of <xref rid="figure2" ref-type="fig">Figure 2</xref> highlights the most frequently used keywords by authors in their documents. Between 2020 and 2021, it was largely stable. <xref ref-type="table" rid="table5">Table 5</xref> displays the top keywords in the overall sample and per year.</p>
          <fig id="figure2" position="float">
            <label>Figure 2</label>
            <caption>
              <p>Evolution of the usage of authors’ keywords.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig2.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <table-wrap position="float" id="table5">
            <label>Table 5</label>
            <caption>
              <p>Most relevant keywords during the overall period and per year.</p>
            </caption>
            <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
              <col width="30"/>
              <col width="470"/>
              <col width="0"/>
              <col width="500"/>
              <thead>
                <tr valign="top">
                  <td colspan="3">Author keywords</td>
                  <td>Articles where keywords appear (N=121,231), n (%)</td>
                </tr>
              </thead>
              <tbody>
                <tr valign="top">
                  <td colspan="4">
                    <bold>Overall time period: 2020-2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Epidemiology</td>
                  <td colspan="2">8216 (6.8)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Humans</td>
                  <td colspan="2">8188 (6.8)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Pandemics</td>
                  <td colspan="2">6829 (5.6)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Coronavirus infections</td>
                  <td colspan="2">6807 (5.6)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Pneumonia viral</td>
                  <td colspan="2">6672 (5.5)</td>
                </tr>
                <tr valign="top">
                  <td colspan="4">
                    <bold>2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Humans</td>
                  <td colspan="2">1296 (1.1)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>COVID-19</td>
                  <td colspan="2">1246 (1.1)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>SARS-CoV-2</td>
                  <td colspan="2">857 (0.1)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Epidemiology</td>
                  <td colspan="2">799 (0.1)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Pandemics</td>
                  <td colspan="2">425 (0.1)</td>
                </tr>
                <tr valign="top">
                  <td colspan="4">
                    <bold>2020</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Epidemiology</td>
                  <td colspan="2">7417 (6.1)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Humans</td>
                  <td colspan="2">6892 (5.7)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Coronavirus infections</td>
                  <td colspan="2">6759 (5.6)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Pneumonia viral</td>
                  <td colspan="2">6658 (5.5)</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Pandemics</td>
                  <td colspan="2">6404 (5.3)</td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
        </sec>
        <sec>
          <title>Topic Modeling–Based Analyses Using Keywords</title>
          <p>We added a new dimension to the analysis in the following section using structural topic modeling. The purpose of this section is to supplement the information gleaned from keyword co-occurrences. We illustrate this analysis in <xref rid="figure3" ref-type="fig">Figure 3</xref> (overall period), <xref rid="figure4" ref-type="fig">Figure 4</xref> (2020), and <xref rid="figure5" ref-type="fig">Figure 5</xref> (2021). We discovered that the topics were classified into four categories: fundamental themes, emerging or declining themes, niche themes, and motor themes. The results in this case were based on the keywords solely to demonstrate the framework.</p>
          <p>The analysis can be carried out using techniques for dimensionality reduction. The following sections make use of multiple correspondence analysis.</p>
          <p>We augmented our field’s conceptual structure with k-means clustering in order to identify clusters of documents expressing common concepts solely based on keywords. We used NLP to extract terms from the keywords section. In addition, the algorithm implemented the Porter stemming algorithm to reduce inflected, or sometimes derived, words to their word stem, base, or root form. Finally, we tokenized all the words, and we computed the latent variables to identify potential topics. Because of the necessary high computing power, we performed this analysis on the 2021 data set.</p>
          <p><xref rid="figure6" ref-type="fig">Figures 6</xref> and <xref rid="figure7" ref-type="fig">7</xref> illustrate ample room for policy implications regarding social distancing and vaccination, respectively (red). The significant topic is population (ie, health status, age, and so on), which is depicted in blue in <xref rid="figure6" ref-type="fig">Figure 6</xref> and red in <xref rid="figure7" ref-type="fig">Figure 7</xref>. The same analysis can be performed on additional terms, such as those found in titles, abstracts, or references. As a result, a plethora of potential classifications becomes available.</p>
          <p>Following our examination of possible measures of conceptual structures, let us turn our attention to the analysis of intellectual structures.</p>
          <fig id="figure3" position="float">
            <label>Figure 3</label>
            <caption>
              <p>Topic modeling for the overall period.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig3.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure4" position="float">
            <label>Figure 4</label>
            <caption>
              <p>Topic modeling for 2020.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig4.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure5" position="float">
            <label>Figure 5</label>
            <caption>
              <p>Topic modeling for 2021.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig5.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure6" position="float">
            <label>Figure 6</label>
            <caption>
              <p>Conceptual structure map based on multiple correspondence analysis. Dim: dimension.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig6.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure7" position="float">
            <label>Figure 7</label>
            <caption>
              <p>Topic dendogram.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig7.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
        </sec>
      </sec>
      <sec>
        <title>Intellectual Structures of the Global Research on Coronaviruses</title>
        <p>Another dimension leading to another interesting analysis is to know who, what journals, and which organizations are leaders in these topic dynamics.</p>
        <sec>
          <title>Author-Based Metrics</title>
          <p>In the intellectual structure, authors are interesting to consider for public policies. These metrics come with many biases, as some family names can be prevalent. An important dimension is equity, diversity, and inclusion (EDI). It is not the focus of this paper on public health policy. However, it is possible for future research to delve deeper into this author component of the intellectual structure. With this algorithmic approach and the available metadata, scholars can design EDI metrics to assess, for instance, gender-related questions, such as first and last authors; leadership positions in academia; among others [<xref ref-type="bibr" rid="ref27">27</xref>-<xref ref-type="bibr" rid="ref32">32</xref>]. An EDI-based analysis could also correct for the fact that fewer articles have females as the last author and these articles accrue fewer citations per publication [<xref ref-type="bibr" rid="ref33">33</xref>]. With this metadata-based approach, scholars have access to these metrics. This is a subject that would require a more comprehensive examination of the field as a whole, which is beyond the scope of this work.</p>
          <p>In <xref rid="figure8" ref-type="fig">Figures 8</xref> and <xref rid="figure9" ref-type="fig">9</xref>, respectively, we present the total count per name for the overall period and per year. It is important to note that homonymy is always an issue to correct. To correct for homonymy, several strategies exist. We could use the ORCID (Open Researcher and Contributor ID) numbers or any other unique identifier. Unfortunately, this information was not available in the original data set. Thus, we designed an algorithm that would associate an author’s name with a university’s name. We sorted the whole data set making sure there were unique pairs of authors and affiliations. Sometimes, university affiliations were written in different forms. We corrected them by creating a dictionary of affiliations to standardize the format.</p>
          <fig id="figure8" position="float">
            <label>Figure 8</label>
            <caption>
              <p>Top authors in terms of production during the overall period.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig8.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure9" position="float">
            <label>Figure 9</label>
            <caption>
              <p>Top authors in terms of production per year.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig9.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <p>We can go a little deeper and look at the average productivity of all the authors. One way to design better metrics would be to consider how many articles an author produces per year in our 2-year sample. In <xref rid="figure10" ref-type="fig">Figures 10</xref>-<xref rid="figure12" ref-type="fig">12</xref>, we computed the Lotka coefficient for the overall period, 2020, and 2021, respectively, to compare the scientific productivity of researchers to the Lotka theoretical coefficient [<xref ref-type="bibr" rid="ref34">34</xref>]. The Lotka law describes the frequency of publication by authors as an inverse square law, where the number of authors publishing a certain number of articles is a fixed ratio to the number of authors publishing a single article. This assumption implies that the theoretical β coefficient of the Lotka law is equal to 2.</p>
          <p><xref rid="figure10" ref-type="fig">Figures 10</xref>-<xref rid="figure12" ref-type="fig">12</xref> describe the share of authors having published a certain number of articles. Here, there was a statistically significant difference between the observed and the theoretical Lotka distributions, meaning that authors were more prolific in this research topic. This does not come as a surprise, considering the urgency of the topic.</p>
          <fig id="figure10" position="float">
            <label>Figure 10</label>
            <caption>
              <p>Scientific productivity during the overall period.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig10.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure11" position="float">
            <label>Figure 11</label>
            <caption>
              <p>Scientific productivity during 2020.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig11.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure12" position="float">
            <label>Figure 12</label>
            <caption>
              <p>Scientific productivity during 2021.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig12.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <p>Due to the large size of the data set, our dedicated server was not powerful enough to compute the results. Our strategy was, thus, to extract a random sample for 2020 and 2021 of 25,000 documents each year. The 2021 sample corresponded to 65.5% of the total 2021 data set. The 2020 sample corresponded to 30.0% of the total 2020 data set.</p>
          <p>To go further, we narrowed it down to specific groups of authors, institutions, or research teams and computed the scientific productivity. It may be relevant, indeed, to allocate resources, as a policy maker, to some of these dimensions.</p>
          <p>To conclude, in <xref rid="figure13" ref-type="fig">Figure 13</xref>, we first filtered the original authors’ list to authors having published fewer than 25 articles and to those who had fewer than 20 total citations per year. It was an arbitrary choice, and we could easily filter it differently, which is precisely in line with our main point: data science allows this agile adaptation.</p>
          <p>Let us now move to the article element as another interesting dimension to measure intellectual structures.</p>
          <fig id="figure13" position="float">
            <label>Figure 13</label>
            <caption>
              <p>Productivity of the top authors over time. TC: total citations.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig13.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
        </sec>
        <sec>
          <title>Article-Based Metrics</title>
          <p>We had a look at the citations from the data set (N=121,231). Authors represented interesting information regarding public health policies, including their productivity metrics, but we also found it interesting that the most cited manuscripts may help refine the metrics (<xref ref-type="table" rid="table6">Table 6</xref>).</p>
          <p>Let us now go deeper and consider the social structures of the global research on coronaviruses.</p>
          <table-wrap position="float" id="table6">
            <label>Table 6</label>
            <caption>
              <p>Most cited manuscripts.</p>
            </caption>
            <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
              <col width="500"/>
              <col width="200"/>
              <col width="300"/>
              <thead>
                <tr valign="top">
                  <td>Articles (author, year, journal)</td>
                  <td>Total citations, n</td>
                  <td>Total citations per year, n</td>
                </tr>
              </thead>
              <tbody>
                <tr valign="top">
                  <td>Huang C, 2020, The Lancet</td>
                  <td>146</td>
                  <td>73.0</td>
                </tr>
                <tr valign="top">
                  <td>Zhu N, 2020, New England Journal of Medicine</td>
                  <td>102</td>
                  <td>51.0</td>
                </tr>
                <tr valign="top">
                  <td>Chen N, 2020, The Lancet</td>
                  <td>100</td>
                  <td>50.0</td>
                </tr>
                <tr valign="top">
                  <td>Li Q, 2020, New England Journal of Medicine</td>
                  <td>89</td>
                  <td>44.5</td>
                </tr>
                <tr valign="top">
                  <td>Chan JF, 2020, The Lancet</td>
                  <td>75</td>
                  <td>37.5</td>
                </tr>
                <tr valign="top">
                  <td>Veljkovic V, 2021, F1000Research</td>
                  <td>7</td>
                  <td>7.0</td>
                </tr>
                <tr valign="top">
                  <td>Endo A, 2021, Wellcome Open Research</td>
                  <td>6</td>
                  <td>6.0</td>
                </tr>
                <tr valign="top">
                  <td>Wang L, 2021, medRxiv</td>
                  <td>2</td>
                  <td>2.0</td>
                </tr>
                <tr valign="top">
                  <td>Fu L, 2021, Clinical Cardiology</td>
                  <td>1</td>
                  <td>1.0</td>
                </tr>
                <tr valign="top">
                  <td>Ackermann M, 2021, New England Journal of Medicine</td>
                  <td>1</td>
                  <td>1.0</td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
        </sec>
      </sec>
      <sec>
        <title>Social Structures of the Global Research on Coronaviruses</title>
        <p>In this section, we focus on different measures to capture the social connections: the co-citations of authors, the co-citations of articles, the co-citations of journals, and the collaborations across institutions.</p>
        <sec>
          <title>Authors’ Collaboration Metrics</title>
          <p><xref rid="figure14" ref-type="fig">Figure 14</xref> highlights the authors’ collaborations. This figure shows the network of the top authors. Again, we can see a high level of collaboration and knowledge transfer. In further research, scholars could also perform the analyses with EDI in mind and use the metadata to have a metric of potential EDI metric imbalances [<xref ref-type="bibr" rid="ref35">35</xref>]. This can be particularly useful in order to correct these imbalances.</p>
          <p>Let us now move our discussion to the country level.</p>
          <fig id="figure14" position="float">
            <label>Figure 14</label>
            <caption>
              <p>Authors’ collaboration networks in 2021.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig14.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
        </sec>
        <sec>
          <title>Country-Based Metrics</title>
          <p>It is also possible to extract country information from the documents. We mapped the top five countries per period. Most of the authors were residents of the United States, the People’s Republic of China, India, and Europe (<xref ref-type="table" rid="table7">Table 7</xref>).</p>
          <p><xref ref-type="table" rid="table8">Table 8</xref> provides supplementary information on the total citations per country. Again, the United States and China dominated the ranking.</p>
          <p><xref rid="figure15" ref-type="fig">Figures 15</xref> and <xref rid="figure16" ref-type="fig">16</xref> show an apparent increase in the contributions coming from Asia: China and India were at the forefront of academic production. Starting from a bibliographic matrix, two groups of descriptive measures were computed: (1) the summary statistics of the network and (2) the leading indices of centrality and prestige of vertices.</p>
          <table-wrap position="float" id="table7">
            <label>Table 7</label>
            <caption>
              <p>Corresponding authors’ countries during the overall period and per year.</p>
            </caption>
            <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
              <col width="30"/>
              <col width="140"/>
              <col width="200"/>
              <col width="100"/>
              <col width="170"/>
              <col width="190"/>
              <col width="170"/>
              <thead>
                <tr valign="top">
                  <td colspan="2">Country</td>
                  <td>Articles (N=121,231), n (%)</td>
                  <td>Frequency</td>
                  <td>Single-country<break/>publications</td>
                  <td>Multiple-country<break/>publications</td>
                  <td>Multiple-country<break/>publications ratio</td>
                </tr>
              </thead>
              <tbody>
                <tr valign="top">
                  <td colspan="7">
                    <bold>Overall time period: 2020-2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>15,904 (13.1)</td>
                  <td>0.1923</td>
                  <td>15,840</td>
                  <td>64</td>
                  <td>0.004024</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>11,471 (9.5)</td>
                  <td>0.1387</td>
                  <td>11,451</td>
                  <td>20</td>
                  <td>0.001744</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Italy</td>
                  <td>7565 (6.2)</td>
                  <td>0.0915</td>
                  <td>7533</td>
                  <td>32</td>
                  <td>0.004230</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>India</td>
                  <td>5314 (4.4)</td>
                  <td>0.0643</td>
                  <td>5295</td>
                  <td>19</td>
                  <td>0.003575</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>France</td>
                  <td>3156 (2.6)</td>
                  <td>0.0382</td>
                  <td>3139</td>
                  <td>17</td>
                  <td>0.005387</td>
                </tr>
                <tr valign="top">
                  <td colspan="7">
                    <bold>2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>5483 (4.5)</td>
                  <td>0.2025</td>
                  <td>5433</td>
                  <td>50</td>
                  <td>0.00912</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>2859 (2.4)</td>
                  <td>0.1056</td>
                  <td>2843</td>
                  <td>16</td>
                  <td>0.00560</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Italy</td>
                  <td>2052 (1.7)</td>
                  <td>0.0758</td>
                  <td>2022</td>
                  <td>30</td>
                  <td>0.01462</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>India</td>
                  <td>1838 (1.5)</td>
                  <td>0.0679</td>
                  <td>1824</td>
                  <td>14</td>
                  <td>0.00762</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Spain</td>
                  <td>980 (0.1)</td>
                  <td>0.0362</td>
                  <td>975</td>
                  <td>5</td>
                  <td>0.00510</td>
                </tr>
                <tr valign="top">
                  <td colspan="7">
                    <bold>2020</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>10,421 (8.6)</td>
                  <td>0.1874</td>
                  <td>10,407</td>
                  <td>14</td>
                  <td>0.001343</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>8612 (7.1)</td>
                  <td>0.1549</td>
                  <td>8608</td>
                  <td>4</td>
                  <td>0.000464</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Italy</td>
                  <td>5513 (4.5)</td>
                  <td>0.0991</td>
                  <td>5511</td>
                  <td>2</td>
                  <td>0.000363</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>India</td>
                  <td>3476 (2.9)</td>
                  <td>0.0625</td>
                  <td>3471</td>
                  <td>5</td>
                  <td>0.001438</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>France</td>
                  <td>2237 (1.8)</td>
                  <td>0.0402</td>
                  <td>2236</td>
                  <td>1</td>
                  <td>0.000447</td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
          <table-wrap position="float" id="table8">
            <label>Table 8</label>
            <caption>
              <p>Total citations per country during the overall period and per year.</p>
            </caption>
            <table width="1000" cellpadding="5" cellspacing="0" border="1" rules="groups" frame="hsides">
              <col width="30"/>
              <col width="470"/>
              <col width="250"/>
              <col width="0"/>
              <col width="250"/>
              <thead>
                <tr valign="top">
                  <td colspan="2">Country</td>
                  <td colspan="2">Total citations, n</td>
                  <td>Average article citation</td>
                </tr>
              </thead>
              <tbody>
                <tr valign="top">
                  <td colspan="5">
                    <bold>Overall time period: 2020-2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>2011</td>
                  <td colspan="2">0.17531</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>550</td>
                  <td colspan="2">0.03458</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Italy</td>
                  <td>315</td>
                  <td colspan="2">0.04164</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Germany</td>
                  <td>131</td>
                  <td colspan="2">0.05240</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>France</td>
                  <td>129</td>
                  <td colspan="2">0.04087</td>
                </tr>
                <tr valign="top">
                  <td colspan="5">
                    <bold>2021</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>10</td>
                  <td colspan="2">0.001824</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>4</td>
                  <td colspan="2">0.001399</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Germany</td>
                  <td>4</td>
                  <td colspan="2">0.004381</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Belgium</td>
                  <td>1</td>
                  <td colspan="2">0.004484</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>France</td>
                  <td>1</td>
                  <td colspan="2">0.001088</td>
                </tr>
                <tr valign="top">
                  <td colspan="5">
                    <bold>2020</bold>
                  </td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>China</td>
                  <td>2007</td>
                  <td colspan="2">0.23305</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>United States</td>
                  <td>540</td>
                  <td colspan="2">0.05182</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Italy</td>
                  <td>314</td>
                  <td colspan="2">0.05696</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>France</td>
                  <td>128</td>
                  <td colspan="2">0.05722</td>
                </tr>
                <tr valign="top">
                  <td>
                    <break/>
                  </td>
                  <td>Germany</td>
                  <td>127</td>
                  <td colspan="2">0.08003</td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
          <fig id="figure15" position="float">
            <label>Figure 15</label>
            <caption>
              <p>The most productive countries during the overall period, according to authors’ residences.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig15.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure16" position="float">
            <label>Figure 16</label>
            <caption>
              <p>The most productive countries during 2021 (top) and 2020 (bottom), according to authors’ residences.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig16.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <p>We can then graph the country networks using these new measures. It is, in our opinion, an excellent showcase for public health policies and decision making. It is critical information for international health organizations, research institutions, and national governments (<xref rid="figure17" ref-type="fig">Figures 17</xref>-<xref rid="figure19" ref-type="fig">19</xref>)</p>
          <fig id="figure17" position="float">
            <label>Figure 17</label>
            <caption>
              <p>Country collaboration networks during the overall period.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig17.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure18" position="float">
            <label>Figure 18</label>
            <caption>
              <p>Country collaboration networks during 2020.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig18.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure19" position="float">
            <label>Figure 19</label>
            <caption>
              <p>Country collaboration networks during 2021.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig19.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <p>Considering the results mentioned above, the United States and China are at the forefront of academic production. Below, we also investigated the connections at the institutional level.</p>
        </sec>
        <sec>
          <title>Co-citations of Institutional Metrics</title>
          <p>In order to continue our social structure–oriented analysis, we made use of the collaborations that have developed among universities. We used the authors’ affiliations as relevant metadata in this case, and we created a collaboration matrix to facilitate the mapping of existing links.</p>
          <p>The network of university collaborations is also worth studying for public health policy purposes (<xref rid="figure20" ref-type="fig">Figures 20</xref>-<xref rid="figure22" ref-type="fig">22</xref>), as it indicates a strong collaboration between universities within the United States, between the United States and Canada, and between the United States and the United Kingdom.</p>
          <fig id="figure20" position="float">
            <label>Figure 20</label>
            <caption>
              <p>University collaboration networks during the overall period.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig20.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure21" position="float">
            <label>Figure 21</label>
            <caption>
              <p>University collaboration networks during 2020.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig21.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <fig id="figure22" position="float">
            <label>Figure 22</label>
            <caption>
              <p>University collaboration networks during 2021.</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig22.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
          <p>Another point worth noting is the lack of stability between 2020 and 2021, indicating that authors from various universities preferred to collaborate on topics relevant to their research rather than replicate previous collaborations. However, we only have data for 2020 and the first half of 2021 to compare, and it would require additional research to determine whether these collaborations can be sustained over time.</p>
          <p>To summarize, <xref rid="figure23" ref-type="fig">Figure 23</xref> visualizes the major components of three fields (ie, authors, keywords, and journals) and their relationships using a so-called Sankey diagram. Particularly evident in the three fields plotted in <xref rid="figure23" ref-type="fig">Figure 23</xref> are the connections between the main keywords and interest in these keywords expressed by the editors of the leading journals. We can see that the majority of the journals published articles that contained the most popular keywords suggested by the authors. Currently, there are no differentiation strategies being implemented by the publishers. <xref rid="figure23" ref-type="fig">Figure 23</xref> was compiled based on 25,000 documents randomly extracted, due to the computing power limits.</p>
          <fig id="figure23" position="float">
            <label>Figure 23</label>
            <caption>
              <p>Sankey diagram of three fields representing 2020 data: authors (left), keywords (middle), and journals (right).</p>
            </caption>
            <graphic xlink:href="medinform_v9i11e31510_fig23.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
          </fig>
        </sec>
      </sec>
    </sec>
    <sec sec-type="discussion">
      <title>Discussion</title>
      <sec>
        <title>Principal Findings</title>
        <p>We used metadata to conduct an analysis of the global research on coronaviruses. A large portion of this analysis was carried out using data science techniques, such as NLP and structured natural language analysis. It was a time-consuming and computationally intensive task. A metadata-based approach to conducting SLRs complements more traditional methods of conducting systematic reviews of the literature. There are three axes that we used to organize the literature mapping: conceptual, intellectual, and social.</p>
        <p>When dealing with a crisis, timing is everything. Our findings were based on the transformation of text to data and then NLP analyses of the overall global research on coronaviruses. We conducted our research in order to demonstrate what we hoped would be a proof of concept. As a result, this paper falls under the umbrella term of “action research.” It was our goal to demonstrate some metrics that can be applied to text-based documents, as well as how they could be applied to public health policies, with this proof of concept.</p>
        <p>Our findings are, thus, essentially methodological and can demonstrate this approach’s ability to optimize global research support. In this paper, based on data science techniques, we designed some metrics, which are static in a PDF document. Now, another powerful feature is that by using the EpiBibR data package in a research pipeline based on code, we can compile those metrics in almost real time. Indeed, all those visuals can be updated on a daily basis when the package updates itself.</p>
        <p>In terms of actionable metrics, we have discovered that most of the research was developed in 2020 and 2021, although the first article appeared in July 1949. We also learned that the United States is the leading country in terms of scientific research on this topic. China comes second, and then individual European Union members. It was also interesting to be able to identify the international collaborations between research centers, notably between the United States, Canada, and the United Kingdom. Another interesting result was being able to capture the sizes of the research fields related to the coronaviruses, such as epidemiology, pneumology, among others.</p>
      </sec>
      <sec>
        <title>Strengths and Limitations</title>
        <p>Policy makers must use the most effective tools when designing public health responses in the context of the COVID-19 pandemic. Using coronaviruses as an example, this paper proposed a framework for identifying key topics and research institutions that conduct the most relevant coronavirus research.</p>
        <p>This is especially true in the midst of what are referred to as infodemics [<xref ref-type="bibr" rid="ref36">36</xref>]. Health policy makers may be exposed to risks associated with a lack of information, but they may also be exposed to risks associated with an overabundance of information. The quality of the information is the most important factor to consider. Indeed, one of the issues raised by WHO Director-General Tedros Ghebreyesus at the beginning of the pandemic was the “infodemic,” which is defined as the rapid spread of large volumes of information, whether true or false; the infodemic was declared on February 15, 2020 [<xref ref-type="bibr" rid="ref37">37</xref>].</p>
        <p>We must rely even more heavily on the contributions of the scientific community in the future. Because of advances in technology and data accessibility, policy makers today must employ the most up-to-date data science techniques in order to develop evidence-based public health policies, even more so in the COVID-19 era.</p>
        <p>Our framework has also helped bring to light some of the limitations and biases that can be introduced into the process. These are not roadblocks, but rather concerns that a health data scientist should take into consideration. When it comes to author names, the homonymy problem serves as an excellent illustration. EDI is another aspect to consider in using those metrics. There are solutions to this problem, but they must be taken into consideration.</p>
        <p>Another constraint is the amount of computing power required to run these machine learning routines on a large scale. National governments and international organizations, on the other hand, are not bound by this restriction in any way.</p>
        <p>It may also be beneficial to include references from other disciplines in order to benefit from the vast number of methodologies, theories, and concepts that are available. In order to assess the spread of the disease, for example, demographers’ literature, as well as theories, would undoubtedly be relevant.</p>
      </sec>
      <sec>
        <title>Conclusions</title>
        <p>This is the first time that metadata have been used to analyze global research on coronaviruses. A total of 121,231 documents have been processed, resulting in a text-as-data data set. Using machine learning and NLP techniques, we have proposed a framework for public health policy makers. This framework and its metrics have the potential to assist national governments and international organizations, such as the WHO, in identifying critical global collaborations in the fight against COVID-19. It exemplifies the utility of emerging data science techniques and new modes of thought in public health.</p>
      </sec>
    </sec>
  </body>
  <back>
    <app-group/>
    <glossary>
      <title>Abbreviations</title>
      <def-list>
        <def-item>
          <term id="abb1">2019-nCoV</term>
          <def>
            <p>novel coronavirus</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb2">AI</term>
          <def>
            <p>artificial intelligence</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb3">BERT</term>
          <def>
            <p>bidirectional encoder representations from transformers</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb4">BioBERT</term>
          <def>
            <p>bidirectional encoder representations from transformers for biomedical text mining</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb5">CIRANO</term>
          <def>
            <p>Centre interuniversitaire de recherche en analyse des organisations</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb6">CORD-19</term>
          <def>
            <p>COVID-19 Open Research Dataset</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb7">EDI</term>
          <def>
            <p>equity, diversity, and inclusion</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb8">EpiBibR</term>
          <def>
            <p>epidemiology-based bibliography for R</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb9">MERS-CoV</term>
          <def>
            <p>Middle East respiratory syndrome coronavirus</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb10">NLP</term>
          <def>
            <p>natural language processing</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb11">ORCID</term>
          <def>
            <p>Open Researcher and Contributor ID</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb12">SLR</term>
          <def>
            <p>systematic literature review</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb13">SNA</term>
          <def>
            <p>social network analysis</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb14">WHO</term>
          <def>
            <p>World Health Organization</p>
          </def>
        </def-item>
      </def-list>
    </glossary>
    <ack>
      <p>The author expresses his gratitude to CIRANO (Centre interuniversitaire de recherche en analyse des organisations; Montreal, Canada) and Marine Leroi (CIRANO) for their support.</p>
    </ack>
    <fn-group>
      <fn fn-type="conflict">
        <p>None declared.</p>
      </fn>
    </fn-group>
    <ref-list>
      <ref id="ref1">
        <label>1</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Wu</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Hu</surname>
              <given-names>E</given-names>
            </name>
            <name name-style="western">
              <surname>Ge</surname>
              <given-names>X</given-names>
            </name>
            <name name-style="western">
              <surname>Yu</surname>
              <given-names>G</given-names>
            </name>
          </person-group>
          <article-title>nCov2019: An R package for studying the COVID-19 coronavirus pandemic</article-title>
          <source>PeerJ</source>
          <year>2021</year>
          <volume>9</volume>
          <fpage>e11421</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://doi.org/10.7717/peerj.11421"/>
          </comment>
          <pub-id pub-id-type="doi">10.7717/peerj.11421</pub-id>
          <pub-id pub-id-type="medline">34178436</pub-id>
          <pub-id pub-id-type="pii">11421</pub-id>
          <pub-id pub-id-type="pmcid">PMC8199916</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref2">
        <label>2</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Wang</surname>
              <given-names>C</given-names>
            </name>
            <name name-style="western">
              <surname>Horby</surname>
              <given-names>PW</given-names>
            </name>
            <name name-style="western">
              <surname>Hayden</surname>
              <given-names>FG</given-names>
            </name>
            <name name-style="western">
              <surname>Gao</surname>
              <given-names>GF</given-names>
            </name>
          </person-group>
          <article-title>A novel coronavirus outbreak of global health concern</article-title>
          <source>Lancet</source>
          <year>2020</year>
          <month>02</month>
          <day>15</day>
          <volume>395</volume>
          <issue>10223</issue>
          <fpage>470</fpage>
          <lpage>473</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://linkinghub.elsevier.com/retrieve/pii/S0140-6736(20)30185-9"/>
          </comment>
          <pub-id pub-id-type="doi">10.1016/S0140-6736(20)30185-9</pub-id>
          <pub-id pub-id-type="medline">31986257</pub-id>
          <pub-id pub-id-type="pii">S0140-6736(20)30185-9</pub-id>
          <pub-id pub-id-type="pmcid">PMC7135038</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref3">
        <label>3</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Chalmers</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Bracken</surname>
              <given-names>MB</given-names>
            </name>
            <name name-style="western">
              <surname>Djulbegovic</surname>
              <given-names>B</given-names>
            </name>
            <name name-style="western">
              <surname>Garattini</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Grant</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Gülmezoglu</surname>
              <given-names>AM</given-names>
            </name>
            <name name-style="western">
              <surname>Howells</surname>
              <given-names>DW</given-names>
            </name>
            <name name-style="western">
              <surname>Ioannidis</surname>
              <given-names>JPA</given-names>
            </name>
            <name name-style="western">
              <surname>Oliver</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>How to increase value and reduce waste when research priorities are set</article-title>
          <source>Lancet</source>
          <year>2014</year>
          <month>01</month>
          <day>11</day>
          <volume>383</volume>
          <issue>9912</issue>
          <fpage>156</fpage>
          <lpage>165</lpage>
          <pub-id pub-id-type="doi">10.1016/S0140-6736(13)62229-1</pub-id>
          <pub-id pub-id-type="medline">24411644</pub-id>
          <pub-id pub-id-type="pii">S0140-6736(13)62229-1</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref4">
        <label>4</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Macleod</surname>
              <given-names>MR</given-names>
            </name>
            <name name-style="western">
              <surname>Michie</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Roberts</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Dirnagl</surname>
              <given-names>U</given-names>
            </name>
            <name name-style="western">
              <surname>Chalmers</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Ioannidis</surname>
              <given-names>JPA</given-names>
            </name>
            <name name-style="western">
              <surname>Al-Shahi Salman</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Chan</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Glasziou</surname>
              <given-names>P</given-names>
            </name>
          </person-group>
          <article-title>Biomedical research: Increasing value, reducing waste</article-title>
          <source>Lancet</source>
          <year>2014</year>
          <month>01</month>
          <day>11</day>
          <volume>383</volume>
          <issue>9912</issue>
          <fpage>101</fpage>
          <lpage>104</lpage>
          <pub-id pub-id-type="doi">10.1016/S0140-6736(13)62329-6</pub-id>
          <pub-id pub-id-type="medline">24411643</pub-id>
          <pub-id pub-id-type="pii">S0140-6736(13)62329-6</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref5">
        <label>5</label>
        <nlm-citation citation-type="book">
          <person-group person-group-type="editor">
            <name name-style="western">
              <surname>Carrington</surname>
              <given-names>PJ</given-names>
            </name>
            <name name-style="western">
              <surname>Scott</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Wasserman</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <source>Models and Methods in Social Network Analysis</source>
          <year>2005</year>
          <publisher-loc>Cambridge, UK</publisher-loc>
          <publisher-name>Cambridge University Press</publisher-name>
        </nlm-citation>
      </ref>
      <ref id="ref6">
        <label>6</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Myers</surname>
              <given-names>KR</given-names>
            </name>
            <name name-style="western">
              <surname>Tham</surname>
              <given-names>WY</given-names>
            </name>
            <name name-style="western">
              <surname>Yin</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Cohodes</surname>
              <given-names>N</given-names>
            </name>
            <name name-style="western">
              <surname>Thursby</surname>
              <given-names>JG</given-names>
            </name>
            <name name-style="western">
              <surname>Thursby</surname>
              <given-names>MC</given-names>
            </name>
            <name name-style="western">
              <surname>Schiffer</surname>
              <given-names>P</given-names>
            </name>
            <name name-style="western">
              <surname>Walsh</surname>
              <given-names>JT</given-names>
            </name>
            <name name-style="western">
              <surname>Lakhani</surname>
              <given-names>KR</given-names>
            </name>
            <name name-style="western">
              <surname>Wang</surname>
              <given-names>D</given-names>
            </name>
          </person-group>
          <article-title>Unequal effects of the COVID-19 pandemic on scientists</article-title>
          <source>Nat Hum Behav</source>
          <year>2020</year>
          <month>09</month>
          <volume>4</volume>
          <issue>9</issue>
          <fpage>880</fpage>
          <lpage>883</lpage>
          <pub-id pub-id-type="doi">10.1038/s41562-020-0921-y</pub-id>
          <pub-id pub-id-type="medline">32669671</pub-id>
          <pub-id pub-id-type="pii">10.1038/s41562-020-0921-y</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref7">
        <label>7</label>
        <nlm-citation citation-type="web">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Warin</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>EpiBibR</article-title>
          <source>GitHub</source>
          <year>2020</year>
          <access-date>2021-06-21</access-date>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://github.com/warint/EpiBibR">https://github.com/warint/EpiBibR</ext-link>
          </comment>
        </nlm-citation>
      </ref>
      <ref id="ref8">
        <label>8</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Warin</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>Global Research on Coronaviruses: An R Package</article-title>
          <source>J Med Internet Res</source>
          <year>2020</year>
          <month>08</month>
          <day>11</day>
          <volume>22</volume>
          <issue>8</issue>
          <fpage>e19615</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://www.jmir.org/2020/8/e19615/"/>
          </comment>
          <pub-id pub-id-type="doi">10.2196/19615</pub-id>
          <pub-id pub-id-type="medline">32730218</pub-id>
          <pub-id pub-id-type="pii">v22i8e19615</pub-id>
          <pub-id pub-id-type="pmcid">PMC7423387</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref9">
        <label>9</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Wang</surname>
              <given-names>L</given-names>
            </name>
            <name name-style="western">
              <surname>Lo</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Chandrasekhar</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Reas</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Yang</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Burdick</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Eide</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Funk</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Katsis</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Kinney</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Li</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Liu</surname>
              <given-names>Z</given-names>
            </name>
            <name name-style="western">
              <surname>Merrill</surname>
              <given-names>W</given-names>
            </name>
            <name name-style="western">
              <surname>Mooney</surname>
              <given-names>P</given-names>
            </name>
            <name name-style="western">
              <surname>Murdick</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Rishi</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Sheehan</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Shen</surname>
              <given-names>Z</given-names>
            </name>
            <name name-style="western">
              <surname>Stilson</surname>
              <given-names>B</given-names>
            </name>
            <name name-style="western">
              <surname>Wade</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Wang</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Wang</surname>
              <given-names>NXR</given-names>
            </name>
            <name name-style="western">
              <surname>Wilhelm</surname>
              <given-names>C</given-names>
            </name>
            <name name-style="western">
              <surname>Xie</surname>
              <given-names>B</given-names>
            </name>
            <name name-style="western">
              <surname>Raymond</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Weld</surname>
              <given-names>DS</given-names>
            </name>
            <name name-style="western">
              <surname>Etzioni</surname>
              <given-names>O</given-names>
            </name>
            <name name-style="western">
              <surname>Kohlmeier</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>CORD-19: The COVID-19 Open Research Dataset</article-title>
          <source>ArXiv.</source>
          <comment>Preprint posted online on July 10, 2020
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://arxiv.org/pdf/2004.10706.pdf"/>
          </comment>
        </nlm-citation>
      </ref>
      <ref id="ref10">
        <label>10</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Aria</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Cuccurullo</surname>
              <given-names>C</given-names>
            </name>
          </person-group>
          <article-title>bibliometrix: An R-tool for comprehensive science mapping analysis</article-title>
          <source>J Informetr</source>
          <year>2017</year>
          <month>11</month>
          <volume>11</volume>
          <issue>4</issue>
          <fpage>959</fpage>
          <lpage>975</lpage>
          <pub-id pub-id-type="doi">10.1016/j.joi.2017.08.007</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref11">
        <label>11</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Moher</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Stewart</surname>
              <given-names>L</given-names>
            </name>
            <name name-style="western">
              <surname>Shekelle</surname>
              <given-names>P</given-names>
            </name>
          </person-group>
          <article-title>Establishing a new journal for systematic review products</article-title>
          <source>Syst Rev</source>
          <year>2012</year>
          <month>02</month>
          <day>09</day>
          <volume>1</volume>
          <fpage>1</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://systematicreviewsjournal.biomedcentral.com/articles/10.1186/2046-4053-1-1"/>
          </comment>
          <pub-id pub-id-type="doi">10.1186/2046-4053-1-1</pub-id>
          <pub-id pub-id-type="medline">22587946</pub-id>
          <pub-id pub-id-type="pii">2046-4053-1-1</pub-id>
          <pub-id pub-id-type="pmcid">PMC3348672</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref12">
        <label>12</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Campbell</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>McKenzie</surname>
              <given-names>JE</given-names>
            </name>
            <name name-style="western">
              <surname>Sowden</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Katikireddi</surname>
              <given-names>SV</given-names>
            </name>
            <name name-style="western">
              <surname>Brennan</surname>
              <given-names>SE</given-names>
            </name>
            <name name-style="western">
              <surname>Ellis</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Hartmann-Boyce</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Ryan</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Shepperd</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Welch</surname>
              <given-names>V</given-names>
            </name>
            <name name-style="western">
              <surname>Thomson</surname>
              <given-names>H</given-names>
            </name>
          </person-group>
          <article-title>Synthesis without meta-analysis (SWiM) in systematic reviews: Reporting guideline</article-title>
          <source>BMJ</source>
          <year>2020</year>
          <month>01</month>
          <day>16</day>
          <volume>368</volume>
          <fpage>l6890</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="http://www.bmj.com/lookup/pmidlookup?view=long&#38;pmid=31948937"/>
          </comment>
          <pub-id pub-id-type="doi">10.1136/bmj.l6890</pub-id>
          <pub-id pub-id-type="medline">31948937</pub-id>
          <pub-id pub-id-type="pmcid">PMC7190266</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref13">
        <label>13</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Mengist</surname>
              <given-names>W</given-names>
            </name>
            <name name-style="western">
              <surname>Soromessa</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Legese</surname>
              <given-names>G</given-names>
            </name>
          </person-group>
          <article-title>Method for conducting systematic literature review and meta-analysis for environmental science research</article-title>
          <source>MethodsX</source>
          <year>2020</year>
          <volume>7</volume>
          <fpage>100777</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://linkinghub.elsevier.com/retrieve/pii/S2215-0161(19)30353-X"/>
          </comment>
          <pub-id pub-id-type="doi">10.1016/j.mex.2019.100777</pub-id>
          <pub-id pub-id-type="medline">31993339</pub-id>
          <pub-id pub-id-type="pii">S2215-0161(19)30353-X</pub-id>
          <pub-id pub-id-type="pmcid">PMC6974768</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref14">
        <label>14</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Fernández del Amo</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Erkoyuncu</surname>
              <given-names>JA</given-names>
            </name>
            <name name-style="western">
              <surname>Roy</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Palmarini</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Onoufriou</surname>
              <given-names>D</given-names>
            </name>
          </person-group>
          <article-title>A systematic review of augmented reality content-related techniques for knowledge transfer in maintenance applications</article-title>
          <source>Comput Ind</source>
          <year>2018</year>
          <month>12</month>
          <volume>103</volume>
          <fpage>47</fpage>
          <lpage>71</lpage>
          <pub-id pub-id-type="doi">10.1016/j.compind.2018.08.007</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref15">
        <label>15</label>
        <nlm-citation citation-type="book">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Lasserson</surname>
              <given-names>TJ</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Higgins</surname>
              <given-names>JPT</given-names>
            </name>
          </person-group>
          <person-group person-group-type="editor">
            <name name-style="western">
              <surname>Higgins</surname>
              <given-names>JPT</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <article-title>Starting a review</article-title>
          <source>Cochrane Handbook for Systematic Reviews of Interventions. 2nd edition</source>
          <year>2019</year>
          <month>09</month>
          <day>20</day>
          <publisher-loc>Hoboken, NJ</publisher-loc>
          <publisher-name>John Wiley &#38; Sons</publisher-name>
          <fpage>3</fpage>
          <lpage>12</lpage>
        </nlm-citation>
      </ref>
      <ref id="ref16">
        <label>16</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>O'Mara-Eves</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>McNaught</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Miwa</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Ananiadou</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>Using text mining for study identification in systematic reviews: A systematic review of current approaches</article-title>
          <source>Syst Rev</source>
          <year>2015</year>
          <month>01</month>
          <day>14</day>
          <volume>4</volume>
          <fpage>5</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://systematicreviewsjournal.biomedcentral.com/articles/10.1186/2046-4053-4-5"/>
          </comment>
          <pub-id pub-id-type="doi">10.1186/2046-4053-4-5</pub-id>
          <pub-id pub-id-type="medline">25588314</pub-id>
          <pub-id pub-id-type="pii">2046-4053-4-5</pub-id>
          <pub-id pub-id-type="pmcid">PMC4320539</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref17">
        <label>17</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Marshall</surname>
              <given-names>IJ</given-names>
            </name>
            <name name-style="western">
              <surname>Wallace</surname>
              <given-names>BC</given-names>
            </name>
          </person-group>
          <article-title>Toward systematic review automation: A practical guide to using machine learning tools in research synthesis</article-title>
          <source>Syst Rev</source>
          <year>2019</year>
          <month>07</month>
          <day>11</day>
          <volume>8</volume>
          <issue>1</issue>
          <fpage>163</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://systematicreviewsjournal.biomedcentral.com/articles/10.1186/s13643-019-1074-9"/>
          </comment>
          <pub-id pub-id-type="doi">10.1186/s13643-019-1074-9</pub-id>
          <pub-id pub-id-type="medline">31296265</pub-id>
          <pub-id pub-id-type="pii">10.1186/s13643-019-1074-9</pub-id>
          <pub-id pub-id-type="pmcid">PMC6621996</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref18">
        <label>18</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Marshall</surname>
              <given-names>IJ</given-names>
            </name>
            <name name-style="western">
              <surname>Noel-Storr</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Kuiper</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Wallace</surname>
              <given-names>BC</given-names>
            </name>
          </person-group>
          <article-title>Machine learning for identifying randomized controlled trials: An evaluation and practitioner's guide</article-title>
          <source>Res Synth Methods</source>
          <year>2018</year>
          <month>12</month>
          <volume>9</volume>
          <issue>4</issue>
          <fpage>602</fpage>
          <lpage>614</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="http://europepmc.org/abstract/MED/29314757"/>
          </comment>
          <pub-id pub-id-type="doi">10.1002/jrsm.1287</pub-id>
          <pub-id pub-id-type="medline">29314757</pub-id>
          <pub-id pub-id-type="pmcid">PMC6030513</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref19">
        <label>19</label>
        <nlm-citation citation-type="confproc">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Neumann</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>King</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Beltagy</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Ammar</surname>
              <given-names>W</given-names>
            </name>
          </person-group>
          <article-title>ScispaCy: Fast and robust models for biomedical natural language processing</article-title>
          <source>Proceedings of the 18th BioNLP Workshop and Shared Task</source>
          <year>2019</year>
          <conf-name>18th BioNLP Workshop and Shared Task</conf-name>
          <conf-date>August 1, 2019</conf-date>
          <conf-loc>Florence, Italy</conf-loc>
          <publisher-loc>Stroudsburg, PA</publisher-loc>
          <publisher-name>Association for Computational Linguistics</publisher-name>
          <fpage>319</fpage>
          <lpage>327</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://aclanthology.org/W19-5034.pdf"/>
          </comment>
          <pub-id pub-id-type="doi">10.18653/v1/w19-5034</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref20">
        <label>20</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Lee</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Yoon</surname>
              <given-names>W</given-names>
            </name>
            <name name-style="western">
              <surname>Kim</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Kim</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>Kim</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>So</surname>
              <given-names>C</given-names>
            </name>
            <name name-style="western">
              <surname>Kang</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <article-title>BioBERT: A pre-trained biomedical language representation model for biomedical text mining</article-title>
          <source>Bioinformatics</source>
          <year>2020</year>
          <month>02</month>
          <day>15</day>
          <volume>36</volume>
          <issue>4</issue>
          <fpage>1234</fpage>
          <lpage>1240</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="http://europepmc.org/abstract/MED/31501885"/>
          </comment>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btz682</pub-id>
          <pub-id pub-id-type="medline">31501885</pub-id>
          <pub-id pub-id-type="pii">5566506</pub-id>
          <pub-id pub-id-type="pmcid">PMC7703786</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref21">
        <label>21</label>
        <nlm-citation citation-type="confproc">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Beltagy</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Lo</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Cohan</surname>
              <given-names>A</given-names>
            </name>
          </person-group>
          <article-title>SciBERT: A pretrained language model for scientific text</article-title>
          <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</source>
          <year>2019</year>
          <conf-name>2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</conf-name>
          <conf-date>November 3-7, 2019</conf-date>
          <conf-loc>Hong Kong, China</conf-loc>
          <publisher-loc>Stroudsburg, PA</publisher-loc>
          <publisher-name>Association for Computational Linguistics</publisher-name>
          <fpage>3615</fpage>
          <lpage>3620</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://aclanthology.org/D19-1371.pdf"/>
          </comment>
          <pub-id pub-id-type="doi">10.18653/v1/d19-1371</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref22">
        <label>22</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Liang</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Xie</surname>
              <given-names>P</given-names>
            </name>
          </person-group>
          <article-title>Identifying radiological findings related to COVID-19 from medical literature</article-title>
          <source>ArXiv.</source>
          <comment>Preprint posted online on April 4, 2020
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://arxiv.org/pdf/2004.01862.pdf"/>
          </comment>
        </nlm-citation>
      </ref>
      <ref id="ref23">
        <label>23</label>
        <nlm-citation citation-type="confproc">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Devlin</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Chang</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Lee</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Toutanova</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <article-title>BERT: Pre-training of deep bidirectional transformers for language understanding</article-title>
          <source>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)</source>
          <year>2019</year>
          <conf-name>2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</conf-name>
          <conf-date>June 2-7, 2019</conf-date>
          <conf-loc>Minneapolis, MN</conf-loc>
          <publisher-loc>Stroudsburg, PA</publisher-loc>
          <publisher-name>Association for Computational Linguistics</publisher-name>
          <fpage>4171</fpage>
          <lpage>4186</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://aclanthology.org/N19-1423.pdf"/>
          </comment>
          <pub-id pub-id-type="doi">10.18653/v1/n19-1423</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref24">
        <label>24</label>
        <nlm-citation citation-type="confproc">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Espinosa-Anke</surname>
              <given-names>L</given-names>
            </name>
            <name name-style="western">
              <surname>Schockaert</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>SeVeN: Augmenting word embeddings with unsupervised relation vectors</article-title>
          <source>Proceedings of the 27th International Conference on Computational Linguistics</source>
          <year>2018</year>
          <conf-name>27th International Conference on Computational Linguistics</conf-name>
          <conf-date>August 20-26, 2018</conf-date>
          <conf-loc>Sante Fe, NM</conf-loc>
          <publisher-loc>Stroudsburg, PA</publisher-loc>
          <publisher-name>Association for Computational Linguistics</publisher-name>
          <fpage>2653</fpage>
          <lpage>2665</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://aclanthology.org/C18-1225.pdf"/>
          </comment>
        </nlm-citation>
      </ref>
      <ref id="ref25">
        <label>25</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Ahamed</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Samad</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <article-title>Information mining for COVID-19 research from a large volume of scientific literature</article-title>
          <source>ArXiv.</source>
          <comment>Preprint posted online on April 5, 2020
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://arxiv.org/pdf/2004.02085.pdf"/>
          </comment>
        </nlm-citation>
      </ref>
      <ref id="ref26">
        <label>26</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Koseoglu</surname>
              <given-names>MA</given-names>
            </name>
          </person-group>
          <article-title>Mapping the institutional collaboration network of strategic management research: 1980–2014</article-title>
          <source>Scientometrics</source>
          <year>2016</year>
          <month>2</month>
          <day>22</day>
          <volume>109</volume>
          <issue>1</issue>
          <fpage>203</fpage>
          <lpage>226</lpage>
          <pub-id pub-id-type="doi">10.1007/s11192-016-1894-5</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref27">
        <label>27</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Grinnell</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Higgins</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Yost</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Ochuba</surname>
              <given-names>O</given-names>
            </name>
            <name name-style="western">
              <surname>Lobl</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Grimes</surname>
              <given-names>P</given-names>
            </name>
            <name name-style="western">
              <surname>Wysong</surname>
              <given-names>A</given-names>
            </name>
          </person-group>
          <article-title>The proportion of male and female editors in women's health journals: A critical analysis and review of the sex gap</article-title>
          <source>Int J Womens Dermatol</source>
          <year>2020</year>
          <month>01</month>
          <volume>6</volume>
          <issue>1</issue>
          <fpage>7</fpage>
          <lpage>12</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://linkinghub.elsevier.com/retrieve/pii/S2352-6475(19)30123-6"/>
          </comment>
          <pub-id pub-id-type="doi">10.1016/j.ijwd.2019.11.005</pub-id>
          <pub-id pub-id-type="medline">32025554</pub-id>
          <pub-id pub-id-type="pii">S2352-6475(19)30123-6</pub-id>
          <pub-id pub-id-type="pmcid">PMC6997826</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref28">
        <label>28</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>EG</given-names>
            </name>
            <name name-style="western">
              <surname>Jayabalasingham</surname>
              <given-names>B</given-names>
            </name>
            <name name-style="western">
              <surname>Collins</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Geertzen</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Bui</surname>
              <given-names>C</given-names>
            </name>
            <name name-style="western">
              <surname>Dominici</surname>
              <given-names>F</given-names>
            </name>
          </person-group>
          <article-title>Gender disparities in invited commentary authorship in 2459 medical journals</article-title>
          <source>JAMA Netw Open</source>
          <year>2019</year>
          <month>10</month>
          <day>02</day>
          <volume>2</volume>
          <issue>10</issue>
          <fpage>e1913682</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://jamanetwork.com/journals/jamanetworkopen/fullarticle/10.1001/jamanetworkopen.2019.13682"/>
          </comment>
          <pub-id pub-id-type="doi">10.1001/jamanetworkopen.2019.13682</pub-id>
          <pub-id pub-id-type="medline">31642926</pub-id>
          <pub-id pub-id-type="pii">2753395</pub-id>
          <pub-id pub-id-type="pmcid">PMC6820037</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref29">
        <label>29</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Filardo</surname>
              <given-names>G</given-names>
            </name>
            <name name-style="western">
              <surname>da Graca</surname>
              <given-names>B</given-names>
            </name>
            <name name-style="western">
              <surname>Sass</surname>
              <given-names>DM</given-names>
            </name>
            <name name-style="western">
              <surname>Pollock</surname>
              <given-names>BD</given-names>
            </name>
            <name name-style="western">
              <surname>Smith</surname>
              <given-names>EB</given-names>
            </name>
            <name name-style="western">
              <surname>Martinez</surname>
              <given-names>MA</given-names>
            </name>
          </person-group>
          <article-title>Trends and comparison of female first authorship in high impact medical journals: Observational study (1994-2014)</article-title>
          <source>BMJ</source>
          <year>2016</year>
          <month>03</month>
          <day>02</day>
          <volume>352</volume>
          <fpage>i847</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="http://www.bmj.com/lookup/pmidlookup?view=long&#38;pmid=26935100"/>
          </comment>
          <pub-id pub-id-type="doi">10.1136/bmj.i847</pub-id>
          <pub-id pub-id-type="medline">26935100</pub-id>
          <pub-id pub-id-type="pmcid">PMC4775869</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref30">
        <label>30</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>McClelland</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Mitin</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Jagsi</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Thomas</surname>
              <given-names>CR</given-names>
            </name>
            <name name-style="western">
              <surname>Jaboin</surname>
              <given-names>JJ</given-names>
            </name>
          </person-group>
          <article-title>Importance of first and second authorship in assessing citation-based scholarly activity of US radiation oncology residents and subsequent choice of academic versus private practice career</article-title>
          <source>J Am Coll Radiol</source>
          <year>2018</year>
          <month>09</month>
          <volume>15</volume>
          <issue>9</issue>
          <fpage>1322</fpage>
          <lpage>1325</lpage>
          <pub-id pub-id-type="doi">10.1016/j.jacr.2018.05.015</pub-id>
          <pub-id pub-id-type="medline">29933976</pub-id>
          <pub-id pub-id-type="pii">S1546-1440(18)30607-0</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref31">
        <label>31</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Qureshi</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Lê</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Li</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Ibrahim</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Dickersin</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <article-title>Gender and editorial authorship in high-impact epidemiology journals</article-title>
          <source>Am J Epidemiol</source>
          <year>2019</year>
          <month>12</month>
          <day>31</day>
          <volume>188</volume>
          <issue>12</issue>
          <fpage>2140</fpage>
          <lpage>2145</lpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="http://europepmc.org/abstract/MED/30995311"/>
          </comment>
          <pub-id pub-id-type="doi">10.1093/aje/kwz094</pub-id>
          <pub-id pub-id-type="medline">30995311</pub-id>
          <pub-id pub-id-type="pii">5474949</pub-id>
          <pub-id pub-id-type="pmcid">PMC7326372</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref32">
        <label>32</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Erren</surname>
              <given-names>TC</given-names>
            </name>
            <name name-style="western">
              <surname>Groß</surname>
              <given-names>JV</given-names>
            </name>
            <name name-style="western">
              <surname>Shaw</surname>
              <given-names>DM</given-names>
            </name>
            <name name-style="western">
              <surname>Selle</surname>
              <given-names>B</given-names>
            </name>
          </person-group>
          <article-title>Representation of women as authors, reviewers, editors in chief, and editorial board members at 6 general medical journals in 2010 and 2011</article-title>
          <source>JAMA Intern Med</source>
          <year>2014</year>
          <month>04</month>
          <volume>174</volume>
          <issue>4</issue>
          <fpage>633</fpage>
          <lpage>635</lpage>
          <pub-id pub-id-type="doi">10.1001/jamainternmed.2013.14760</pub-id>
          <pub-id pub-id-type="medline">24566922</pub-id>
          <pub-id pub-id-type="pii">1832194</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref33">
        <label>33</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Schisterman</surname>
              <given-names>EF</given-names>
            </name>
            <name name-style="western">
              <surname>Swanson</surname>
              <given-names>CW</given-names>
            </name>
            <name name-style="western">
              <surname>Lu</surname>
              <given-names>Y</given-names>
            </name>
            <name name-style="western">
              <surname>Mumford</surname>
              <given-names>SL</given-names>
            </name>
          </person-group>
          <article-title>The changing face of epidemiology</article-title>
          <source>Epidemiology</source>
          <year>2017</year>
          <volume>28</volume>
          <issue>2</issue>
          <fpage>159</fpage>
          <lpage>168</lpage>
          <pub-id pub-id-type="doi">10.1097/ede.0000000000000593</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref34">
        <label>34</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Lotka</surname>
              <given-names>AJ</given-names>
            </name>
          </person-group>
          <article-title>The frequency distribution of scientific productivity</article-title>
          <source>J Wash Acad Sci</source>
          <year>1926</year>
          <month>06</month>
          <day>17</day>
          <volume>16</volume>
          <issue>12</issue>
          <fpage>317</fpage>
          <lpage>323</lpage>
        </nlm-citation>
      </ref>
      <ref id="ref35">
        <label>35</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Qureshi</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Han</surname>
              <given-names>G</given-names>
            </name>
            <name name-style="western">
              <surname>Fapohunda</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Abariga</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Wilson</surname>
              <given-names>R</given-names>
            </name>
            <name name-style="western">
              <surname>Li</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>Authorship diversity among systematic reviews in eyes and vision</article-title>
          <source>Syst Rev</source>
          <year>2020</year>
          <month>08</month>
          <day>27</day>
          <volume>9</volume>
          <issue>1</issue>
          <fpage>192</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://systematicreviewsjournal.biomedcentral.com/articles/10.1186/s13643-020-01451-1"/>
          </comment>
          <pub-id pub-id-type="doi">10.1186/s13643-020-01451-1</pub-id>
          <pub-id pub-id-type="medline">32854764</pub-id>
          <pub-id pub-id-type="pii">10.1186/s13643-020-01451-1</pub-id>
          <pub-id pub-id-type="pmcid">PMC7450569</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref36">
        <label>36</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Tangcharoensathien</surname>
              <given-names>V</given-names>
            </name>
            <name name-style="western">
              <surname>Calleja</surname>
              <given-names>N</given-names>
            </name>
            <name name-style="western">
              <surname>Nguyen</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>Purnat</surname>
              <given-names>T</given-names>
            </name>
            <name name-style="western">
              <surname>D'Agostino</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Garcia-Saiso</surname>
              <given-names>S</given-names>
            </name>
            <name name-style="western">
              <surname>Landry</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Rashidian</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Hamilton</surname>
              <given-names>C</given-names>
            </name>
            <name name-style="western">
              <surname>AbdAllah</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Ghiga</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Hill</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Hougendobler</surname>
              <given-names>D</given-names>
            </name>
            <name name-style="western">
              <surname>van Andel</surname>
              <given-names>J</given-names>
            </name>
            <name name-style="western">
              <surname>Nunn</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Brooks</surname>
              <given-names>I</given-names>
            </name>
            <name name-style="western">
              <surname>Sacco</surname>
              <given-names>PL</given-names>
            </name>
            <name name-style="western">
              <surname>De Domenico</surname>
              <given-names>M</given-names>
            </name>
            <name name-style="western">
              <surname>Mai</surname>
              <given-names>P</given-names>
            </name>
            <name name-style="western">
              <surname>Gruzd</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Alaphilippe</surname>
              <given-names>A</given-names>
            </name>
            <name name-style="western">
              <surname>Briand</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>Framework for managing the COVID-19 infodemic: Methods and results of an online, crowdsourced WHO technical consultation</article-title>
          <source>J Med Internet Res</source>
          <year>2020</year>
          <month>06</month>
          <day>26</day>
          <volume>22</volume>
          <issue>6</issue>
          <fpage>e19659</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://www.jmir.org/2020/6/e19659/"/>
          </comment>
          <pub-id pub-id-type="doi">10.2196/19659</pub-id>
          <pub-id pub-id-type="medline">32558655</pub-id>
          <pub-id pub-id-type="pii">v22i6e19659</pub-id>
          <pub-id pub-id-type="pmcid">PMC7332158</pub-id>
        </nlm-citation>
      </ref>
      <ref id="ref37">
        <label>37</label>
        <nlm-citation citation-type="web">
          <article-title>Munich Security Conference</article-title>
          <source>World Health Organization</source>
          <year>2020</year>
          <month>02</month>
          <day>15</day>
          <access-date>2021-11-02</access-date>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://www.who.int/director-general/speeches/detail/munich-security-conference">https://www.who.int/director-general/speeches/detail/munich-security-conference</ext-link>
          </comment>
        </nlm-citation>
      </ref>
    </ref-list>
  </back>
</article>
