Background: Precision medicine (PM) is playing a more and more important role in clinical practice. In recent years, the scale of PM research has been growing rapidly. Many reviews have been published to facilitate a better understanding of the status of PM research. However, there is still a lack of research on the intellectual structure in terms of topics.
Objective: This study aimed to identify the intellectual structure and evolutionary trends of PM research through the application of various social network analysis and visualization methods.
Methods: The bibliographies of papers published between 2009 and 2018 were extracted from the Web of Science database. Based on the statistics of keywords in the papers, a coword network was generated and used to calculate network indicators of both the entire network and local networks. Communities were then detected to identify subdirections of PM research. Topological maps of networks, including networks between communities and within each community, were drawn to reveal the correlation structure. An evolutionary graph and a strategic graph were finally produced to reveal research venation and trends in discipline communities.
Results: The results showed that PM research involves extensive themes and, overall, is not balanced. A minority of themes with a high frequency and network indicators, such as Biomarkers, Genomics, Cancer, Therapy, Genetics, Drug, Target Therapy, Pharmacogenomics, Pharmacogenetics, and Molecular, can be considered the core areas of PM research. However, there were five balanced theme directions with distinguished status and tendencies: Cancer, Biomarkers, Genomics, Drug, and Therapy. These were shown to be the main branches that were both focused and well developed. Therapy, though, was shown to be isolated and undeveloped.
Conclusions: The hotspots, structures, evolutions, and development trends of PM research in the past ten years were revealed using social network analysis and visualization. In general, PM research is unbalanced, but its subdirections are balanced. The clear evolutionary and developmental trend indicates that PM research has matured in recent years. The implications of this study involving PM research will provide reasonable and effective support for researchers, funders, policymakers, and clinicians.
Precision medicine (PM), also called personalized medicine, is a new medical model aimed at providing precise diagnosis, therapy, prognosis prediction, and prevention strategies based on information in a patient’s genes, proteins, and their environment . The scientific basis of PM is molecular pathological epidemiology, and it aims to identify the relationship between biomarkers, the drug response, and outcome in disease [ , ]. During the Human Genome Project, it took “one dollar one bp” and 13 years to complete the sequencing of the whole genome. Owing to breakthroughs in techniques and lower prices, effective, high-throughput, and accurate sequencing can be applied to map genomics, metabolomics, microbiomics, and proteomics, which has led to the discovery of increasingly more causative biomarkers [ - ]. However, clinicians currently use clinical trials and pilot studies to assess the relationship between biomarkers and diseases.
Great progress has been made in personalized treatment in the field of oncology. According to a meta-analysis of phase II clinical trials, a personalized treatment strategy across malignancies yields a better outcome and lower likelihood of death than nonpersonalized targeted therapies . Thus, it can be expected that PM will use new knowledge, including the integration of clinical medicine, pathology, epidemiology, and omics, to provide better therapies for patients.
Owing to the potential importance of PM, a few leading experts reviewed this new medical approach in regards to its relevant history, clinical applications, and any interdisciplinary research associated with PM, such as bioinformatics, artificial intelligence, and big data [- ]. It is logical to assess the status of the subfields or branches. However, there are still some limitations regarding the review themes; specifically, the overall structure and characteristics of PM research have not been mapped, the relationship between the subfields has not been revealed, and the predictions regarding PM in those reviews were not made based on an accurate quantitative analysis.
Coword analysis is a bibliometric method used to identify relationships between subfields within research areas and to measure the strength of the relationships [, ]. According to the co-occurrence correlation, the keywords can be classified into clusters and displayed as network maps. Some other indices, such as density and centrality, can be used to evaluate the shape of the maps. By comparing the network maps of different periods, the dynamic evolution of one research area can be clearly displayed. Owing to the characteristics of “quantitative” and “zoom” in coword analysis, scientists can uncover the links within a subfield, obtain the overall structure of networks according to simplified graphs, and focus on one certain subarea to obtain more information [ ].
Coword analysis has been widely used to illustrate the intellectual structure and developmental status of research areas [- ]. Our study applied this method to explore the overall research structure, correlation among themes, and entire set of evolutionary trends in the field of PM. Our results may help scientists and clinicians better understand its developmental characteristics and even yield new insights on breakthroughs.
PM is a new medical approach that classifies patients into different groups related to their diagnosis, treatment, and prevention based on individual gene, protein, or environmental information. It is noteworthy that the terms “precision medicine,” “personalized medicine,” “stratified medicine,” and “P4 [predictive, preventative, personalized, and participatory] medicine” are still interchangeably used by some organizations and scientists [, ]. In the period after the Human Genome Project, considerably more effort was put into exploring the relationship between genomic information and patient care [ ]. In 2015, the Precision Medicine Initiative was launched by Barack Obama, then the president of the United States, indicating the beginning of a new medical age [ ].
Every person has polymorphisms in their DNA, RNA, and proteins, as well as methylation. Recent scientific methods have enabled the analysis of biomarkers using omics techniques, including genomics, epigenomics, transcriptomics, proteomics, metabolomics, microbiome analysis, and immunomics. However, pathologists, epidemiologists, and clinicians have also made many contributions to the discovery of links between biomarkers and clinical features . Advances in the PM model have played an important role in disease diagnosis, treatment, and prevention. Cancer treatment is the field in which PM originated and it has seen the most mature use of PM, such as when cancer genomics were successfully applied in personalized medicine. With the deep awareness of genetic variations of tumors, treatment strategy can be tailored to the group of cancer patients with the same genotype. For example, the human epidermal growth factor receptor 2 (HER2) gene is amplified and overexpressed in 25-30% of breast cancers. According to evidence from clinical trials, trastuzumab, a targeted therapy drug, increases the clinical benefit of first-line chemotherapy in metastatic breast cancer that overexpresses HER2 [ ].
PM also plays an important role in disease treatment and prevention. DNA information from individual phenotyping will lead to more effective and accurate treatment and prevention. For example, the high risk in women for developing breast cancer is strongly correlated with mutations in BRCA1 or BRCA2 . Clinicians can make better decisions regarding prevention for patients carrying these genetic mutations. Pharmacologists and genomic scientists have also provided many contributions to the assessment of genetic variations that affect drug discovery and clinical pharmacology [ ]. Considering information on personal phenotypes, physicians can provide reasonable drug prescriptions that represent targeted therapies and are more cost-effective, but also have fewer side effects [ ]. Furthermore, PM has changed clinical trials. Among the minority of clinical trials involving PM, the proportion of trials on adult cancers in the United States that require a genomic alteration for enrollment has increased substantially over the past several years [ ]. Finally, PM will yield some challenges in the field of ethics, patient privacy, and refurbishment policies, which will require more attention from scientists and policymakers in the field [ , ].
With the pace of PM research rapidly increasing, a large number of studies have been performed from different perspectives. Many reviews have been published to facilitate a better understanding of the status of PM research, as well as clarifying the concept, history, clinical application, ethical concerns, and technological challenges. The efforts listed above have helped raise awareness of the new clinical model among patients, clinicians, and even health policy makers. They have played an important role in the development of intelligent support for decision-making, clinical practice, and public health policies.
According to recent reviews, the features of PM research are as follows. First, some reviews reported by top experts in the field of PM research discuss the foundation, techniques, applications, and perspectives of this new discipline [, , ]. Second, PM research involves significant interdisciplinary collaboration. Many scientists, such as those specializing in clinical medicine, clinical oncology, systems biology, or biochemistry, are focused on the development of this new field. Advanced technologies, such as next-generation sequencing (NGS), molecular imaging, omics (genomics, proteomics, metabolomics, and microbiomics), nanotechnology, big data, and artificial intelligence, have been applied to laboratory tests in PM to achieve more accurate results [ , , - ]. Third, the scope of the PM model has been expanded from clinical oncology to noncancer disciplines. This strategy has led to several innovations in the diagnosis and treatment of mental illness, cardiovascular disease, asthma, and inflammatory bowel disease [ - ]. Finally, the reviewers also emphasized the issues to be solved in the development of PM, such as technological bottlenecks, patient privacy, and ethical challenges [ - ]. The information provided in the reviews made a large contribution to the global acknowledgment of PM.
The Rationale for the Study
Research on PM is still increasing, and some important discoveries have already been beneficial to patients. However, there is still a long way to go in the utilization of PM. How can interdisciplinary researchers start studies? What type of public policy really makes sense regarding the field? How can funders ensure that investment works effectively? All these decisions should be made based on knowledge of PM, so great efforts have been made to describe the nature of this new field. The aim of our study was to address the following problems:
- What is the distribution of topics in PM research?
- What is the correlation structure of topics in PM research?
- What are the evolutionary venations and development trends of PM research?
Data Collection and Processing
According to previous studies, papers in the Web of Science Core Collection (WOSCC) can represent the status of medical science, including PM; therefore, we chose WOSCC as our data source. Data processing is shown in.
Papers were collected from the WOSCC that covered the period from 1999 to 2018. In this study, initial retrieval was conducted using “precision medicine,” “P4 medicine,” “personalized medicine,” and “stratified medicine” as terms in the field of Topic to guarantee a recall ratio. It included the document types of Article, Review, and Proceedings. The retrieval strategy is illustrated as follows: TOPIC: (“precision medicine”) or TOPIC: (“personalized medicine”) or TOPIC: (“individualized medicine’’) or TOPIC: (“P4 medicine’’) or TOPIC: (“stratified medicine”). Refined by: Document Types: (Article or Review or Proceedings Paper) Timespan: 1999 to 2018. Indices: SCI-EXPANDED, SSCI, A&HCI, CPCI-S, CPCI-SSH, ESCI, CCR-EXPANDED, and IC.
A total of 25,573 publications were retrieved, and their bibliographic records were downloaded through the function Save to Other File Formats provided by WOS. Next, a text file containing all records in Tab-delimited (Win) format was obtained. In general, records without keywords data were excluded. Meanwhile, publications not containing the search terms above in Title (the TI field) or Keywords (the DE field) were identified as unrelated to PM research  and also excluded. Thus, 10,177 records were selected as the final data sample, and there were 17,818 unique keywords. shows the number of papers in the sample by year. The PM research started proliferating in 2000, and the number of related papers increased each year.
In this study, mainstream keywords of high frequency were selected for further analysis, that is, based on their coword network. The largest connected component extracted from the whole coword network represents the mainstream research directions of one field . After several rounds of testing of the largest connected component using social network analysis, keywords with a frequency of ≥20 were selected. The sum of the frequency of these words accounts for 52.19% (24,492/46,921) of the total that can represent mainstream research of PM. Meanwhile, keywords were normalized to ensure consistent treatment of the singular and plural forms of words, unifying the synonyms and clarifying the homonyms. For example, the items “target, targets, targeting, target-Specific” were replaced by “Targeted Therapy.” Keywords with a frequency of less than 20 were merged into broader terms. For example, “Hematopoietic Stem Cell” (frequency 6) is replaced by “Stem Cell” (frequency 107). General items which were too broad to be of practical connotation, such as “medicine,” “research,” and “mechanism,” were removed. With the replacement, 244 related words with a frequency greater than 20, which were collected as the basic sample for analysis, were used in this study.
Methodology and Tools
Keywords in a paper provide an adequate description of its contents. A study on the correlation of keywords can reveal connotations of contents . Co-occurrence [ ] of words (coword) is a kind of correlation in connotation, that is to say, two keywords co-occurring within the same document (eg, paper) are an implication of correlation between topics which they refer to [ ]. The high frequency of co-occurrence of each keyword pair means a high degree of correlation between them. Coword analysis has been proven to be effective in identifying main themes and revealing the intellectual structure and patterns of a research field in many previous studies [ , ]. In this study, we used coword analysis that combines both social network analysis tools and scientific mapping tools to analyze the research field of PM. These tools were used to detect and visualize its overall research structure and patterns, conceptual subdomains (thematic communities), and thematic evolution.
Social Network Analysis
Many methods have been used to conduct coword analysis, including the method of social network analysis. It is derived from mathematical graph theory, which computes indicators of a coword network and identifies characteristics of the whole network and an individual network . SCI2, version 1.2 beta (Cyberinfrastructure for Network Science Center, Indiana University, Bloomington, Indiana, United States), is an effective bibliometric tool to extract items (eg, keywords, authors, and institutions) from bibliography records of articles or other structured research literature [ ]. Its feasibility and effectiveness has been widely proven in previous studies [ ]. In this study, the bibliographic record file was imported into SCI2 to obtain statistical data of keywords and coword network data. Coword network data include both keywords and their links with the respective weights. The weight of a keyword is its frequency of occurrence and that of the link between the keyword pair is its frequency of co-occurrence.
As unconnected or uncorrelated keywords cannot reflect main thematic subdomains and as what we focused on is the largest component , we used SCI2 to exclude isolated nodes and extract the largest component of the coword network [ , ]. Network indicators of the largest component of the coword network were then calculated using Pajek [ ], including centralization (centrality), density, and the clustering coefficient. Network indicators of the whole network or nodes can be used to identify the overall intellectual structure and patterns of one research field as well as a keyword’s characteristics, such as power, stratification, ranking, and inequality, in the network [ ].
Centralization measures the overall characteristics of global network, degree centralization measures the centripetal degree, and closeness centralization measures the proximity degree between any 2 nodes in the network. Its high level equals the close distance between any 2 nodes on the whole. Betweenness centralization indicates the degree of correlation between any 2 nodes through a third one (bridge), and its high level equal the high possibility of correlation through a bridge. Similarly, centrality, the individual network indicator, measures the capacity of one node in network. High degree centrality of one node indicates that it is central in the network and is correlated to many other nodes. It also indicates its powerful capacity of influence and control. High closeness centrality equals the capacity of one node that correlates others as short as possible or directly correlates others. High betweenness centrality equals the powerful role as a bridge to correlate other 2 nodes. Density measures the correlation strength within the network . It means the higher the density, the more mature the research field. The clustering coefficient indicates the possibility that keywords are clustered into a contrasting group [ ].
In addition, community detection is an effective method to discover research directions or subfields according to the correlation structure of the network . The Louvain algorithm embedded in Pajek, the most common algorithm used to detect communities, was also used to detect communities in this study [ ]. Different communities, including highly correlated keywords, represent different research directions or subfields.
Visualization and Evolution Analysis
Visualization is an important method to intuitively display the intellectual structure of coword correlation, the thematic evolution of a research field, and even the comparative development trends of subfields [, ]. After repeated comparison of several visualization tools, VOSviewer, version 1.6.13, Centre for Science and Technology Studies, Leiden University, Leiden, Netherlands), was found to enable better visualization of topological networks and was selected to conduct the visualization in this study, including the overall network with communities and the individual networks [ ]. The research themes of PM have been evolving over time. We divided bibliographic records chronologically and imported them into Cortext [ ]. Evolutionary trends of the keyword community were visualized, allowing for a layout of the dynamics as depicted by tubes in an alluvial model [ ].
A strategic diagram indicates the comparative status and evolutionary trends of subfields of one research field. It is a two-dimensional (2D) map in which the x-axis represents centrality and the y-axis represents density . The origin of the axes is determined by the average centrality and density. Centrality can be understood as a measurement of importance and the degree of core in the whole research network. Density can be understood as a measurement of maturity of a theme’s development. A total of 4 quadrants in a strategic diagram represent different meanings. Themes in Quadrant 1 are central and developed, with both high centrality and high density; in Quadrant 2, they are highly developed but isolated, with high density and low centrality; in Quadrant 3, they are marginal and isolated (emerging or declining), with both low centrality and low density; and in Quadrant 4, they are central with a trend toward high centrality but low density. Therefore, the developing status and trends of themes or research communities can be predicted by a strategic diagram.
Themes Involved in Precision Medicine Research
In this study, a total of 17,818 keywords were extracted from the sample, and the total frequency was 47,883. The frequency distribution conforms to the power law distribution with an exponent of –1.32 (). This shows that the frequency of very few keywords is very high, whereas most keywords are of extremely low frequency. The results indicate that the subject trends in current PM research are obvious, and researchers are inclined to focus on a few major themes and pay less attention to most other themes in the PM field.
lists the 100 most frequent keywords, the sum of the frequencies of which accounts for up to 39% of the total frequency. The keywords are so typical and representative in research topics that they can be considered as the mainstream themes of PM research in the past decade. It is interesting that the proportion of the 10 most frequent keywords is 14.2%. Biomarkers and Genomics are the first echelon; Cancer, Therapy, and Genetics are the second echelon; Drug, Target Therapy, Pharmacogenomics, Pharmacogenetics, and Molecular belong to the third echelon. The findings highlight the core and mainstream of PM research topics and show the imbalanced status of PM research as well.
|80||Electronic Health Records||89|
|85||Circulating Tumor Cell||86|
aNGS: next-generation sequencing.
bSNP: Single Nucleotide Polymorphisms.
cEGFR: epidermal growth factor receptor.
dGWAS: genome-wide association studies.
ePET: positron emission tomography.
fNSCLC: non–small cell lung cancer.
Correlation Network Analysis of Precision Medicine Research
Network Indicators of the Correlation Structure of the Themes
The 244 keywords (frequency above 20) in the study generate a total of 9178 edges, which constitute a keyword correlation network. It is known that the network is the largest connected component, indicating that a relatively consistent mainstream direction has been formed in PM studies in recent years. As shown in, the degree centralization and closeness centralization of the keywords are relatively high, indicating that the overall network is more concentric, and most of the keywords are clustered, centering on a few core keywords. We also discovered that the keywords in the network tend to be directly correlated rather than indirectly correlated. The path between keywords is short and tends to be directly correlated with the core words. Therefore, it can be concluded that a few core words have very strong control of the entire network. According to the characteristics listed above, we can draw the following conclusions: current PM research is very centralized, the difference between the core words and noncore words is obvious, and the main themes are formed around the core words. However, the lower betweenness centrality also indicates that most of the keyword correlations in PM research can form direct correlations without other words working as bridges. Combined with higher clustering coefficients, it can be observed that there are multiple thematic directions in this PM study with a large degree of difference. The correlation between keywords within the subject direction is higher than that between the other directions. Finally, the overall network is closely correlated, equaling a high network density. This result means that PM research has formed a systematic, relatively mature research pattern.
In the same way, the indices used to describe each keyword (degree centrality, closeness centrality, and betweenness centrality) represent their position and role in the network. As shown in, , and , the keywords, such as Biomarkers, Genomics, Therapy, Cancer, Genetics, Drug, Prediction, Pharmacogenomics, Target Therapy, and Molecular, occupied the top 10 positions on the lists of degree centrality and closeness centrality. It is particularly worth mentioning that the orders of the keywords in the lists of degree centrality and closeness centrality are identical, and both indicators are of high value. These words are clustered around a large number of keywords to a large extent, which are themselves directly correlated with other keywords. This indicates that these keywords are very important, are in the core position, and have a strong influence on the entirety of PM research. In contrast, the value of betweenness centrality is low. Instead of using an intermedia or a bridge word, most keywords are directly correlated, and the connection path is short. Interestingly, the ranking of the keyword “Therapy” is significantly improved in the list of betweenness centrality compared with its position in the lists of degree centrality and closeness centrality. It indicates that “Therapy” plays an important role of bridging other keywords in the overall network in PM research.
|Number of nodes||244|
|Number of edges||9178|
|Network all degree centralization||0.6214|
|Network all closeness centralization||0.6685|
|Network betweenness centralization||0.0277|
|Network clustering coefficient||0.4843|
The Themes of the Correlated Communities
On the basis of community detection in the coword network, PM research has focused on 5 theme communities or research subdirections in the last decade. These communities are visualized asin the next section. Modularity (0.2077) [ ] of community detection indicates a good result to distinguish topic communities in PM research. Each community has a strong internal correlation, and the distinction between them is obvious. These communities are as follows: C1-Cancer (including Target Therapy, Molecular, Breast Cancer, NGS, Tumor, Mutation, Clinical Trials, Gene, and Prognosis), C2-Biomarkers (including Prediction, Diagnostics, Proteomics, Phenotype, Omics, Metabolism, Bioinformatics, Asthma, and Inflammation), C3-Genomics (including Genetics, Sequencing, Epigenetics, Genetic Test, Risk, Genome-Wide Association Studies, Translation Medicine, Ethics, and Health Care), C4-Drug (including Pharmacogenomics, Pharmacogenetics, Single Nucleotide Polymorphisms, Pharmacology, Polymorphism, Genotype, Drug Development, Pharmacokinetics, and Depression), and C5-Therapy (including Therapy, Imaging, Stem Cell, Nanotechnology, positron emission tomography [PET], Drug Delivery, Theranostics, MRI, Molecular Imaging, and Brain). According to the research scale, PM studies can be divided into 3 levels: Level 1, C1, is the largest level; Level 2, including C2, C3, and C4, is the medium scale; and Level 3, C5, is the smallest. On the basis of the results above, the study of PM mainly focused on Cancer, Biomarkers, Genomics, and Drug in the past decade. More importantly, these themes represent the mainstream direction of PM studies; however, the C5-Therapy community is still weaker than the other 4 communities.
Visualization of the Theme Correlation Network
The structural characteristics of PM research need to be further assessed by the visualization of its coword networks. As shown in, each node represents one theme community or research subdirection. The size of the node, determined by the sum of the frequency of all words in the community, represents the scale of this direction. Each edge represents the correlation between the theme communities. Thicker edges indicate greater correlation strengths and a greater influence between communities. In general, C1-Cancer, C2-Biomarkers, C3-Genomics, and C4-Drug have formed a closely related and stable research structure; however, C5-Therapy, loosely correlated with the communities mentioned above, is considered an isolated and marginal research direction. It is noteworthy that the C1-Cancer community has the highest correlation with other communities, highlighting its important position and influence in the entire PM research field. Particularly, C1 has shown that its correlation strength with C2 and C3 is at the highest level. The 3 communities above can be regarded as core directions of PM research, which have the strongest interaction with and influence on each other. In addition, the correlation between the C1 and C5 communities is also strong, indicating interactions between the 2 research directions of Cancer and Therapy, as well as Genomics and Drug.
Furthermore, in terms of the internal correlation of the research themes community (and ), especially regarding the indices of the average degree and density, the degree centrality of C1-Cancer and C2-Biomarkers is the highest. Second, C3-Genomics and C4-Drug are subcore research themes, whereas C5-Therapy is a self-contained research theme in PM research but in a marginal position. Finally, C1-Cancer theme community is the most closely correlated within the community and the most mature subject direction in this PM research. The other thematic communities are also closely correlated within them and have a relatively mature development. Overall, the density of all PM research topic communities is higher than the overall density of the coword network. Each research direction has been self-contained and well-developed. However, the strength of the correlation between communities is much weaker than that within the community. The results show that PM research directions are significantly differentiated and that the correlations and interactions between communities are generally insufficient.
|Community||Number of nodes||Number of edges||Total frequency||Average degree||Density|
Evolution of and Trends in Precision Medicine Research
The bibliographic data were divided with the year as the unit of time, and an evolution graph was generated to reveal the evolutionary patterns of PM research. In addition, based on centrality and density, theme communities were graphed in a strategic graph (a 2D map). The relative status and development trend of each theme community in the PM research were revealed.
Evolution Venation of Precision Medicine Research
To clearly show the development, the evolution of PM research was divided into 2 stages, namely, Stage 1 (2009-2013) and Stage 2 (2014-2018), as shown inand . Tubes are colored in each year to represent different topic communities. They are linked because of overlap in keywords in 2 adjacent years, and the evolution venations will be generated with the same color as shown in the figures. According to variations in the topics, such as overlapping, differentiation and fusion of topics, and isolation, we aimed to determine the developing trends of PM. In the past ten years, the continuity in PM themes has generally been good, and a consensus on research directions has formed. Moreover, research in PM has deepened and expanded, especially in Stage 2 (2014-2018), where PM research has maintained good continuity. In the same period, there was more differentiation and integration of the research areas; the interaction between the subjects of the studies was also more pronounced.
Stage 1 (2009-2013)
First, there are 4 obvious thematic evolutions: the Pharmacogenomics and Pharmacogenetics venation (including Pharmacogenomics, Genetics, Polymorphism, Adverse Drug Reactions, and CYP2C9), the epidermal growth factor receptor (EGFR) and v-raf murine sarcoma viral oncogene homolog B1 (BRAF) venation (including Molecular Imaging, Drug Delivery, non–small-cell lung cancer [NSCLC], and Ki-ras2 Kirsten rat sarcoma viral oncogene homolog [KRAS]), the Proteomics and Metabolomics venation (including Sequencing, Bioinformatics, and Translation Medicine), and the Ethics and Cost-Effectiveness venation (including Health Care, Genetic Test, Health Policy, and Breast Cancer).
Each venation is independent and less differentiated, and the internal system for the theme communities is relatively mature. The Pharmacogenomics and Pharmacogenetics venation and the EGFR and BRAF venation are larger scale, so they can thus be considered the 2 important research directions in this period. The evolution of some themes, such as Schizophrenia and Oncogenes, has been interrupted, which may be due to the lack of continuous concern about such subjects or their integration into other subjects. We also find that there are a few isolated themes during different periods, such as Policy, Clinical Practice, Tumor, Chemotherapy, Organ, and NGS. Owing to strong internal correlation, these themes have been clustered as a research direction. However, such studies have not yet formed a systematic and continuous direction.
Stage 2 (2014-2018)
We performed an independent analysis for the years 2013 and 2018 to discover the continuity between 2013 and 2014. There are many overlapping thematic communities in these 2 years as well as overlapping research themes, such as EGFR and BRAF, Molecular Imaging and Drugs, and Pharmacogenomics and Pharmacogenetics, which exhibit good continuity. Overall, the sustainability and stability of PM research in this stage are better than that in Stage 1. Research on PM in terms of themes is more concentrated, which indicates the more consistent and mature direction of progression.
According to the evolutionary graph, there are 3 major research themes at this stage: Molecular Imaging and Drug Delivery, EGFR and Mutation, and Pharmacogenomics and Pharmacogenetics. First, the Molecular Imaging and Drug Delivery venation includes Theranostics, Diagnostics, Immunotherapy, and Machine Learning. The EGFR and Mutation venation includes NSCLC, KRAS, Tumor, Target Therapy, NGS, DNA, and MicroRNA. The Pharmacogenomics and Pharmacogenetics venation includes Cytochrome P450, Epigenetics, Cardiovascular Disease, Omics, and Bioinformatics. Simultaneously, Stratification and Prediction and related topics have also formed an independent evolutionary venation. Although small in scale, they have also become a self-contained system. However, there are also discontinuous evolutions and isolated topics at this stage, such as the evolution of Bipolar Disorder, which was interrupted in 2015. In this period, Parkinson Disease, Stem Cell, and Big Data finally become isolated research themes rather than evolutionary venations.
Development Trends in Precision Medicine Research
The theme community in the PM study is distributed in the strategic map according to centrality and density (). On the basis of indicator analysis of the theme communities, we found that C1 is in the first quadrant. Its centrality degree and density are relatively high, indicating that it is the core and mature direction of PM research. C4 is in the second quadrant, with low centrality degree and high density. It can be considered to be the mature direction, but not the core of PM research. In the third quadrant, C3 and C5, with both low centrality and density, are not the core directions and are not mature. However, C3 is close to the origin of density, which means it has the potential to be the core of PM research and that it will develop into a mature community. C2 is in the fourth quadrant, the centrality of which is high, but the density is low. C2 can be considered the core direction, but it is generally immature or involves too many topics.
Based on the results, it is possible for us to better understand the main research directions of PM research and accurately evaluate its importance, maturity, and interactions. First, we determined that overall work in PM research is unbalanced but that the theme community is balanced. As PM was newly born as an independent academic subject, researchers paid most of their attention to only a few popular words, such as Biomarkers, Genomics, Cancer, Therapy, Genetics, Drug, Target Therapy, Pharmacogenomics, Pharmacogenetics, and Molecular. The words mentioned above can be classified into the following categories: The applied subject (Cancer), The associated technology and research (Biomarkers, Genomics, and Genetics), pharmacology (Pharmacogenomics), and clinical practice (Treatment, Risk Prediction, Molecular Target Treatment, and Diagnosis). These words not only reflect areas of scientific concern, but more importantly, they indicate the major research directions of PM. However, we also found that the attention paid to most research themes is relatively dispersed. We could speculate that the current status of PM research is possibly as follows: (1) the most mature application of PM is in the subject of Oncology; (2) scientists are interested in discovering Biomarkers, mainly using genomics and genetic methods; (3) pharmacology is an important interdisciplinary field involved with PM, with the aim to make drug utility safer and more efficient; and (4) PM is widely used in Clinical Medicine, including for consulting, diagnosis, and treatment (especially molecular target treatment).
With the visualization of the coword network, we found that the themes were more inclined to be clustered around other popular minority keywords. Thus, the theme communities, both well-layered and balanced-scaled, were finally formed. The communities included C1-Cancer, C2-Biomarkers, C3-Genomics, C4-Drug, and C5-Therapy. According to the analysis of correlation between the theme’s communities, we can draw the following inferences: C1-Cancer, as the largest community, indicates that the application of PM in Clinical Oncology might already be mature. The other directions, such as technical studies and Clinical Medicine, are widely associated with Cancer. C2-Biomarkers is the second largest group and plays a key role as the basis of PM research. Scientists still strive for biomarker discovery with various techniques and for the transformation of these discoveries into clinical therapeutics and the prediction of clinical outcomes [, ]. Owing to significant progress in Genomics and Pharmacology, C4-Drug community, as an independent community, indicates special concern by both pharmacologists and clinicians. In this area, scientists are trying to explore the genetic correlation of Pharmacology and Genomics. These findings will be the foundation of PM, improving drug efficacy and safety [ , ]. It is also noteworthy that the development time of C4-Drug is short, but the fastest. Significant progress has been made in Genomics, Pharmacological Dynamics, Pharmacology, and Metabolomics, and these disciplines are playing an increasingly important role in the field. Although C3-Genomics is relatively isolated and not at the core of PM research, Genomics is one of the most important methods of detecting Biomarkers and is still widely used in various fields of PM [ ]. Its decline is due to the application of new technologies, such as high-throughput Omics [ ] and Molecular Imaging technology [ ]. C5-Therapy, independent but of the smallest scale, indicates that individualized treatment is the ultimate goal of PM, resulting in this aspect gaining the attention of scientists. However, the strategy for treatment is still far from well-developed, which proves the limited scale of the community. According to the major themes included in the C5-Therapy community, individualized treatment mainly involves traditional strategies such as Surgery, Chemotherapy, and Radiology. Interestingly, new methods such as gene therapy, stem cell, and tissue engineering have been available in PM treatment. On the other hand, PET and Molecular Imaging are new technologies that can be applied for stratifications. Through the strategic diagram, C5-Therapy is noncore in PM research; however, we can infer that while the community has not yet matured, it is of great potential.
Through the analysis of the evolution of theme communities over time, PM research has a clear evolutionary and developmental trend. In 2 stages of evolution, we have discovered a large number of well-concentrated evolutionary pathways, which indicates the maturity of PM. The theme community in PM research is well-structured and contains the core and promising directions, such as Biomarkers, Pharmacogenomics, MicroRNA, Imaging, and even Machine Learning. We also identified a dramatic development in techniques and pharmacology directions. It is worth noting that the trend toward PM in nononcology diseases has the potential to become mature, and NSCLC could develop to become an independent and mature venation. It indicates that the application of PM in NSCLC is relatively mature. Clinicians have applied strategies or technologies involved with PM, such as Biomarker, Molecular Imaging, and Pharmacogenomics, to achieve precise treatment [- ].
Our study reveals the structure and developmental trends of PM research from the perspective of keywords and their relationships. To some extent, this study provides insight into PM research; however, there are still limitations to this work. Regarding the research sample, this study used the literature to reveal the development status of PM. This research method could be regarded as a reasonable and cost-effective strategy rather than a comprehensive and accurate way to evaluate the true status of PM research.
Conclusions and Future Directions
Our study reveals the hotspots, structures, evolutions, and developmental trends of PM research in the past 10 years by means of social network analysis and visualization. We also made the following valuable discoveries: (1) using a graph, the network can describe, in detail, the development of PM research; and (2) the network uncovers the relationship between the themes and the intrinsic mechanism about how they interact, which could provide insights into future research directions.
In the future, we will perform data mining on the content of PM-related literature (eg, reports and illness records) to better reveal the condition of the entire network from various perspectives. In terms of research methods, based on previous work, the efficacy of coword analysis has been identified. Our study also validates this research method, and using it, we were able to obtain some valuable discoveries. In future studies, we aim to perform a further, comprehensive assessment of PM research through various perspectives, such as interdisciplinary research and institutes.
This study was supported by the National Natural Science Foundation of China Funded Project (71874125), the Ministry of Education in China’s Project of Humanities and Social Sciences (18YJA870004), and the Wuhan University Scientific Research Project (2042014KF0164).
JH and XL conceptualized the study and collected and analyzed the data. They participated in all phases of the review. WD and XX assisted with study conception and design, as well as interpretation of data, drafting of the manuscript, and critical revision. All authors contributed to the writing of the manuscript and approved the final version.
Conflicts of Interest
- Mohler J, Najafi B, Fain M, Ramos KS. Precision medicine: a wider definition. J Am Geriatr Soc 2015 Sep;63(9):1971-1972. [CrossRef] [Medline]
- Geyer FC, Lopez-Garcia MA, Lambros MB, Reis-Filho JS. Genetic characterization of breast cancer and implications for clinical management. J Cell Mol Med 2009 Oct;13(10):4090-4103 [FREE Full text] [CrossRef] [Medline]
- Luttropp K, Lindholm B, Carrero JJ, Glorieux G, Schepers E, Vanholder R, et al. Genetics/Genomics in chronic kidney disease--towards personalized medicine? Semin Dial 2009;22(4):417-422. [CrossRef] [Medline]
- Aronson SJ, Rehm HL. Building the foundation for genomics in precision medicine. Nature 2015 Oct 15;526(7573):336-342 [FREE Full text] [CrossRef] [Medline]
- Kohler I, Hankemeier T, van der Graaf PH, Knibbe CA, van Hasselt JG. Integrating clinical metabolomics-based biomarker discovery and clinical pharmacology to enable precision medicine. Eur J Pharm Sci 2017 Nov 15;109S:S15-S21 [FREE Full text] [CrossRef] [Medline]
- Kuntz TM, Gilbert JA. Introducing the microbiome into precision medicine. Trends Pharmacol Sci 2017 Jan;38(1):81-91. [CrossRef] [Medline]
- Duarte TT, Spencer CT. Personalized proteomics: the future of precision medicine. Proteomes 2016;4(4):pii: 29 [FREE Full text] [CrossRef] [Medline]
- Schwaederle M, Zhao M, Lee JJ, Eggermont AM, Schilsky RL, Mendelsohn J, et al. Impact of precision medicine in diverse cancers: a meta-analysis of phase II clinical trials. J Clin Oncol 2015 Nov 10;33(32):3817-3825 [FREE Full text] [CrossRef] [Medline]
- Printz C. National Institutes of Health releases new guidelines for stem cell research. Cancer 2009 Sep 15;115(18):4043-4044 [FREE Full text] [CrossRef] [Medline]
- Hollingsworth SJ. Precision medicine in oncology drug development: a pharma perspective. Drug Discov Today 2015 Dec;20(12):1455-1463. [CrossRef] [Medline]
- He M, Xia J, Shehab M, Wang X. The development of precision medicine in clinical practice. Clin Transl Med 2015 Dec;4(1):69 [FREE Full text] [CrossRef] [Medline]
- He Q. Knowledge discovery through co-word analysis. Libr Trends 1999;48(1):133-159 [FREE Full text]
- Li F, Li M, Guan P, Ma S, Cui L. Mapping publication trends and identifying hot spots of research on Internet health information seeking behavior: a quantitative and co-word biclustering analysis. J Med Internet Res 2015 Mar 25;17(3):e81 [FREE Full text] [CrossRef] [Medline]
- Chang X, Zhou X, Luo L, Yang C, Pan H, Zhang S. Hotspots in research on the measurement of medical students' clinical competence from 2012-2016 based on co-word analysis. BMC Med Educ 2017 Sep 12;17(1):162 [FREE Full text] [CrossRef] [Medline]
- Leung XY, Sun J, Bai B. Bibliometrics of social media research: a co-citation and co-word analysis. Int J Hosp Manag 2017;66:35-45. [CrossRef]
- Li X, Qiao H, Wang S. Exploring evolution and emerging trends in business model study: a co-citation analysis. Scientometrics 2017;111(2):869-887. [CrossRef]
- Chaker AM, Klimek L. [Individualized, personalized and stratified medicine: a challenge for allergology in ENT?]. HNO 2015 May;63(5):334-342. [CrossRef] [Medline]
- Sobradillo P, Pozo F, Agustí A. P4 Medicine: the Future Around the Corner. Arch Bronconeumol 2011 Jan;47(1):35-40. [CrossRef]
- Manolio TA, Green ED. Leading the way to genomic medicine. Am J Med Genet C Semin Med Genet 2014 Mar;166C(1):1-7. [CrossRef] [Medline]
- Ashley EA. The precision medicine initiative: a new national effort. J Am Med Assoc 2015 Jun 2;313(21):2119-2120. [CrossRef] [Medline]
- Ogino S, Nishihara R, VanderWeele TJ, Wang M, Nishi A, Lochhead P, et al. Review article: the role of molecular pathological epidemiology in the study of neoplastic and non-neoplastic diseases in the era of precision medicine. Epidemiology 2016 Jul;27(4):602-611 [FREE Full text] [CrossRef] [Medline]
- Slamon DJ, Leyland-Jones B, Shak S, Fuchs H, Paton V, Bajamonde A, et al. Use of chemotherapy plus a monoclonal antibody against HER2 for metastatic breast cancer that overexpresses HER2. N Engl J Med 2001 Mar 15;344(11):783-792. [CrossRef] [Medline]
- Antoniou A, Pharoah PD, Narod S, Risch HA, Eyfjord JE, Hopper JL, et al. Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case Series unselected for family history: a combined analysis of 22 studies. Am J Hum Genet 2003 May;72(5):1117-1130 [FREE Full text] [CrossRef] [Medline]
- Mancinelli L, Cronin M, Sadée W. Pharmacogenomics: the promise of personalized medicine. AAPS PharmSci 2000;2(1):E4 [FREE Full text] [CrossRef] [Medline]
- Shord SS. The role of clinical pharmacology in oncology dose selection: advances and opportunities in personalized medicine. J Clin Pharmacol 2017 Oct;57(Suppl 10):S99-104. [CrossRef] [Medline]
- Roper N, Stensland KD, Hendricks R, Galsky MD. The landscape of precision cancer medicine clinical trials in the United States. Cancer Treat Rev 2015 May;41(5):385-390. [CrossRef] [Medline]
- Fiore RN, Goodman KW. Precision medicine ethics: selected issues and developments in next-generation sequencing, clinical oncology, and ethics. Curr Opin Oncol 2016 Jan;28(1):83-87. [CrossRef] [Medline]
- Trosman JR, Weldon CB, Douglas MP, Kurian AW, Kelley RK, Deverka PA, et al. Payer coverage for hereditary cancer panels: barriers, opportunities, and implications for the precision medicine initiative. J Natl Compr Canc Netw 2017 Feb;15(2):219-228 [FREE Full text] [CrossRef] [Medline]
- McPadden J, Durant TJ, Bunch DR, Coppi A, Price N, Rodgerson K, et al. Health care and precision medicine research: analysis of a scalable data science platform. J Med Internet Res 2019 Apr 9;21(4):e13043 [FREE Full text] [CrossRef] [Medline]
- Krittanawong C, Zhang H, Wang Z, Aydar M, Kitai T. Artificial intelligence in precision cardiovascular medicine. J Am Coll Cardiol 2017 May 30;69(21):2657-2664 [FREE Full text] [CrossRef] [Medline]
- Hu J, Zhang Y. Discovering the interdisciplinary nature of Big Data research through social network analysis and visualization. Scientometrics 2017;112(1):91-109. [CrossRef]
- Chen R, Snyder M. Promise of personalized omics to precision medicine. Wiley Interdiscip Rev Syst Biol Med 2013;5(1):73-82 [FREE Full text] [CrossRef] [Medline]
- Lu Z, Minko T. Molecular imaging for precision medicine. Adv Drug Deliv Rev 2017 Apr;113:1-2. [CrossRef] [Medline]
- Davis T. Biomedical, Bio-Nano, Personalized Medicine – It's All Nanomedicine to Us!. Aust J Chem 2012;65(1):3-4. [CrossRef]
- Vieta E. [Personalised medicine applied to mental health: precision psychiatry]. Rev Psiquiatr Salud Ment 2015;8(3):117-118. [CrossRef] [Medline]
- Krittanawong C. Future physicians in the era of precision cardiovascular medicine. Circulation 2017 Oct 24;136(17):1572-1574. [CrossRef] [Medline]
- Oberle AJ, Mathur P. Precision medicine in asthma: the role of bronchial thermoplasty. Curr Opin Pulm Med 2017 May;23(3):254-260. [CrossRef] [Medline]
- Fischer S, Neurath MF. Precision medicine in inflammatory bowel diseases. Clin Pharmacol Ther 2017 Oct;102(4):623-632. [CrossRef] [Medline]
- Korngiebel DM, Thummel KE, Burke W. Implementing precision medicine: the ethical challenges. Trends Pharmacol Sci 2017 Jan;38(1):8-14 [FREE Full text] [CrossRef] [Medline]
- Brothers KB, Rothstein MA. Ethical, legal and social implications of incorporating personalized medicine into healthcare. Per Med 2015;12(1):43-51 [FREE Full text] [CrossRef] [Medline]
- Duffy DJ. Problems, challenges and promises: perspectives on precision medicine. Brief Bioinform 2016 May;17(3):494-504. [CrossRef] [Medline]
- Loncar-Turukalo T, Zdravevski E, Machado da Silva J, Chouvarda I, Trajkovik V. Literature on wearable technology for connected health: scoping review of research trends, advances, and barriers. J Med Internet Res 2019 Sep 5;21(9):e14017 [FREE Full text] [CrossRef] [Medline]
- Wei W, Shi B, Guan X, Ma J, Wang Y, Liu J. Mapping theme trends and knowledge structures for human neural stem cells: a quantitative and co-word biclustering analysis for the 2013-2018 period. Neural Regen Res 2019 Oct;14(10):1823-1832 [FREE Full text] [CrossRef] [Medline]
- Hu J, Zhang Y. Research patterns and trends of Recommendation System in China using co-word analysis. Inf Process Manage 2015 Jul;51(4):329-339. [CrossRef]
- Hu C, Hu J, Deng S, Liu Y. A co-word analysis of library and information science in China. Scientometrics 2013;97(2):369-382. [CrossRef]
- Börner K. Plug-and-play macroscopes. Commun ACM 2011 Mar;54(3):60-69. [CrossRef]
- Hu J, Zhang Y. Structure and patterns of cross-national Big Data research collaborations. J Doc 2017;73(6):1119-1136. [CrossRef]
- Leydesdorff L, de Moya-Anegón F, Guerrero-Bote VP. Journal maps, interactive overlays, and the measurement of interdisciplinarity on the basis of Scopus data (1996-2012). J Assn Inf Sci Tech 2015;66(5):1001-1016. [CrossRef]
- Hu J, Huang R, Wang Y. Geographical visualization of research collaborations of library science in China. Electron Libr 2018;36(3):414-429. [CrossRef]
- Doreian P, Lloyd P, Mrvar A. Partitioning large signed two-mode networks: problems and prospects. Soc Netw 2013 May;35(2):178-203. [CrossRef]
- Callon M, Courtial JP, Laville F. Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry. Scientometrics 1991 Sep;22(1):155-205. [CrossRef]
- Albert R, Barabási AL. Statistical mechanics of complex networks. Rev Mod Phys 2002 Jan;74(1):47-97. [CrossRef]
- Chen G, Xiao L. Selecting publication keywords for domain analysis in bibliometrics: a comparison of three methods. J Inform 2016 Feb;10(1):212-223. [CrossRef]
- Blondel VD, Guillaume J, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech 2008 Oct;2008(10):P10008. [CrossRef]
- Muñoz-Leiva F, Viedma-del-Jesús MI, Sánchez-Fernández J, López-Herrera AG. An application of co-word analysis and bibliometric maps for detecting the most highlighting themes in the consumer behaviour research from a longitudinal perspective. Qual Quant 2012;46(4):1077-1095. [CrossRef]
- Chen Y, Fang S. Mapping the evolving patterns of patent assignees’ collaboration networks and identifying the collaboration potential. Scientometrics 2014;101(2):1215-1231. [CrossRef]
- van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 2010 Aug;84(2):523-538 [FREE Full text] [CrossRef] [Medline]
- Rosvall M, Bergstrom CT. Mapping change in large networks. PLoS One 2010 Jan 27;5(1):e8694 [FREE Full text] [CrossRef] [Medline]
- Leydesdorff L, Goldstone RL. Interdisciplinarity at the journal and specialty level: The changing knowledge bases of the journal. J Assoc Inf Sci Tech 2014;65(1):164-177. [CrossRef]
- Leydesdorff L, Park HW, Wagner C. International coauthorship relations in the Social Sciences Citation Index: Is internationalization leading the Network? J Assoc Inf Sci Tech 2014;65(10):2111-2126. [CrossRef]
- Wang E, Cho WC, Wong SC, Liu S. Disease biomarkers for precision medicine: challenges and future opportunities. Genomics Proteomics Bioinformatics 2017 Apr;15(2):57-58 [FREE Full text] [CrossRef] [Medline]
- Collins DC, Sundar R, Lim JS, Yap TA. Towards precision medicine in the clinic: from biomarker discovery to novel therapeutics. Trends Pharmacol Sci 2017 Jan;38(1):25-40. [CrossRef] [Medline]
- Lauschke VM, Milani L, Ingelman-Sundberg M. Pharmacogenomic biomarkers for improved drug therapy-recent progress and future developments. AAPS J 2017 Nov 27;20(1):4. [CrossRef] [Medline]
- Relling MV, Evans WE. Pharmacogenomics in the clinic. Nature 2015 Oct 15;526(7573):343-350 [FREE Full text] [CrossRef] [Medline]
- Carrasco-Ramiro F, Peiró-Pastor R, Aguado B. Human genomics projects and precision medicine. Gene Ther 2017 Sep;24(9):551-561. [CrossRef] [Medline]
- Kim D, Kim Y, Son N, Kang C, Kim A. Recent omics technologies and their emerging applications for personalised medicine. IET Syst Biol 2017 Jun;11(3):87-98. [CrossRef] [Medline]
- Wright CL, Binzel K, Zhang J, Knopp MV. Advanced functional tumor imaging and precision nuclear medicine enabled by digital pet technologies. Contrast Media Mol Imaging 2017;2017:5260305 [FREE Full text] [CrossRef] [Medline]
- Hofman P. ALK in non-small cell lung cancer (NSCLC) pathobiology, epidemiology, detection from tumor tissue and algorithm diagnosis in a daily practice. Cancers (Basel) 2017 Aug 12;9(8):pii: E107 [FREE Full text] [CrossRef] [Medline]
- Bahce I, Yaqub M, Smit EF, Lammertsma AA, van Dongen GA, Hendrikse NH. Personalizing NSCLC therapy by characterizing tumors using TKI-PET and immuno-PET. Lung Cancer 2017 May;107:1-13 [FREE Full text] [CrossRef] [Medline]
- Yin J, Li X, Zhou H, Liu Z. Pharmacogenomics of platinum-based chemotherapy sensitivity in NSCLC: toward precision medicine. Pharmacogenomics 2016 Aug;17(12):1365-1378. [CrossRef] [Medline]
|BRAF: v-raf murine sarcoma viral oncogene homolog B1|
|EGFR: epidermal growth factor receptor|
|HER2: human epidermal growth factor receptor 2|
|KRAS: Ki-ras2 Kirsten rat sarcoma viral oncogene homolog|
|NGS: next-generation sequencing|
|NSCLC: non–small cell lung cancer|
|P4 medicine: predictive, preventative, personalized, and participatory medicine|
|PET: positron emission tomography|
|PM: precision medicine|
|WOSCC: Web of Science Core Collection|
Edited by G Eysenbach; submitted 13.06.18; peer-reviewed by A Mavragani; comments to author 09.10.18; revised version received 07.10.19; accepted 19.10.19; published 04.02.20Copyright
©Xiaoguang Lyu, Jiming Hu, Weiguo Dong, Xin Xu. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 04.02.2020.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.