Retrospective and prospective approaches of coronavirus publications in the last half-century: a Latent Dirichlet allocation analysis
ISSN: 0737-8831
Article publication date: 1 April 2021
Issue publication date: 13 September 2021
Abstract
Purpose
The present article's primary purpose is the topic modeling of the global coronavirus publications in the last 50 years.
Design/methodology/approach
The present study is applied research that has been conducted using text mining. The statistical population is the coronavirus publications that have been collected from the Web of Science Core Collection (1970–2020). The main keywords were extracted from the Medical Subject Heading browser to design the search strategy. Latent Dirichlet allocation and Python programming language were applied to analyze the data and implement the text mining algorithms of topic modeling.
Findings
The findings indicated that the SARS, science, protein, MERS, veterinary, cell, human, RNA, medicine and virology are the most important keywords in the global coronavirus publications. Also, eight important topics were identified in the global coronavirus publications by implementing the topic modeling algorithm. The highest number of publications were respectively on the following topics: “structure and proteomics,” “Cell signaling and immune response,” “clinical presentation and detection,” “Gene sequence and genomics,” “Diagnosis tests,” “vaccine and immune response and outbreak,” “Epidemiology and Transmission” and “gastrointestinal tissue.”
Originality/value
The originality of this article can be considered in three ways. First, text mining and Latent Dirichlet allocation were applied to analyzing coronavirus literature for the first time. Second, coronavirus is mentioned as a hot topic of research. Finally, in addition to the retrospective approaches to 50 years of data collection and analysis, the results can be exploited with prospective approaches to strategic planning and macro-policymaking.
Keywords
Acknowledgements
Researchers appreciate the Vice-Chancellor for Research and Technology of Gonabad University of Medical Sciences for its financial support and the respiratory infectious diseases specialists for their expert advice.
Research funding: The present article is extracted from a research project with code A‐10–1263–2 and research ethics ID. I.R.GMU.REC.1398.189 approved by the Research Council of Gonabad University of Medical Sciences and conducted with financial support.
Citation
Danesh, F., Dastani, M. and Ghorbani, M. (2021), "Retrospective and prospective approaches of coronavirus publications in the last half-century: a Latent Dirichlet allocation analysis", Library Hi Tech, Vol. 39 No. 3, pp. 855-872. https://doi.org/10.1108/LHT-09-2020-0216
Publisher
:Emerald Publishing Limited
Copyright © 2021, Emerald Publishing Limited