Exploring the nuances of biomedical language: a study on the polysemy of the word pattern
ISSN: 0368-492X
Article publication date: 25 July 2023
Issue publication date: 12 November 2024
Abstract
Purpose
Effective communication is crucial in the medical field, where different stakeholders use various terminologies, such as ICD, SNOMED CT, UMLS and MeSH, to describe and classify healthcare concepts. Polysemy, however, can make natural language processing difficult. This study explores the contextual meanings of the term “pattern” in the biomedical literature, compares them to existing definitions, annotates a corpus for use in machine learning and proposes new definitions of terms such as “syndrome,” “feature” and “pattern recognition.”
Design/methodology/approach
The Entrez API was used to retrieve articles from PubMed, yielding a corpus of 398 articles through a search query for the ambiguous term “pattern” in titles or abstracts. The Python NLTK library was used to extract the terms and their contexts, and an expert check was carried out. To understand the various meanings of the term, its contextual environment was analyzed by extracting the surrounding words. The expert determined the appropriate context size for analysis, in order to gain a more nuanced understanding of the different meanings of the term “pattern.”
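The retrieval and context-extraction steps described above can be illustrated with a minimal sketch. It is not the authors' code: the function names `build_pubmed_query` and `context_windows`, the `retmax` default and the window size are hypothetical, and a standard-library regular expression stands in for the NLTK tokenizer so that the example runs without extra downloads. The query URL is only constructed, not sent.

```python
import re
from urllib.parse import urlencode

# NCBI E-utilities ESearch endpoint. The URL is built here but never requested.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pubmed_query(term, retmax=100):
    """Build an ESearch URL matching `term` in PubMed titles or abstracts.

    `retmax` is an illustrative default, not a value taken from the study.
    """
    params = {
        "db": "pubmed",
        "term": f"{term}[Title/Abstract]",
        "retmax": retmax,
        "retmode": "json",
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"


def context_windows(text, target="pattern", size=3):
    """Return (left, hit, right) word windows around each occurrence of `target`.

    `size` is the number of words kept on each side. In the study this
    value was chosen by an expert; 3 is only a placeholder.
    """
    # Simple word tokenizer (assumption: stands in for an NLTK tokenizer).
    tokens = re.findall(r"[A-Za-z0-9]+(?:[-'][A-Za-z0-9]+)*", text)
    windows = []
    for i, tok in enumerate(tokens):
        # Match the singular and plural forms, ignoring case.
        if tok.lower() in (target, target + "s"):
            left = tokens[max(0, i - size):i]
            right = tokens[i + 1:i + 1 + size]
            windows.append((left, tok, right))
    return windows


sample = "The rash showed a reticular pattern on the trunk and limbs."
for left, hit, right in context_windows(sample, size=2):
    print(" ".join(left), f"[{hit}]", " ".join(right))
```

Keeping the window size as a parameter reflects the point made in the abstract that the appropriate amount of context is a judgment call; widening or narrowing it changes which neighbouring words are available for distinguishing one sense of “pattern” from another.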
Findings
The study found that the categories of meaning of the term “pattern” are broader in biomedical publications than in common definitions, and that new categories have emerged from the term's use in the biomedical field. The study highlights the importance of annotated corpora in advancing natural language processing techniques and provides valuable insights into the nuances of biomedical language.
Originality/value
The study's findings demonstrate the importance of exploring contextual meanings and proposing new definitions of terms in the biomedical field to improve natural language processing techniques.
Citation
Khakimova, A., Zolotarev, O. and Kaushal, S. (2024), "Exploring the nuances of biomedical language: a study on the polysemy of the word pattern", Kybernetes, Vol. 53 No. 11, pp. 4747-4758. https://doi.org/10.1108/K-05-2023-0767
Publisher
Emerald Publishing Limited
Copyright © 2023, Emerald Publishing Limited