To read this content please select one of the options below:

Exploring the nuances of biomedical language: a study on the polysemy of the word pattern

Aida Khakimova (Institute of Information Systems and Engineering Computer Technologies, Russian New University, Moscow, Russia)
Oleg Zolotarev (Department of Information Systems in Economics and Control, Russian New University, Moscow, Russia)
Sanjay Kaushal (Department of General Management, Sharda University, Greater Noida, India)

Kybernetes

ISSN: 0368-492X

Article publication date: 25 July 2023

Issue publication date: 12 November 2024

97

Abstract

Purpose

Effective communication is crucial in the medical field where different stakeholders use various terminologies to describe and classify healthcare concepts such as ICD, SNOMED CT, UMLS and MeSH, but the problem of polysemy can make natural language processing difficult. This study explores the contextual meanings of the term “pattern” in the biomedical literature, compares them to existing definitions, annotates a corpus for use in machine learning and proposes new definitions of terms such as “Syndrome, feature” and “pattern recognition.”

Design/methodology/approach

Entrez API was used to retrieve articles form PubMed for the study which assembled a corpus of 398 articles using a search query for the ambiguous term “pattern” in the titles or abstracts. The python NLTK library was used to extract the terms and their contexts, and an expert check was carried out. To understand the various meanings of the term, the contextual environment was analyzed by extracting the surrounding words of the term. The expert determined the appropriate size of the context for analysis to gain a more nuanced understanding of the different meanings of the term pattern.

Findings

The study found that the categories of meanings of the term “pattern” are broader in biomedical publications than in common definitions, and new categories have been emerging from the term's use in the biomedical field. The study highlights the importance of annotated corpora in advancing natural language processing techniques and provides valuable insights into the nuances of biomedical language.

Originality/value

The study's findings demonstrate the importance of exploring contextual meanings and proposing new definitions of terms in the biomedical field to improve natural language processing techniques.

Keywords

Citation

Khakimova, A., Zolotarev, O. and Kaushal, S. (2024), "Exploring the nuances of biomedical language: a study on the polysemy of the word pattern", Kybernetes, Vol. 53 No. 11, pp. 4747-4758. https://doi.org/10.1108/K-05-2023-0767

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles