To read this content please select one of the options below:

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches

Carmen Galvez (Department of Information Science, University of Granada, Granada, Spain)

Félix de Moya‐Anegón (Department of Information Science, University of Granada, Granada, Spain)

Víctor H. Solana (Department of Information Science, University of Granada, Granada, Spain)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 August 2005

Downloads

1333

Abstract

Purpose

–

To propose a categorization of the different conflation procedures at the two basic approaches, non‐linguistic and linguistic techniques, and to justify the application of normalization methods within the framework of linguistic techniques.

Design/methodology/approach

–

Presents a range of term conflation methods, that can be used in information retrieval. The uniterm and multiterm variants can be considered equivalent units for the purposes of automatic indexing. Stemming algorithms, segmentation rules, association measures and clustering techniques are well evaluated non‐linguistic methods, and experiments with these techniques show a wide variety of results. Alternatively, the lemmatisation and the use of syntactic pattern‐matching, through equivalence relations represented in finite‐state transducers (FST), are emerging methods for the recognition and standardization of terms.

Findings

–

The survey attempts to point out the positive and negative effects of the linguistic approach and its potential as a term conflation method.

Originality/value

–

Outlines the importance of FSTs for the normalization of term variants.

Keywords

Citation

Galvez, C., de Moya‐Anegón, F. and Solana, V.H. (2005), "Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches", Journal of Documentation, Vol. 61 No. 4, pp. 520-547. https://doi.org/10.1108/00220410510607507

Publisher

:

Emerald Group Publishing Limited

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

All feedback is valuable

Report an issue or find answers to frequently asked questions