Search results

1 – 3 of 3
Per page
102050
Citations:
Loading...
Access Restricted. View access options
Article
Publication date: 1 February 1998

Anders Ardö

What was the aim in building a Nordic search engine? The goals were to build a system that:

37

Abstract

What was the aim in building a Nordic search engine? The goals were to build a system that:

Details

The Electronic Library, vol. 16 no. 2
Type: Research Article
ISSN: 0264-0473

Access Restricted. View access options
Article
Publication date: 1 August 2004

Sanjica Faletar

conference held in Dubrovnik, May 2004, which had the dual theme of human information behaviour and competences for digital libraries.

1649

Abstract

conference held in Dubrovnik, May 2004, which had the dual theme of human information behaviour and competences for digital libraries.

Details

Library Hi Tech News, vol. 21 no. 7
Type: Research Article
ISSN: 0741-9058

Keywords

Access Restricted. View access options
Article
Publication date: 18 November 2013

Arash Joorabchi and Abdulhussain E. Mahdi

This paper aims to report on the design and development of a new approach for automatic classification and subject indexing of research documents in scientific digital libraries…

1752

Abstract

Purpose

This paper aims to report on the design and development of a new approach for automatic classification and subject indexing of research documents in scientific digital libraries and repositories (DLR) according to library controlled vocabularies such as DDC and FAST.

Design/methodology/approach

The proposed concept matching-based approach (CMA) detects key Wikipedia concepts occurring in a document and searches the OPACs of conventional libraries via querying the WorldCat database to retrieve a set of MARC records which share one or more of the detected key concepts. Then the semantic similarity of each retrieved MARC record to the document is measured and, using an inference algorithm, the DDC classes and FAST subjects of those MARC records which have the highest similarity to the document are assigned to it.

Findings

The performance of the proposed method in terms of the accuracy of the DDC classes and FAST subjects automatically assigned to a set of research documents is evaluated using standard information retrieval measures of precision, recall, and F1. The authors demonstrate the superiority of the proposed approach in terms of accuracy performance in comparison to a similar system currently deployed in a large scale scientific search engine.

Originality/value

The proposed approach enables the development of a new type of subject classification system for DLR, and addresses some of the problems similar systems suffer from, such as the problem of imbalanced training data encountered by machine learning-based systems, and the problem of word-sense ambiguity encountered by string matching-based systems.

1 – 3 of 3
Per page
102050