A case study for understanding the nature of redundant entities in bibliographic digital libraries
Program: electronic library and information systems
ISSN: 0033-0337
Article publication date: 1 July 2014
Abstract
Purpose
The purpose of this paper is to collect real cases of author name variants that frequently appear in bibliographic digital libraries (DLs) and to understand their nature, as a case study of the name authority control problem in DLs.
Design/methodology/approach
To find a sample of name variants across DLs (e.g. DBLP and ACM) and within a single DL (e.g. ACM), the authors use an approach based on two bipartite matching algorithms: Maximum Weighted Bipartite Matching and Maximum Cardinality Bipartite Matching.
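The two matching formulations named above can be illustrated with a minimal sketch. The abstract does not say which similarity measure, threshold or implementation the authors used, so everything here is an assumption made for illustration: the `difflib` character similarity, the 0.7 threshold, the function names and the sample name lists are all hypothetical. The weighted variant uses an exhaustive search, which is only practical for tiny inputs; a real system would use a polynomial-time method such as the Hungarian algorithm.

```python
from difflib import SequenceMatcher
from itertools import permutations


def similarity(a, b):
    """Character-level similarity in [0, 1] (an assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def max_cardinality_matching(left, right, threshold=0.7):
    """Pair as many names as possible, using only pairs scoring >= threshold."""
    # Edges of the bipartite graph: candidate right-side indices per left name.
    adj = [[j for j, r in enumerate(right) if similarity(l, r) >= threshold]
           for l in left]
    match_of_right = {}  # right index -> left index currently matched to it

    def augment(i, seen):
        # Try to find an augmenting path starting from left node i.
        for j in adj[i]:
            if j in seen:
                continue
            seen.add(j)
            if j not in match_of_right or augment(match_of_right[j], seen):
                match_of_right[j] = i
                return True
        return False

    for i in range(len(left)):
        augment(i, set())
    return sorted((left[i], right[j]) for j, i in match_of_right.items())


def max_weight_matching(left, right):
    """Exhaustively find the one-to-one pairing with the largest total
    similarity. Assumes len(left) <= len(right) and very small inputs."""
    best, best_score = [], -1.0
    for perm in permutations(range(len(right)), len(left)):
        score = sum(similarity(left[i], right[j]) for i, j in enumerate(perm))
        if score > best_score:
            best_score = score
            best = [(left[i], right[j]) for i, j in enumerate(perm)]
    return best, best_score


# Hypothetical author lists from two DLs (not data from the paper).
dl_a = ["Jeffrey D. Ullman", "Wei Wang", "Byung-Won On"]
dl_b = ["Byung Won On", "J. D. Ullman", "Wei Wang"]

if __name__ == "__main__":
    print(max_cardinality_matching(dl_a, dl_b))
    print(max_weight_matching(dl_a, dl_b))
```

In this sketch the cardinality variant ignores how similar two names are once they pass the threshold, while the weighted variant prefers the pairing with the highest total similarity. That difference is one reason a study might compare both.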
Findings
The authors first validated the effectiveness and efficiency of the bipartite matching algorithms. They then studied the nature of real cases of author name variants found across DLs (e.g. ACM, CiteSeer and DBLP) and within a single DL.
Originality/value
To the best of the authors' knowledge, little research effort has been devoted to understanding the nature of author name variants found in DLs. A thorough analysis can help focus research effort on the real problems that arise when duplicate detection methods are applied.
Acknowledgements
This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) grant funded by the Ministry of Science, ICT, and Future Planning (No. 2013012524) for the first author, by the Basic Science Research Program through the National Research Foundation of Korea (NRF) grant funded by the Ministry of Education, Science, and Technology (No. 20110008686) for the second author, and by the Energy Efficiency & Resources of the Korea Institute of Energy Technology Evaluation and Planning (KETEP) grant funded by the Korea government Ministry of Knowledge Economy (No. 20132010101800) for the first and second authors.
Citation
On, B.-W., Sang Choi, G. and Jung, S.-M. (2014), "A case study for understanding the nature of redundant entities in bibliographic digital libraries", Program: electronic library and information systems, Vol. 48 No. 3, pp. 246-271. https://doi.org/10.1108/PROG-07-2012-0037
Publisher
Emerald Group Publishing Limited
Copyright © 2014, Emerald Group Publishing Limited