To read this content please select one of the options below:

Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities: A crowdsourcing approach

Jihong Liang (Renmin University of China, Beijing, China)
Hao Wang (Renmin University of China, Beijing, China)
Xiaojing Li (Renmin University of China, Beijing, China)

Aslib Journal of Information Management

ISSN: 2050-3806

Article publication date: 25 March 2020

Issue publication date: 20 April 2020

977

Abstract

Purpose

The purpose of this paper is to explore the task design and assignment of full-text generation on mass Chinese historical archives (CHAs) by crowdsourcing, with special attention paid to how to best divide full-text generation tasks into smaller ones assigned to crowdsourced volunteers and to improve the digitization of mass CHAs and the data-oriented processing of the digital humanities.

Design/methodology/approach

This paper starts from the complexities of character recognition of mass CHAs, takes Sheng Xuanhuai archives crowdsourcing project of Shanghai Library as a case study, and makes use of the theories of archival science, including diplomatics of Chinese archival documents, and the historical approach of Chinese archival traditions as the theoretical basis and analysis methods. The results are generated through the comprehensive research.

Findings

This paper points out that volunteer tasks of full-text generation include transcription, punctuation, proofreading, metadata description, segmentation, and attribute annotation in digital humanities and provides a metadata element set for volunteers to use in creating or revising metadata descriptions and also provides an attribute tag set. The two sets can be used across the humanities to construct overall observations about texts and the archives of which they are a part. Along these lines, this paper presents significant insights for application in outlining the principles, methods, activities, and procedures of crowdsourced full-text generation for mass CHAs.

Originality/value

This study is the first to explore and identify the effective design and allocation of tasks for crowdsourced volunteers completing full-text generation on CHAs in digital humanities.

Keywords

Acknowledgements

The authors would like to thank Cuijuan Xia and Meredith Doviak for providing information and data on Sheng Xuanhuai Archives transcription project and Citizen Archivist project. The authors would also like to thank the anonymous reviewers for their comments and questions that have helped to improve the quality of our paper and the editor for patience with the paper. The paper is supported by National Social Science Foundation of China (Grant No. 10&ZD132).

Citation

Liang, J., Wang, H. and Li, X. (2020), "Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities: A crowdsourcing approach", Aslib Journal of Information Management, Vol. 72 No. 2, pp. 262-286. https://doi.org/10.1108/AJIM-09-2019-0245

Publisher

:

Emerald Publishing Limited

Copyright © 2020, Emerald Publishing Limited

Related articles