To read this content please select one of the options below:

Entity recognition in the field of coal mine construction safety based on a pre-training language model

Na Xu (China University of Mining and Technology, Xuzhou, China)
Yanxiang Liang (China University of Mining and Technology, Xuzhou, China)
Chaoran Guo (China University of Mining and Technology, Xuzhou, China)
Bo Meng (China University of Mining and Technology, Xuzhou, China)
Xueqing Zhou (China University of Mining and Technology, Xuzhou, China)
Yuting Hu (China University of Mining and Technology, Xuzhou, China)
Bo Zhang (China University of Mining and Technology, Xuzhou, China)

Engineering, Construction and Architectural Management

ISSN: 0969-9988

Article publication date: 28 December 2023

237

Abstract

Purpose

Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a challenge. This paper aims to develop a knowledge extraction model to automatically and efficiently extract domain knowledge from unstructured texts.

Design/methodology/approach

Bidirectional encoder representations from transformers (BERT)-bidirectional long short-term memory (BiLSTM)-conditional random field (CRF) method based on a pre-training language model was applied to carry out knowledge entity recognition in the field of coal mine construction safety in this paper. Firstly, 80 safety standards for coal mine construction were collected, sorted out and marked as a descriptive corpus. Then, the BERT pre-training language model was used to obtain dynamic word vectors. Finally, the BiLSTM-CRF model concluded the entity’s optimal tag sequence.

Findings

Accordingly, 11,933 entities and 2,051 relationships in the standard specifications texts of this paper were identified and a language model suitable for coal mine construction safety management was proposed. The experiments showed that F1 values were all above 60% in nine types of entities such as security management. F1 value of this model was more than 60% for entity extraction. The model identified and extracted entities more accurately than conventional methods.

Originality/value

This work completed the domain knowledge query and built a Q&A platform via entities and relationships identified by the standard specifications suitable for coal mines. This paper proposed a systematic framework for texts in coal mine construction safety to improve efficiency and accuracy of domain-specific entity extraction. In addition, the pretraining language model was also introduced into the coal mine construction safety to realize dynamic entity recognition, which provides technical support and theoretical reference for the optimization of safety management platforms.

Keywords

Acknowledgements

This work was supported by the National Social Science Fund of China (23BGL277), the Social Science Fund of Jiangsu Province (22GLB023) and Graduate Innovation Program of China University of Mining and Technology (2023WLJCRCZL062).

Citation

Xu, N., Liang, Y., Guo, C., Meng, B., Zhou, X., Hu, Y. and Zhang, B. (2023), "Entity recognition in the field of coal mine construction safety based on a pre-training language model", Engineering, Construction and Architectural Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/ECAM-05-2023-0512

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles