Entity recognition in the field of coal mine construction safety based on a pre-training language model
Engineering, Construction and Architectural Management
ISSN: 0969-9988
Article publication date: 28 December 2023
Abstract
Purpose
Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a challenge. This paper aims to develop a knowledge extraction model to automatically and efficiently extract domain knowledge from unstructured texts.
Design/methodology/approach
Bidirectional encoder representations from transformers (BERT)-bidirectional long short-term memory (BiLSTM)-conditional random field (CRF) method based on a pre-training language model was applied to carry out knowledge entity recognition in the field of coal mine construction safety in this paper. Firstly, 80 safety standards for coal mine construction were collected, sorted out and marked as a descriptive corpus. Then, the BERT pre-training language model was used to obtain dynamic word vectors. Finally, the BiLSTM-CRF model concluded the entity’s optimal tag sequence.
Findings
Accordingly, 11,933 entities and 2,051 relationships in the standard specifications texts of this paper were identified and a language model suitable for coal mine construction safety management was proposed. The experiments showed that F1 values were all above 60% in nine types of entities such as security management. F1 value of this model was more than 60% for entity extraction. The model identified and extracted entities more accurately than conventional methods.
Originality/value
This work completed the domain knowledge query and built a Q&A platform via entities and relationships identified by the standard specifications suitable for coal mines. This paper proposed a systematic framework for texts in coal mine construction safety to improve efficiency and accuracy of domain-specific entity extraction. In addition, the pretraining language model was also introduced into the coal mine construction safety to realize dynamic entity recognition, which provides technical support and theoretical reference for the optimization of safety management platforms.
Keywords
Acknowledgements
This work was supported by the National Social Science Fund of China (23BGL277), the Social Science Fund of Jiangsu Province (22GLB023) and Graduate Innovation Program of China University of Mining and Technology (2023WLJCRCZL062).
Citation
Xu, N., Liang, Y., Guo, C., Meng, B., Zhou, X., Hu, Y. and Zhang, B. (2023), "Entity recognition in the field of coal mine construction safety based on a pre-training language model", Engineering, Construction and Architectural Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/ECAM-05-2023-0512
Publisher
:Emerald Publishing Limited
Copyright © 2023, Emerald Publishing Limited