Coal Engineering ›› 2024, Vol. 56 ›› Issue (2): 206-212.doi: 10. 11799/ ce202402030
Previous Articles Next Articles
飞翔 Lewis刘,
Received:
Revised:
Online:
Published:
Abstract: In order to fully explore the key text knowledge of coal mine safety hidden danger and help the safety management personnel of coal mine enterprises to better carry out hidden danger investigation and management work, a named entity recognition method based on pre-training language model is proposed. Firstly, entity categories of coal mine safety hidden danger were defined according to the new version of Coal Mine Safety Regulations and Criteria for Determining Potential Major Accidents in Coal Mines, and 7 entity categories and 15 entity labels were constructed using BIO labeling strategy. Then, the collected data of coal mine hidden danger investigation are preprocessed, and relevant entities are manually marked by experts in the field of coal mine safety, and 1500 standard data sets of named entities for coal mine safety hidden danger are obtained. Finally, the ERNIE pre-training model is used to represent the text word vector of coal mine safety hidden danger, and the BiGRU structure is used to extract the context semantic features and the CRF model is used to decode the entity label, and complete the research on the named entity recognition of coal mine safety hidden danger. The experimental results show that: The accuracy rate, recall rate and F1 value of ERNIE-BiGRU-CRF model on sequence labeling tasks are 56.69%, 69.23% and 62.34%, which are respectively 6.85%, 13.74% and 9.83% higher than baseline model of BiLSTM-CRF. And there is little difference between the entity extraction result and the actual label result. In addition, the ablation experiment also verified that BiGRU layer could better capture semantic dependencies of text context for coal mine safety hidden danger and CRF layer could further optimize label sequence. It can be seen that the named entity recognition model based on the ERNIE-BiGRU-CRF algorithm structure has a good entity recognition result in the text information extraction of coal mine safety hidden danger, which provides convenience for the accomplishment of intelligent management of coal mine safety hidden danger.
CLC Number:
TD77
TP391.1
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: http://www.coale.com.cn/EN/10. 11799/ ce202402030
http://www.coale.com.cn/EN/Y2024/V56/I2/206
Discussion on the design of blast resistant walls for buildings adjacent to gas extraction pump rooms
Spatio-temporal characteristics and causal factors of major and more severe coal mine accidents in China from 2013 to 2023
Multidimensional control mode for major disaster prevention and control in coal mines of CHN Energy Group