A deep neural network model for coreference resolution in geological domain

被引:6
|
作者
Wan, Bo [1 ,3 ]
Dong, Shuai [1 ]
Chu, Deping [2 ]
Li, Hong [2 ]
Liu, Yiyang [3 ]
Fu, Jinming [1 ]
Fang, Fang [1 ]
Li, Shengwen [1 ]
Zhou, Dan [4 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430073, Peoples R China
[2] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China
[3] Natl Engn Res Ctr GIS, Wuhan 430074, Peoples R China
[4] Wuhan Zondy Cyber, Wuhan 430073, Peoples R China
关键词
Geological text mining; Coreference resolution; Deeping learning; Chinese geological texts;
D O I
10.1016/j.ipm.2023.103268
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Coreference resolution of geological entities is an important task in geological information mining. Although the existing generic coreference resolution models can handle geological texts, a dramatic decline in their performance can occur without sufficient domain knowledge. Due to the high diversity of geological terminology, coreference is intricately governed by the semantic and expressive structure of geological terms. In this paper, a framework CorefRoCNN based on RoBERTa and convolutional neural network (CNN) for end-to-end coreference resolution of geological entities is proposed. Firstly, the fine-tuned RoBERTa language model is used to transform words into dynamic vector representations with contextual semantic information. Second, a CNN-based multi-scale structure feature extraction module for geological terms is designed to capture the invariance of geological terms in length, internal structure, and distribution. Thirdly, we incorporate the structural feature and word embedding for further determinations of coreference relations. In addition, attention mechanisms are used to improve the ability of the model to capture valid information in geological texts with long sentence lengths. To validate the effectiveness of the model, we compared it with several state-of-the-art models on the constructed dataset. The results show that our model has the optimal performance with an average F1 value of 79.78%, which is a 1.22% improvement compared to the second-ranked method.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A deep neural network model for speakers coreference resolution in legal texts
    Ji, Donghong
    Gao, Jun
    Fei, Hao
    Teng, Chong
    Ren, Yafeng
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [2] A set of domain rules and a deep network for protein coreference resolution
    Li, Chen
    Rao, Zhiqiang
    Zheng, Qinghua
    Zhang, Xiangrong
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [3] A Deep Learning Framework for Coreference Resolution Based on Convolutional Neural Network
    Wu, Jheng-Long
    Ma, Wei-Yun
    [J]. 2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, : 61 - 64
  • [4] Deep Neural Networks for Coreference Resolution for Polish
    Niton, Bartlomiej
    Morawiecki, Pawel
    Ogrodniczuk, Maciej
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 395 - 400
  • [5] A Convolutional Deep Neural Network for Coreference Resolution via Modeling Hierarchical Features
    Xi, Xue-Feng
    Zhou, Guodong
    Hu, Fuyuan
    Fu, Bao-chuan
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 361 - 372
  • [6] A Joint Model of Named Entity Recognition and Coreference Resolution Based on Hybrid Neural Network
    Gao, Cheng-Sheng
    Zhang, Jun-Fu
    Li, Wei-Ping
    Zhao, Wen
    Zhang, Shi-Kong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (03): : 442 - 448
  • [7] Deep neural network incorporating domain and resolution transformations model for histopathological image classification
    Mudeng, Vicky
    Choe, Se-woon
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [8] Deep neural network incorporating domain and resolution transformations model for histopathological image classification
    Mudeng, Vicky
    Choe, Se-woon
    [J]. Computers and Electrical Engineering, 2022, 104
  • [9] An ELECTRA-Based Model for Neural Coreference Resolution
    Gargiulo, Francesco
    Minutolo, Aniello
    Guarasci, Raffaele
    Damiano, Emanuele
    De Pietro, Giuseppe
    Fujita, Hamido
    Esposito, Massimo
    [J]. IEEE ACCESS, 2022, 10 : 75144 - 75157
  • [10] Precision Event Coreference Resolution Using Neural Network Classifiers
    Pandian, Arun
    Mulaffer, Lamana
    Oflazer, Kemal
    AlZeyara, Amna
    [J]. COMPUTACION Y SISTEMAS, 2020, 24 (01): : 5 - 16