Empower event detection with bi-directional neural language model

被引:15
|
作者
Zhang, Yunyan [1 ,2 ]
Xu, Guangluan [1 ]
Wang, Yang [1 ]
Liang, Xiao [1 ]
Wang, Lei [1 ]
Huang, Tinglei [1 ]
机构
[1] Chinese Acad Sci, Inst Elect, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
关键词
Information extraction; Event detection; Multi-task learning; Language model;
D O I
10.1016/j.knosys.2019.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event detection is an essential and challenging task in Information Extraction (IE). Recent advances in neural networks make it possible to build reliable models without complicated feature engineering. However, data scarcity hinders their further performance. Moreover, training data has been underused since majority of labels in datasets are not event triggers and contribute very little to the training process. In this paper, we propose a novel multi-task learning framework to extract more general patterns from raw data and make better use of the training data. Specifically, we present two paradigms to incorporate neural language model into event detection model on both word and character levels: (1) we use the features extracted by language model as an additional input to event detection model. (2) We use a hard parameter sharing approach between language model and event detection model. The extensive experiments demonstrate the benefits of the proposed multi-task learning framework for event detection. Compared to the previous methods, our method does not rely on any additional supervision but still beats the majority of them and achieves a competitive performance on the ACE 2005 benchmark. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 97
页数:11
相关论文
共 50 条
  • [1] UnsupervisedWord Segmentation with Bi-directional Neural Language Model
    Wang, Lihao
    Zheng, Xiaoqing
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [2] On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation
    He, Tianxing
    Zhang, Yu
    Droppo, Jasha
    Yu, Kai
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [3] Chinese Document Classification with Bi-directional Convolutional Language Model
    Liu, Bin
    Yin, Guosheng
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1785 - 1788
  • [4] Electricity Theft Detection in Smart Meters Using a Hybrid Bi-directional GRU Bi-directional LSTM Model
    Munawar, Shoaib
    Asif, Muhammad
    Kabir, Beenish
    Pamir
    Ullah, Ashraf
    Javaid, Nadeem
    [J]. COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, CISIS-2021, 2021, 278 : 297 - 308
  • [5] An overview of ICT frauds and their detection with Bi-directional artificial neural networks
    Krenker, Andrej
    Mesojednik, Matevž
    Volk, Mojca
    Bešter, Janez
    Kos, Andrej
    [J]. Elektrotehniski Vestnik/Electrotechnical Review, 2007, 74 (03): : 131 - 137
  • [6] An Overview of ICT Frauds and their Detection with Bi-directional Artificial Neural Networks
    Krenker, Andrej
    Mesojednik, Matevz
    Volk, Mojca
    Bester, Janez
    Kos, Andrej
    [J]. ELEKTROTEHNISKI VESTNIK-ELECTROCHEMICAL REVIEW, 2007, 74 (03): : 131 - 137
  • [7] BI-DIRECTIONAL RECURRENT NEURAL NETWORK WITH RANKING LOSS FOR SPOKEN LANGUAGE UNDERSTANDING
    Ngoc Thang Vu
    Gupta, Pankaj
    Adel, Heike
    Schuetze, Hinrich
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6060 - 6064
  • [8] A Bi-directional Message Passing Model for Salient Object Detection
    Zhang, Lu
    Dai, Ju
    Lu, Huchuan
    He, You
    Wang, Gang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1741 - 1750
  • [9] Contextual sentiment embeddings via bi-directional GRU language model
    Wang, Jin
    Zhang, You
    Yu, Liang-Chih
    Zhang, Xuejie
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 235
  • [10] A bi-directional derivation model of objects
    Swen, B
    [J]. OBJECT-ORIENTED TECHNOLOGY, 1998, : 6 - 11