Chinese Financial Event Extraction Base on Syntactic and Semantic Dependency Parsing

被引:0
|
作者
Wan Q.-Z. [1 ,3 ]
Wan C.-X. [1 ,3 ]
Hu R. [2 ,3 ]
Liu D.-X. [1 ,3 ]
机构
[1] School of Information Technology, Jiangxi University of Finance and Economics, Nanchang
[2] School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang
[3] Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang
来源
基金
中国国家自然科学基金;
关键词
Chinese event extraction; Core verb chain; Default complement; Event semantics relevance; Syntactic semantic dependency parsing graph;
D O I
10.11897/SP.J.1016.2021.00508
中图分类号
学科分类号
摘要
As a sub-task of information extraction, event extraction plays an important role in nature language process applications, such as stock market trend forecast, which can provide strong clues for events users, e.g. investors, managers and government, to analyze the market and make decisions. At present, most of the studies about event extraction pay more attention to the type correctness of triggers and arguments, and not consider the effect and value of event extraction based on application requirements. We call this type of event extraction traditional event extraction. The event types and standards in traditional event extraction are derived from ACE2005 containing 8 categories and 33 sub-categories, KBP2015 and ERE, et al. However, there are some limitations in application of them to event extraction in specific financial domain. For example, there is not the overweight event type in ACE2005, which is a special behavior in the financial domain. In this paper, we focus on the financial news and extract open events without types. In the field of finance and economics, most event users are more concerned with the objects and actions that events affect. Therefore, combined with the application requirement, we propose to extract the financial event ET(Sub, Pred, Obj), where Sub, Pred and Obj represent subject, predicate and object respectively. However, Chinese financial news generally suffers from the event nesting and component default problem, which result in event omission and key element missing of events. To tackle this issue, with the expression habits and characteristics of Chinese linguistics, we build a Chinese event extraction framework based on syntactic and semantic dependency parsing. Then summarize four common default structures and design corresponding completion rules. In particular, at the beginning of this paper, we summarize four prominent phenomena in the extraction of events from the headlines of financial news, and explore the cause of these problems, no in-depth analyzing the relevance of syntactic and semantic structure or lack of it. After that, we employ the syntactic dependency parsing tree and lexical structure, and propose the core verb chains, which make sure that each core verb corresponds to an event solving event leakage problem. Thirdly, we add semantic dependency relation between events on the basis of syntactic dependency tree, which is called Syntactic Semantic Dependency Parsing (SSDP) tree. In order to better separate the detected events and their properties, we adjust and optimize SSDP tree to form the SSDP graph, where the word nodes of the same syntactic structure are at the same level, providing a way for subsequent event extraction. Fourthly, with the division of default structure in linguistic, we summarize four common default structures and propose ten corresponding completion rules to solve the problem of component default. Meanwhile, the whole Chinese event extraction algorithm based SSDP graph is shown at the end of the section. Finally, this paper depicts a detailed experimental situation. The experimental dataset, labeling standard and evaluation index are given. Subsequently, the method in this paper is verified on two datasets, financial news titles and common field news titles. At the end, we conduct comprehensive benchmarks on Chinese financial news titles and CoNLL2009 Chinese Corpus. The experimental results show that the proposed methods are effective. © 2021, Science Press. All right reserved.
引用
收藏
页码:508 / 530
页数:22
相关论文
共 40 条
  • [21] Hong Y, Zhou W, Zhang J, Et al., Self-regulation: Employing a generative adversarial network to improve event detection, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(ACL), pp. 515-526, (2018)
  • [22] Guan C, Cheng Y, Zhao H., Semantic role labeling with associated memory network, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies(NAACL-HLT), pp. 3361-3371, (2019)
  • [23] Li Z, He S, Zhao H, Et al., Dependency or span, end-to-end uniform semantic role labeling, Proceedings of the 33rd AAAI Conference on Artificial Intelligence(AAAI), pp. 6730-6737, (2019)
  • [24] Xia Q, Li Z, Zhang M, Et al., Syntax-aware neural semantic role labeling, Proceedings of the 33rd AAAI Conference on Artificial Intelligence(AAAI), pp. 7305-7313, (2019)
  • [25] He S, Li Z, Zhao H, Et al., Syntax for semantic role labeling, to be, or not to be, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(ACL), pp. 2061-2071, (2018)
  • [26] He L, Lee K, Levy O, Et al., Jointly predicting predicates and arguments in neural semantic role labeling, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(ACL), pp. 364-369, (2018)
  • [27] Mehat S V, Lee J Y, Carbonell J., Towards semi-supervised learning for deep semantic role labeling, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing(EMNLP), pp. 4958-4963, (2018)
  • [28] Tan Z, Wang M, Xie J, Et al., Deep semantic role labeling with self-attention, Proceedings of the 32nd AAAI Conference on Artificial Intelligence(AAAI), pp. 4929-4936, (2018)
  • [29] He L, Lee K, Lewis M, Et al., Deep semantic role labeling: What works and what's next, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics(ACL), pp. 473-483, (2017)
  • [30] Li Z, He S, Cai J, Et al., A unified syntax-aware framework for semantic role labeling, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing(EMNLP), pp. 2401-2411, (2018)