Efficient Temporal Information Extraction from Korean Documents

被引:1
|
作者
Lim, Chae-Gyun [1 ]
Choi, Ho-Jin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
关键词
natural language processing; temporal information extraction; time expression;
D O I
10.1109/MDM.2017.63
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the amount of documents continues to increase steadily, it has become an important issue to shorten processing time in the field of natural language processing. In this paper, we describe a method to reduce the execution speed of the Korean temporal information extraction module from a development perspective. While the rule-based approach is useful for finding time representations from natural language sentences, the process of applying rules for each sentence can take a long time. In addition, in Korean sentences, the linguistic characteristics must be considered together to obtain exact temporal information. We first attempted to reduce the time to look for a time expression using the characteristics of the Korean morpheme, and then modified the module to speed up the search for the entire rule base. And we also show an experimental result that the change of execution speed according to the proposed approaches.
引用
收藏
页码:366 / 370
页数:5
相关论文
共 50 条
  • [1] Extracting Time Information from Korean Documents
    Lee, Seung-Dong
    Jeong, Young-Seob
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 407 - 409
  • [2] Extraction of chemical information from documents
    Villar, Hugo O.
    Betancort, Juan
    Hansen, Mark R.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 240
  • [3] Information Extraction from Legal Documents
    Cheng, Tin Tin
    Cua, Jeffrey Leonard
    Tan, Mark Davies
    Yao, Kenneth Gerard
    Roxas, Rachel Edita
    [J]. 2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 157 - +
  • [4] Semantic information extraction from Tamil documents
    Pandian, S. Lakshmana
    Devakumar, J.
    Geetha, T.V.
    [J]. International Journal of Metadata, Semantics and Ontologies, 2008, 3 (03) : 226 - 232
  • [5] Information Extraction from Arabic Law Documents
    Abu Shamma, Samah
    Ayasa, Aseel
    Sleem, Wala'
    Yahya, Adnan
    [J]. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [6] Information extraction and summarization from medical documents
    Spyropoulos, CD
    Karkatetsis, V
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) : 107 - 110
  • [7] Information Extraction from Chinese Judgment Documents
    Zhuang, Chuhan
    Zhou, Yemao
    Ge, Jidong
    Li, Zhongjin
    Li, Chuanyi
    Zhou, Xiaoyu
    Luo, Bin
    [J]. 2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 240 - 244
  • [8] Extraction and integration of chemical information from documents
    Villar, Hugo O.
    Betancort, Juan
    Hansen, Mark R.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 240
  • [9] Data-Efficient Information Extraction from Documents with Pre-trained Language Models
    Sage, Clement
    Douzon, Thibault
    Aussem, Alex
    Eglin, Veronique
    Elghazel, Haytham
    Duffner, Stefan
    Garcia, Christophe
    Espinas, Jeremy
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 455 - 469
  • [10] Personal Information Extraction from Korean Obituaries
    Han, Kyoung-Soo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (12): : 2873 - 2876