Lightweight Named Entity Extraction for Korean Short Message Service Text

被引:6
|
作者
Seon, Choong-Nyoung [1 ]
Yoo, JinHwan [1 ]
Kim, Harksoo [2 ]
Kim, Ji-Hwan [1 ]
Seo, Jungyun [3 ,4 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul 121742, South Korea
[2] Kangwon Natl Univ, Dept Comp & Commun Engn, Chuncheon Si 200701, Gangwon Do, South Korea
[3] Sogang Univ, Dept Comp Sci, Seoul 121742, South Korea
[4] Sogang Univ, Interdisciplinary Program Integrated Biotechnol, Seoul 121742, South Korea
关键词
Named entity (NE) extraction; machine learning (ML); rule-based; lightweight;
D O I
10.3837/tiis.2011.03.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a hybrid method of Machine Learning (ML) algorithm and a rule-based algorithm to implement a lightweight Named Entity (NE) extraction system for Korean SMS text. NE extraction from Korean SMS text is a challenging theme due to the resource limitation on a mobile phone, corruptions in input text, need for extension to include personal information stored in a mobile phone, and sparsity of training data. The proposed hybrid method retaining the advantages of statistical ML and rule-based algorithms provides fully-automated procedures for the combination of ML approaches and their correction rules using a threshold-based soft decision function. The proposed method is applied to Korean SMS texts to extract person's names as well as location names which are key information in personal appointment management system. Our proposed system achieved 80.53% in F-measure in this domain, superior to those of the conventional ML approaches.
引用
收藏
页码:560 / 574
页数:15
相关论文
共 50 条
  • [41] A named entity relation extraction method based on bootstrapping
    He Tingting
    Xu Chao
    Li Jing
    Zhao Junzhe
    [J]. 2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 758 - 763
  • [42] Active Learning Technique for Biomedical Named Entity Extraction
    Saha, Sriparna
    Ekbal, Asif
    Verma, Mridula
    Sikdar, Utpal
    Poesio, Massimo
    [J]. PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 835 - 841
  • [43] Named Entity Relation Extraction Based on Multiple Features
    Li, Yeqing
    [J]. 2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015, : 213 - 216
  • [44] Arabic Named Entity Recognition from diverse text types
    Shaalan, Khaled
    Raza, Hafsa
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 440 - 451
  • [45] Named Entity Recognition for Mine Electromechanical Equipment Monitoring Text
    Yunfei, Qiu
    Haoran, Xing
    Zhilong, Yu
    Wenwen, Zhang
    [J]. Computer Engineering and Applications, 2024, 60 (11) : 129 - 138
  • [46] Bootstrapped Text-level Named Entity Recognition for Literature
    Brooke, Julian
    Baldwin, Timothy
    Hammond, Adam
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 344 - 350
  • [47] Transfer learning for Turkish named entity recognition on noisy text
    Kagan Akkaya, Emre
    Can, Burcu
    [J]. NATURAL LANGUAGE ENGINEERING, 2021, 27 (01) : 35 - 64
  • [48] Adversarial training for named entity recognition of rail fault text
    Qu, J.
    Su, S.
    Li, R.
    Wang, G.
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1353 - 1358
  • [49] Named Entity Recognition of Chinese Text Based on Attention Mechanism
    Shen, Tong-Ping
    Dumlao, Menchita
    Meng, Qing-Quan
    Zhan, Zhong-Hua
    [J]. Journal of Network Intelligence, 2023, 8 (02): : 505 - 518
  • [50] Joint Learning of Named Entity Recognition and Relation Extraction
    Xu, Qiuyan
    Li, Fang
    [J]. 2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1978 - 1982