Named Entity Recognition for Short Text Messages

Cited by: 16
Authors
Ek, Tobias [1]
Kirkegaard, Camilla [1]
Jonsson, Hakan [2]
Nugues, Pierre [1]
Affiliations
[1] Lund Univ, Dept Comp Sci, Box 118, S-22100 Lund, Sweden
[2] Sony Ericsson, S-22188 Lund, Sweden
Keywords
Named entity recognition; Short text messages; SMS; Information extraction; Ensemble systems
DOI
10.1016/j.sbspro.2011.10.596
Chinese Library Classification
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper describes a named entity recognition (NER) system for short text messages (SMS) running on a mobile platform. Most NER systems deal with text that is structured, formal, well written, with a good grammatical structure, and few spelling errors. SMS text messages lack these qualities and have instead a short-handed and mixed language studded with emoticons, which makes NER a challenge on this kind of material. We implemented a system that recognizes named entities from SMSes written in Swedish and that runs on an Android cellular telephone. The entities extracted are locations, names, dates, times, and telephone numbers with the idea that extraction of these entities could be utilized by other applications running on the telephone. We started from a regular expression implementation that we complemented with classifiers using logistic regression. We optimized the recognition so that the incoming text messages could be processed on the telephone with a fast response time. We reached an F-score of 86 for strict matches and 89 for partial matches. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
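The abstract reports separate F-scores for strict and partial matches but does not define the two criteria. The sketch below shows one common way to compute them, under assumptions that are not taken from the paper: a strict match requires identical character offsets and label, and a partial match requires only overlapping offsets with the same label. The `Entity` type and the `f_score` and `overlaps` functions are hypothetical names introduced for illustration.

```python
from typing import List, NamedTuple


class Entity(NamedTuple):
    start: int   # inclusive character offset
    end: int     # exclusive character offset
    label: str   # e.g. "NAME", "TIME" (illustrative labels)


def overlaps(a: Entity, b: Entity) -> bool:
    """Assumed partial-match rule: same label and overlapping spans."""
    return a.label == b.label and a.start < b.end and b.start < a.end


def f_score(gold: List[Entity], predicted: List[Entity], partial: bool = False) -> float:
    """Harmonic mean of precision and recall under strict or partial matching."""
    if partial:
        matched_pred = sum(any(overlaps(p, g) for g in gold) for p in predicted)
        matched_gold = sum(any(overlaps(g, p) for p in predicted) for g in gold)
    else:
        # Assumed strict rule: offsets and label must be identical.
        gold_set, pred_set = set(gold), set(predicted)
        matched_pred = sum(p in gold_set for p in predicted)
        matched_gold = sum(g in pred_set for g in gold)
    precision = matched_pred / len(predicted) if predicted else 0.0
    recall = matched_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data: the second prediction misses the first character of the gold span.
gold = [Entity(0, 5, "NAME"), Entity(10, 15, "TIME")]
predicted = [Entity(0, 5, "NAME"), Entity(11, 15, "TIME")]

strict_f = f_score(gold, predicted)
partial_f = f_score(gold, predicted, partial=True)
```

On this toy data the boundary error counts against the strict score but not the partial one, which is why a partial-match score is usually at least as high as the strict score on the same output.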
Pages: 178-187
Page count: 10