Named Entity Recognition for Short Text Messages

Cited by: 16
Authors
Ek, Tobias [1]
Kirkegaard, Camilla [1]
Jonsson, Hakan [2]
Nugues, Pierre [1]
Affiliations
[1] Lund Univ, Dept Comp Sci, Box 118, S-22100 Lund, Sweden
[2] Sony Ericsson, S-22188 Lund, Sweden
Keywords
Named entity recognition; Short text messages; SMS; Information extraction; Ensemble systems
DOI
10.1016/j.sbspro.2011.10.596
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification code
030303; 0501; 050102
Abstract
This paper describes a named entity recognition (NER) system for short text messages (SMS) running on a mobile platform. Most NER systems deal with text that is structured, formal, well written, with a good grammatical structure, and few spelling errors. SMS text messages lack these qualities and have instead a short-handed and mixed language studded with emoticons, which makes NER a challenge on this kind of material. We implemented a system that recognizes named entities from SMSes written in Swedish and that runs on an Android cellular telephone. The entities extracted are locations, names, dates, times, and telephone numbers with the idea that extraction of these entities could be utilized by other applications running on the telephone. We started from a regular expression implementation that we complemented with classifiers using logistic regression. We optimized the recognition so that the incoming text messages could be processed on the telephone with a fast response time. We reached an F-score of 86 for strict matches and 89 for partial matches. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
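The abstract names a regular-expression baseline as the starting point but gives none of the actual rules. As a minimal sketch of what such a baseline could look like for two of the listed entity types (times and telephone numbers), the Python below uses hypothetical patterns and a hypothetical `extract_entities` helper; these are illustrative assumptions, not the authors' implementation, which ran on Android and added logistic regression classifiers on top.

```python
import re
from typing import List, Tuple

# Hypothetical patterns for illustration only; the paper's rules are not
# given in the abstract. TIME accepts 24-hour clock times such as 18:30 or
# 8.15; PHONE accepts Swedish-style numbers starting with 0 or +46, with
# optional spaces or hyphens between digits.
PATTERNS = {
    "TIME": re.compile(r"\b(?:[01]?\d|2[0-3])[:.][0-5]\d\b"),
    "PHONE": re.compile(r"(?<!\d)(?:\+46|0)\d(?:[ -]?\d){6,9}(?!\d)"),
}


def extract_entities(text: str) -> List[Tuple[str, str, int, int]]:
    """Return (label, surface, start, end) for each match, sorted by start offset."""
    found = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            found.append((label, match.group(), match.start(), match.end()))
    return sorted(found, key=lambda entity: entity[2])


if __name__ == "__main__":
    # Example Swedish SMS: "Shall we meet 18:30? Call 070-123 45 67"
    for entity in extract_entities("Ses vi 18:30? Ring 070-123 45 67"):
        print(entity)
```

Rules like these handle only well-delimited entity types. Names and locations, which the abstract also lists, generally need context that fixed patterns cannot capture, which is consistent with the authors complementing the regular expressions with classifiers.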
Pages: 178-187
Page count: 10