Part-Of-Speech Labeling for Reuters Database

被引:0
|
作者
Cretulescu, R. [1 ]
David, A. [1 ]
Morariu, D. [1 ]
Vintan, L. [1 ]
机构
[1] Lucian Blaga Univ Sibiu, Comp Sci & Elect Engn Dept, Sibiu, Romania
关键词
Documents Representation; Vector Space Model; Tagging Algorithms; Part of Speech;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Even if the Vector Space Model used for document representation in information retrieval systems integrates a small quantity of knowledge it continues to be used due to its computational cost, speed execution and simplicity. We try to improve this document representation by adding some syntactic information such as the parts of speech. In this paper, we have evaluated three different tagging algorithms in order to select the most suitable tagger for using it to tag the Reuters dataset. In this work, we have evaluated the taggers using only five different parts of speech: noun, verb, adverb, adjective and others. We considered these particular tags being the most representative for describing the documents into these parts of speech space.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 50 条
  • [1] Part-of-speech persistence: The influence of part-of-speech information on lexical processes
    Melinger, Alissa
    Koenig, Jean-Pierre
    JOURNAL OF MEMORY AND LANGUAGE, 2007, 56 (04) : 472 - 489
  • [2] Part-of-speech tagging
    Martinez, Angel R.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): : 107 - 113
  • [3] ADVERBIAL PART-OF-SPEECH
    CERVONI, J
    LANGUE FRANCAISE, 1990, (88): : 5 - 11
  • [4] A Universal Part-of-Speech Tagset
    Petrov, Slav
    Das, Dipanjan
    McDonald, Ryan
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2089 - 2096
  • [5] Part-of-speech tagging for Swedish
    Prütz, K
    PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): : 201 - 206
  • [6] Part-of-Speech Induction for Vietnamese
    Phuong Le-Hong
    Thi Minh Huyen Nguyen
    KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 261 - 272
  • [7] PART-OF-SPEECH IMPLICATIONS OF AFFIXES
    EARL, LL
    MECHANICAL TRANSLATION, 1966, 9 (02): : 38 - &
  • [8] Part-of-speech studies in Chinese
    Wang, Lu
    JOURNAL OF QUANTITATIVE LINGUISTICS, 2016, 23 (03) : 235 - 255
  • [9] The Effect of Part-of-speech on Mandarin Speech Recognition
    Gong, Caixia
    Li, Xiangang
    Wu, Xihong
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [10] Controlling Complexity in Part-of-Speech Induction
    Graca, Joan V.
    Ganchev, Kuzman
    Coheur, Luisa
    Pereira, Fernando
    Taskar, Ben
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 41 : 527 - 551