A Comparison of Word- and Sense-Based Text Categorization Using Several Classification Algorithms

被引:0
|
作者
Athanasios Kehagias
Vassilios Petridis
Vassilis G. Kaburlasos
Pavlina Fragkou
机构
[1] Aristotle University of Thessaloniki (AUTh),Department of Math., Phys. and Comp. Sciences, Division of Mathematics
[2] Aristotle University of Thessaloniki (AUTh),Department of Electrical and Computer Engineering, Division of Electronics and Computer Engineering
[3] Technological Educational Institute of Kavala,Department of Industrial Informatics, Division of Software Systems
关键词
text categorization; word senses; information retrieval; FLNMAP with voting;
D O I
暂无
中图分类号
学科分类号
摘要
Most of the text categorization algorithms in the literature represent documents as collections of words. An alternative which has not been sufficiently explored is the use of word meanings, also known as senses. In this paper, using several algorithms, we compare the categorization accuracy of classifiers based on words to that of classifiers based on senses. The document collection on which this comparison takes place is a subset of the annotated Brown Corpus semantic concordance. A series of experiments indicates that the use of senses does not result in any significant categorization improvement.
引用
收藏
页码:227 / 247
页数:20
相关论文
共 50 条
  • [1] A comparison of word- and sense-based text categorization using several classification algorithms
    Kehagias, A
    Petridis, V
    Kaburlasos, VG
    Fragkou, P
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2003, 21 (03) : 227 - 247
  • [2] A method for automatic text categorization using word sense disambiguation
    Montes Rendon, Azucena
    Vargas A., Rocio
    Estrada Esquivel, Hugo
    Gonzalez Serna, Juan G.
    Ruiz Ascencio, Jose
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 2, PROCEEDINGS, 2008, 5073 : 1158 - 1169
  • [3] Word Sense Disambiguation for Arabic Text Categorization
    Hadni, Meryeme
    El Alaoui, Said
    Lachkar, Abdelmonaime
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (1A) : 215 - 222
  • [4] Comparison of Text Categorization Algorithms
    SHI Yong-feng
    [J]. Wuhan University Journal of Natural Sciences, 2004, (05) : 798 - 804
  • [5] Word Sense Representation based-method for Arabic Text Categorization
    El-Alami, Fatima-Zahra
    Ouatik El Alaoui, Said
    [J]. 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC 2018), 2018, : 141 - 146
  • [6] The role of word sense disambiguation in automated text categorization
    Hidalgo, JMG
    Rodríguez, MD
    Pérez, JCC
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 298 - 309
  • [7] Text categorization algorithms using semantic approaches, corpus-based thesaurus and Word Net
    Li, Cheng Hua
    Yang, Ju Cheng
    Park, Soon Cheol
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 765 - 772
  • [8] Sense-Based Topic Word Embedding Model for Item Recommendation
    Xiao, Ya
    Fan, Zhijie
    Tan, Chengxiang
    Xu, Qian
    Zhu, Wenye
    Cheng, Fujia
    [J]. IEEE ACCESS, 2019, 7 : 44748 - 44760
  • [9] A comparison of several ensemble methods for text categorization
    Dong, YS
    Han, KS
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, PROCEEDINGS, 2004, : 419 - 422
  • [10] Comparison of Genres in Word Sense Disambiguation using Automatically Generated Text Collections
    Bolshina, Angelina
    Loukachevitch, Natalia
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA (CLIB '20), 2020, : 155 - 164