A fuzzy-based approach for text representation in text categorization

Cited by: 0
Authors
Doan, S
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications, including document management, information retrieval, and text routing. In this paper, we propose a novel scheme for text representation based on fuzzy set theory. We give a new algorithm, formulated in terms of fuzzy sets, for choosing a term set that characterizes a document in the corpus. Experimental results on a text categorization problem using the relevance feedback technique show that the proposed method reduces the number of dimensions and achieves higher performance than other baseline methods. In addition, its results compare favorably with those achieved by the full-vocabulary method.
Pages: 1008-1013
Number of pages: 6