A fuzzy-based approach for text representation in text categorization

被引:0
|
作者
Doan, S
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications that include document management, information retrieval, text routing, etc. In this paper, we proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm for choosing a term set that characterizes a document in the corpus is given under the view of fuzzy set. Experimental results applied to text categorization problem using the relevance feedback technique show that our proposed method reduced the number of dimensions and achieves higher performances compared to other baseline methods. In addition, it also produces results that compare favorably to the result achieved with the all vocabulary method.
引用
收藏
页码:1008 / 1013
页数:6
相关论文
共 50 条
  • [1] A general fuzzy-based framework for text representation and its application to text categorization
    Doan, Son
    Ha, Quang-Thuy
    Horiguchi, Susumu
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 611 - 620
  • [2] An incremental approach to text representation, categorization, and retrieval
    ONeil, P
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 714 - 717
  • [3] A comparative study on text representation schemes in text categorization
    Song, FX
    Liu, SH
    Yang, JY
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2005, 8 (1-2) : 199 - 209
  • [4] A comparative study on text representation schemes in text categorization
    Fengxi Song
    Shuhai Liu
    Jingyu Yang
    [J]. Pattern Analysis and Applications, 2005, 8 : 199 - 209
  • [5] A Genetic-Fuzzy Approach for Automatic Text Categorization
    Kumbhar, Pradnya
    Mali, Manisha
    Atique, Mohammad
    [J]. 2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 572 - 578
  • [6] Item Categorization Algorithm Based on Improved Text Representation
    Zhenchao, Tu
    Jing, Ma
    [J]. Data Analysis and Knowledge Discovery, 2022, 6 (05) : 34 - 43
  • [7] Cluster Based Symbolic Representation for Skewed Text Categorization
    Raju, Lavanya Narayana
    Suhil, Mahamad
    Guru, D. S.
    Gowda, Harsha S.
    [J]. RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 202 - 216
  • [8] Text categorization based on dissimilarity representation and prototype selection
    Pinheiro, Roberto H. W.
    Cavalcanti, George D. C.
    Ren, Tsang Ing
    [J]. 2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 163 - 168
  • [9] Multilabel Text Categorization Based on Fuzzy Relevance Clustering
    Lee, Shie-Jue
    Jiang, Jung-Yi
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) : 1457 - 1471
  • [10] Text Categorization Based on Fuzzy Soft Set Theory
    Handaga, Bana
    Deris, Mustafa Mat
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT IV, 2012, 7336 : 340 - 352