A fuzzy-based approach for text representation in text categorization

被引:0
|
作者
Doan, S
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications that include document management, information retrieval, text routing, etc. In this paper, we proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm for choosing a term set that characterizes a document in the corpus is given under the view of fuzzy set. Experimental results applied to text categorization problem using the relevance feedback technique show that our proposed method reduced the number of dimensions and achieves higher performances compared to other baseline methods. In addition, it also produces results that compare favorably to the result achieved with the all vocabulary method.
引用
收藏
页码:1008 / 1013
页数:6
相关论文
共 50 条
  • [31] A term weighting approach for text categorization
    Lee, KC
    Kang, SS
    Hahn, KS
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 673 - 678
  • [32] An enhanced text categorization method based on improved text frequency approach and mutual information algorithm
    Pei Zhili
    Shi Xiaohu
    Marchese, Maurizio
    Liang Yanchun
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2007, 17 (12) : 1494 - 1500
  • [33] An enhanced text categorization method based on improved text frequency approach and mutual information algorithm
    Maurizio Marchese
    [J]. Progress in Natural Science:Materials International, 2007, (12) : 1494 - 1500
  • [34] An enhanced text categorization method based on improved text frequency approach and mutual information algorithm
    Key Laboratory of Symbol Computation and Knowledge Engineering, College of Computer Science and Technology, Jilin University, Changchun 130012, China
    不详
    不详
    [J]. Prog Nat Sci, 2007, 12 (1494-1500):
  • [35] A WordNet-based approach to feature selection in text categorization
    Zhang, K
    Sun, J
    Wang, B
    [J]. INTELLIGENT INFORMATION PROCESSING II, 2005, 163 : 475 - 484
  • [36] Hybrid approach for text categorization based on machine learning and rules
    Villena-Roman, Julio
    Collada-Perez, Sonia
    Lana-Serrano, Sara
    Carlos Gonzalez-Cristobal, Jose
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 35 - 42
  • [37] Improving text categorization A fully automated ontology based approach
    Machhour, Hamid
    Kassou, Ismail
    [J]. 2013 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND INFORMATION TECHNOLOGY (ICCIT), 2013, : 67 - 72
  • [38] An Overview of Unsupervised Deep Feature Representation for Text Categorization
    Wang, Shiping
    Cai, Jinyu
    Lin, Qihao
    Guo, Wenzhong
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (03) : 504 - 517
  • [39] Effective Collaborative Representation Learning for Multilabel Text Categorization
    Wu, Hao
    Qin, Shaowei
    Nie, Rencan
    Cao, Jinde
    Gorbachev, Sergey
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5200 - 5214
  • [40] A Text Categorization Method Using Graph Convolutional Network based on Sparse Representation
    Yang, Kun
    Wang, Xiaoping
    Zheng, Ying
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8756 - 8759