Adapting Associative Classification to Text Categorization

被引:0
|
作者
Li, Baoli [1 ]
Sugandh, Neha [1 ]
Garcia, Ernest V.
Ram, Ashwin [1 ]
机构
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
关键词
Text Categorization; Associative Classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Associative classification, which originates from numerical data mining, has been applied to deal with text data recently. Text data is firstly digitalized to database of transactions, and then training and prediction is actually conducted on the derived numerical dataset. This intuitive strategy has demonstrated quite good performance. However, it doesn't take into consideration the inherent characteristics of text data as much as possible, although it has to deal with some specific problems of text data such as lemmatizing and stemming during digitalization. In this paper, we propose a bottom-up strategy to adapt associative classification to text categorization, in which we take into account structure information of text. Experiments on Reuters-21578 dataset show that the proposed strategy can make use of text structure information and achieve better performance.
引用
下载
收藏
页码:205 / 207
页数:3
相关论文
共 50 条
  • [31] Integrating associative rule-based classification with Naive Bayes for text classification
    Hadi, Wa'el
    Al-Radaideh, Qasem A.
    Alhawari, Samer
    APPLIED SOFT COMPUTING, 2018, 69 : 344 - 356
  • [32] Text Associative Classification Approach for Mining Arabic Data Set
    Ghareb, Abdullah S.
    Hamdan, Abdul Razak
    Abu Bakar, Azuraliza
    2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 114 - 120
  • [33] Termset weighting by adapting term weighting schemes to utilize cardinality statistics for binary text categorization
    Dima Badawi
    Hakan Altınçay
    Applied Intelligence, 2017, 47 : 456 - 472
  • [34] Termset weighting by adapting term weighting schemes to utilize cardinality statistics for binary text categorization
    Badawi, Dima
    Altincay, Hakan
    APPLIED INTELLIGENCE, 2017, 47 (02) : 456 - 472
  • [35] Supervised classification by thresholds: Application to automated text categorization and opinion mining
    Cherif, Walid
    Madani, Abdellah
    Kissi, Mohamed
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (04):
  • [36] Research on Multi-Classification and Multi-Label in Text Categorization
    Hua, Liu
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 86 - 89
  • [37] Software design patterns classification and selection using text categorization approach
    Hussain, Shahid
    Keung, Jacky
    Khan, Arif Ali
    APPLIED SOFT COMPUTING, 2017, 58 : 225 - 244
  • [38] Refinement of Index Term Set and Improvement of Classification Accuracy on Text Categorization
    Suzuki, Makoto
    Ishida, Takashi
    Goto, Masayuki
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY AND ITS APPLICATIONS, VOLS 1-3, 2008, : 449 - +
  • [39] Text categorization using SVMs with Rocchio ensemble for Internet information classification
    Xu, X
    Zhang, BF
    Zhong, QX
    NETWORKING AND MOBILE COMPUTING, PROCEEDINGS, 2005, 3619 : 1022 - 1031
  • [40] Text Categorization: Implementation
    Jo, Taeho
    Studies in Big Data, 2019, 45 : 129 - 156