Deep learning- and word embedding-based heterogeneous classifier ensembles for text classification

Cited by: 21
Authors
Kilimci Z.H. [1]
Akyokus S. [2]
Affiliations
[1] Computer Engineering Department, Dogus University, Istanbul
[2] Computer Engineering Department, İstanbul Medipol University, Istanbul
Keywords
All Open Access; Gold;
DOI
10.1155/2018/7130146
Abstract
Ensemble learning, deep learning, and effective document representation methods are currently among the most common approaches to improving the overall accuracy of a text classification/categorization system. Ensemble learning raises the overall accuracy of a classification system by combining multiple classifiers. Deep learning-based methods provide better results than conventional machine learning algorithms in many applications. Word embeddings represent words learned from a corpus as vectors, so that words with similar meanings have similar representations. In this study, we use different document representations that draw on word embeddings, together with an ensemble of base classifiers, for text classification. The ensemble of base classifiers includes traditional machine learning algorithms such as naïve Bayes, support vector machine, and random forest, as well as a deep learning-based convolutional network classifier. We analysed the classification accuracy of different document representations by employing an ensemble of classifiers on eight different datasets. Experimental results demonstrate that using heterogeneous ensembles together with deep learning methods and word embeddings enhances text classification performance. Copyright © 2018 Zeynep H. Kilimci and Selim Akyokus.
Related papers
50 in total
  • [1] Deep Learning- and Word Embedding-Based Heterogeneous Classifier Ensembles for Text Classification
    Kilimci, Zeynep H.
    Akyokus, Selim
    COMPLEXITY, 2018,
  • [2] Word embedding and text classification based on deep learning methods
    Li, Saihan
    Gong, Bing
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [3] Word Embedding-Based Biomedical Text Summarization
    Rouane, Oussama
    Belhadef, Hacene
    Bouakkaz, Mustapha
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 288 - 297
  • [4] Sequential Embedding-based Attentive (SEA) classifier for malware classification
    Ahmed, Muhammad
    Qureshi, Anam
    Shamsi, Jawwad Ahmed
    Marvi, Murk
    2022 INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS), 2022, : 28 - 35
  • [5] PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models
    He, Bing
    Ahamad, Mustaque
    Kumar, Srijan
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 575 - 584
  • [6] Word embedding-based relation modeling in a heterogeneous information network
    Jiwan Seo
    Seungjin Choi
    Yura Alex Kim
    Karam Yoo
    Sangyong Han
    Multimedia Tools and Applications, 2018, 77 : 18529 - 18543
  • [7] Word embedding-based relation modeling in a heterogeneous information network
    Seo, Jiwan
    Choi, Seungjin
    Kim, Yura Alex
    Yoo, Karam
    Han, Sangyong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18529 - 18543
  • [8] The Evaluation of Word Embedding Models and Deep Learning Algorithms for Turkish Text Classification
    Kilimci, Zeynep Hilal
    Akyokus, Selim
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 548 - 553
  • [9] Word Embedding-based Web Service Representations for Classification and Clustering
    Zhang, Xiangping
    Liu, Jianxun
    Shi, Min
    Cao, Buqing
    2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 34 - 43
  • [10] Word Embedding-based Text Processing for Comprehensive Summarization and Distinct Information Extraction
    Wan, Xiangpeng
    Ghazzai, Hakim
    Massoud, Yehia
    2020 IEEE TECHNOLOGY & ENGINEERING MANAGEMENT CONFERENCE (TEMSCON 2020), 2020,