A Novel Active Learning Method Using SVM for Text Classification

被引:145
|
作者
Goudjil M. [1 ]
Koudil M. [1 ]
Bedda M. [2 ]
Ghoggali N. [3 ]
机构
[1] École nationale Supérieure d’Informatique (ESI), Oued Smar, Algiers
[2] AL Jouf University, Sakaka
[3] LAAAS laboratory, Faculté de Technologie, Université Batna 2, Batna
关键词
active learning; pairwise coupling; pool-based active learning; support vector machine (SVM); Text categorization;
D O I
10.1007/s11633-015-0912-z
中图分类号
学科分类号
摘要
Support vector machines (SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data classification and information retrieval, they require manually labeled data samples in the training stage. However, manual labeling is a time consuming and errorprone task. One possible solution to this issue is to exploit the large number of unlabeled samples that are easily accessible via the internet. This paper presents a novel active learning method for text categorization. The main objective of active learning is to reduce the labeling effort, without compromising the accuracy of classification, by intelligently selecting which samples should be labeled. The proposed method selects a batch of informative samples using the posterior probabilities provided by a set of multi-class SVM classifiers, and these samples are then manually labeled by an expert. Experimental results indicate that the proposed active learning method significantly reduces the labeling effort, while simultaneously enhancing the classification accuracy. © 2016, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag GmbH Germany, part of Springer Nature.
引用
收藏
页码:290 / 298
页数:8
相关论文
共 50 条
  • [1] A Novel Active Learning Method Using SVM for Text Classification
    Mohamed Goudjil
    Mouloud Koudil
    Mouldi Bedda
    Noureddine Ghoggali
    Machine Intelligence Research, 2018, (03) : 290 - 298
  • [2] Arabic Text Categorization Using SVM Active Learning Technique : An Overview
    Goudjil, Mohamed
    Koudil, Mouloud
    Hammami, Nacereddine
    Bedda, Mouldi
    Alruily, Meshrif
    WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,
  • [3] Using Active Learning in Text Classification of Quranic Sciences
    Goudjil, Mohamed
    Bedda, Mouldi
    Koudil, Mouloud
    Ghoggali, Noureddine
    2013 TAIBAH UNIVERSITY INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY FOR THE HOLY QURAN AND ITS SCIENCES, 2013, : 209 - 213
  • [4] Text classification with active learning
    Novak, B
    Mladenic, D
    Grobelnik, M
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 398 - +
  • [5] AN ACTIVE LEARNING METHOD BASED ON SVM CLASSIFIER FOR HYPERSPECTRAL IMAGES CLASSIFICATION
    Sun, Shujin
    Zhong, Ping
    Xiao, Huaitie
    Liu, Fang
    Wang, Runsheng
    2015 7TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2015,
  • [6] On Text-based Mining with Active Learning and Background Knowledge Using SVM
    Catarina Silva
    Bernardete Ribeiro
    Soft Computing, 2007, 11 : 519 - 530
  • [7] On text-based mining with active learning and background knowledge using SVM
    Silva, Catarina
    Ribeiro, Bernardete
    SOFT COMPUTING, 2007, 11 (06) : 519 - 530
  • [8] SVM Active Learning Approach for Image Classification Using Spatial Information
    Pasolli, Edoardo
    Melgani, Farid
    Tuia, Devis
    Pacifici, Fabio
    Emery, William J.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (04): : 2217 - 2233
  • [9] Text Classification Using SVM with Exponential Kernel
    Chen, Junting
    Zhong, Jian
    Xie, Yicai
    Cai, Caiyun
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 807 - +
  • [10] SVM based adaptive learning method for text classification from positive and unlabeled documents
    Peng, Tao
    Zuo, Wanli
    He, Fengling
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 281 - 301