A multi-one-class dynamic classifier for adaptive digitization of document streams

被引:0
|
作者
Ho, Anh Khoi Ngo [1 ]
Eglin, Veronique [1 ]
Ragot, Nicolas [2 ]
Ramel, Jean-Yves [2 ]
机构
[1] Univ Lyon, CNRS, INSA Lyon, LIRIS,UMR 5205, F-69621 Lyon, France
[2] Univ Francois Rabelais Tours, Lab Informat LI EA 6300, Tours, France
关键词
Stream-based document images classification; Online document content and quality classification; Incremental learning; Concept drift; One-class SVM; NEURAL-NETWORK; FUZZY ARTMAP; ARCHITECTURE; ENSEMBLE;
D O I
10.1007/s10032-017-0286-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new dynamic classifier design based on a set of one-class independent SVM for image data stream categorization. Dynamic or continuous learning and classification has been recently investigated to deal with different situations, like online learning of fixed concepts, learning in non-stationary environments (concept drift) or learning from imbalanced data. Most of solutions are not able to deal at the same time with many of these specificities. Particularly, adding new concepts, merging or splitting concepts are most of the time considered as less important and are consequently less studied, whereas they present a high interest for stream-based document image classification. To deal with that kind of data, we explore a learning and classification scheme based on one-class SVM classifiers that we call mOC-iSVM (multi-one-class incremental SVM). Even if one-class classifiers are suffering from a lack of discriminative power, they have, as a counterpart, a lot of interesting properties coming from their independent modeling. The experiments presented in the paper show the theoretical feasibility on different benchmarks considering addition of new classes. Experiments also demonstrate that the mOC-iSVM model can be efficiently used for tasks dedicated to documents classification (by image quality and image content) in a context of streams, handling many typical scenarii for concepts extension, drift, split and merge.
引用
收藏
页码:137 / 154
页数:18
相关论文
共 50 条
  • [1] A multi-one-class dynamic classifier for adaptive digitization of document streams
    Anh Khoi Ngo Ho
    Véronique Eglin
    Nicolas Ragot
    Jean-Yves Ramel
    International Journal on Document Analysis and Recognition (IJDAR), 2017, 20 : 137 - 154
  • [2] MultiKOC: Multi-One-Class Classifier Based K-Means Clustering
    Abdallah, Loai
    Badarna, Murad
    Khalifa, Waleed
    Yousef, Malik
    ALGORITHMS, 2021, 14 (05)
  • [3] Dynamic classifier selection for one-class classification
    Krawczyk, Bartosz
    Wozniak, Michal
    KNOWLEDGE-BASED SYSTEMS, 2016, 107 : 43 - 53
  • [4] A Fuzzy One Class Classifier for Multi Layer Model
    Lo Bosco, Giosue
    Pinello, Luca
    FUZZY LOGIC AND APPLICATIONS, 2009, 5571 : 124 - 131
  • [5] Incremental weighted one-class classifier for mining stationary data streams
    Krawczyk, Bartosz
    Wozniak, Michal
    JOURNAL OF COMPUTATIONAL SCIENCE, 2015, 9 : 19 - 25
  • [6] A Compliant Document Image Classification System based on One-Class Classifier
    Sidere, Nicolas
    Ramel, Jean-Yves
    Barrat, Sabine
    D'Andecy, Vincent Poulain
    Kebairi, Saddok
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 96 - 101
  • [7] One-Class Classification Ensemble with Dynamic Classifier Selection
    Krawczyk, Bartosz
    Wozniak, Michal
    ADVANCES IN NEURAL NETWORKS - ISNN 2014, 2014, 8866 : 542 - 549
  • [8] Multiple One-Class Classifier Combination for Multi-Class Classification
    Hadjadji, Bilal
    Chibani, Youcef
    Guerbai, Yasmine
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2832 - 2837
  • [9] Gabor filter based multi-class classifier for scanned document images
    Ma, HF
    Doermann, D
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 968 - 972
  • [10] On the usefulness of one-class classifier ensembles for decomposition of multi-class problems
    Krawczyk, Bartosz
    Wozniak, Michal
    Herrera, Francisco
    PATTERN RECOGNITION, 2015, 48 (12) : 3969 - 3982