Generic and optimized framework for multi-content analysis based on learning approaches

Authors
Besnehard, Quentin [1]
Marchessoux, Cedric [1]
Kimpe, Tom [1]
Affiliations
[1] Barco NV, B-8500 Kortrijk, Belgium
Keywords
Multi-content analysis; machine learning; AdaBoost; decision trees; optimization
DOI
10.1117/12.838616
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
During the European Cantata project (an ITEA project, 2006-2009), Barco developed a Multi-Content Analysis (MCA) framework for classifying compound images into several categories (text, graphical user interface, medical images, other complex images). The framework consists of six parts: a dataset, a feature selection method, a machine-learning-based MCA algorithm, a ground truth, a metrics-based evaluation module, and a presentation module. The methodology is built on a cascade of decision-tree-based classifiers that are combined and trained with the AdaBoost meta-algorithm. To train these classifiers on large datasets without excessively increasing the training time, optimizations were implemented at two levels: the methodology itself (feature selection and elimination, dataset pre-computation) and the decision-tree training algorithm (binary threshold search, dataset presorting, and an alternate splitting algorithm). These optimizations have little or no negative impact on the classification performance of the resulting classifiers. Training time was significantly reduced, mainly because the optimized decision-tree training algorithm has a lower algorithmic complexity. The time saved was used to compare results across a larger number of training parameter settings.
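The abstract names dataset presorting and AdaBoost-trained decision trees without giving implementation details. The following is a minimal sketch of how those two ideas can fit together, not the authors' implementation. It assumes single-level trees (decision stumps) rather than full trees, omits the cascade and feature selection stages, and uses hypothetical function names (`presort`, `train_stump`, `adaboost`, `predict`). Each feature is sorted once before boosting starts, so every boosting round finds the best threshold with one linear sweep per feature instead of re-sorting.

```python
import math

def presort(X):
    """Sort sample indices by each feature once, before any boosting round."""
    n_features = len(X[0])
    return [sorted(range(len(X)), key=lambda i, j=j: X[i][j])
            for j in range(n_features)]

def stump_predict(x, j, thr, pol):
    """Stump rule: pol if x[j] > thr, otherwise -pol."""
    return pol if x[j] > thr else -pol

def train_stump(X, y, w, order):
    """Return (error, feature, threshold, polarity) of the best weighted stump.

    y holds labels in {-1, +1}; w holds sample weights that sum to 1.
    """
    def consider(best, err, j, thr):
        # 1 - err is the weighted error of the same split with flipped polarity.
        for e, pol in ((err, 1), (1.0 - err, -1)):
            if e < best[0]:
                best = (e, j, thr, pol)
        return best

    total_neg = sum(wi for wi, yi in zip(w, y) if yi == -1)
    best = (float("inf"), 0, float("-inf"), 1)
    for j, idx in enumerate(order):
        # Threshold below every value: all samples predicted +1.
        err = total_neg
        best = consider(best, err, j, float("-inf"))
        for k, i in enumerate(idx):
            # Sample i moves to the "<= threshold" side, now predicted -1.
            err += w[i] if y[i] == 1 else -w[i]
            last = k + 1 == len(idx)
            if not last and X[idx[k + 1]][j] == X[i][j]:
                continue  # no valid threshold between equal values
            thr = X[i][j] if last else (X[i][j] + X[idx[k + 1]][j]) / 2
            best = consider(best, err, j, thr)
    return best

def adaboost(X, y, rounds):
    """Discrete AdaBoost over decision stumps, reusing the presorted indices."""
    n = len(X)
    order = presort(X)  # computed once, shared by every round
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        err, j, thr, pol = train_stump(X, y, w, order)
        err = min(max(err, 1e-12), 1 - 1e-12)  # avoid log(0) on perfect stumps
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, j, thr, pol))
        # Increase the weight of misclassified samples, then renormalize.
        w = [wi * math.exp(-alpha * yi * stump_predict(x, j, thr, pol))
             for wi, yi, x in zip(w, y, X)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, x):
    """Sign of the alpha-weighted vote of all stumps."""
    score = sum(a * stump_predict(x, j, thr, pol) for a, j, thr, pol in ensemble)
    return 1 if score >= 0 else -1
```

In this sketch the sort costs O(n log n) per feature once, and each boosting round then costs O(n) per feature. Whether the paper's lower algorithmic complexity comes from this exact scheme is not stated in the abstract.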
Pages: 12