Chart classification: a survey and benchmarking of different state-of-the-art methods

被引:0
|
作者
Thiyam, Jennil [1 ]
Singh, Sanasam Ranbir [1 ]
Bora, Prabin Kumar [1 ]
机构
[1] IIT Guwahati, Gauhati, India
关键词
Chart survey; Chart image classification; Chart dataset; Chart classification error analysis; Chart's noise; Confusing chart pairs; FIGURES;
D O I
10.1007/s10032-023-00443-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increase in the number of documents with various types of charts available on the internet, automatic chart classification has become an essential task for various downstream applications such as chart data recovery, chart replenishment. This paper presents a comprehensive survey of the studies reported in the literature since 2001 from the perspective of the corpus, pre-processing techniques, feature extraction, and methodologies. Considering that the majority of the existing studies use small datasets with a smaller number of chart types and also reported varying performances, this paper implements and evaluates 44 different machine learning-based chart classification models. The evaluation is done over a large dataset curated locally and benchmarks the performances of these 44 different models over a common experimental framework. It also performs a comprehensive error analysis, identifying two core challenging issues (noise in the charts and confusing chart pairs) that affect the chart classification performances. Compared with the existing survey papers, this paper presents a more comprehensive review and experimental analysis.
引用
收藏
页码:19 / 44
页数:26
相关论文
共 50 条
  • [1] Chart classification: a survey and benchmarking of different state-of-the-art methods
    Jennil Thiyam
    Sanasam Ranbir Singh
    Prabin Kumar Bora
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 19 - 44
  • [2] Benchmarking state-of-the-art classification algorithms for credit scoring
    Baesens, B
    Van Gestel, T
    Viaene, S
    Stepanova, M
    Suykens, J
    Vanthienen, J
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) : 627 - 635
  • [3] Texture Classification: State-of-the-art Methods and Prospects
    [J]. Liu, Li (liuli_nudt@nudt.edu.cn), 2018, Science Press (44):
  • [4] Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research
    Lessmann, Stefan
    Baesens, Bart
    Seow, Hsin-Vonn
    Thomas, Lyn C.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 247 (01) : 124 - 136
  • [5] A survey of state-of-the-art methods for securing medical databases
    Kelarev, Andrei V.
    Yi, Xun
    Cui, Hui
    Rylands, Leanne
    Jelinek, Herbert F.
    [J]. AIMS MEDICAL SCIENCE, 2018, 5 (01): : 1 - 22
  • [6] Benchmarking State-of-the-art DIRECT-type Methods on the BBOB Noiseless Testbed
    Kudela, Jakub
    [J]. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 1620 - 1627
  • [7] Ensemble Multisensor Data Using State-of-the-Art Classification Methods
    Twala, Bhekisipho
    Mekuria, Fisseha
    [J]. AFRICON, 2013, 2013, : 1255 - 1260
  • [8] Benchmarking state-of-the-art symbolic regression algorithms
    Jan Žegklitz
    Petr Pošík
    [J]. Genetic Programming and Evolvable Machines, 2021, 22 : 5 - 33
  • [9] Benchmarking state-of-the-art symbolic regression algorithms
    Zegklitz, Jan
    Posik, Petr
    [J]. GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2021, 22 (01) : 5 - 33
  • [10] Learning path personalization and recommendation methods: A survey of the state-of-the-art
    Nabizadeh, Amir Hossein
    Leal, Jose Paulo
    Rafsanjani, Hamed N.
    Shah, Rajiv Ratn
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159