A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem

被引:7
|
作者
Guha, Ritam [1 ]
Ghosh, Manosij [1 ]
Singh, Pawan Kumar [2 ]
Sarkar, Ram [1 ]
Nasipuri, Mita [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, W Bengal, India
[2] Jadavpur Univ, Dept Informat Technol, Kolkata 700032, W Bengal, India
关键词
Feature selection; Hybrid Swarm and Gravitation-based Feature Selection; Particle swarm optimization; Gravitational search algorithm; Handwritten script classification; Indic script; IDENTIFICATION; OPTIMIZATION; SEARCH;
D O I
10.1007/s40747-020-00237-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In any multi-script environment, handwritten script classification is an unavoidable pre-requisite before the document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, this complex pattern classification problem has been solved by researchers proposing various feature vectors mostly having large dimensions, thereby increasing the computation complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step to reduce the size of the feature vectors by restricting them only to the essential and relevant features. In the present work, we have addressed this issue by introducing a new FS algorithm, called Hybrid Swarm and Gravitation-based FS (HSGFS). This algorithm has been applied over three feature vectors introduced in the literature recently-Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG), and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers, namely, Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), are used to evaluate the optimal subset of features generated by the proposed FS model. Handwritten datasets at block, text line, and word level, consisting of officially recognized 12 Indic scripts, are prepared for experimentation. An average improvement in the range of 2-5% is achieved in the classification accuracy by utilizing only about 75-80% of the original feature vectors on all three datasets. The proposed method also shows better performance when compared to some popularly used FS models. The codes used for implementing HSGFS can be found in the following Github link: https://github.com/Ritam-Guha/HSGFS.
引用
收藏
页码:823 / 839
页数:17
相关论文
共 50 条
  • [1] A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem
    Ritam Guha
    Manosij Ghosh
    Pawan Kumar Singh
    Ram Sarkar
    Mita Nasipuri
    [J]. Complex & Intelligent Systems, 2021, 7 : 823 - 839
  • [2] A clustering-based feature selection framework for handwritten Indic script classification
    Chatterjee, Iman
    Ghosh, Manosij
    Sing, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    [J]. EXPERT SYSTEMS, 2019, 36 (06)
  • [3] Recognition of handwritten indic script using clonal selection algorithm
    Garain, Utpal
    Chakraborty, Mangal P.
    Dasgupta, Dipankar
    [J]. ARTIFICIAL IMMUNE SYSTEMS, PROCEEDINGS, 2006, 4163 : 256 - 266
  • [4] A Hybrid Algorithm for Feature Selection and Classification
    Sathish, B. R.
    Senthilkumar, Radha
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (03): : 593 - 602
  • [5] On the gravitation-based classification: A novel algorithm using equilibrium points for enhanced learning and dimensionality reduction
    Monemizadeh, Mostafa
    Hashemi, Seyed Rouhollah Samareh
    Monemizadeh, Morteza
    [J]. EXPERT SYSTEMS, 2024,
  • [6] Visual Analytic-Based Technique for Handwritten Indic Script Identification-A Greedy Heuristic Feature Fusion Framework
    Obaidullah, Sk Md
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2015, 2016, 404 : 211 - 219
  • [7] Zone based Feature Extraction Algorithm for Handwritten Numeral Recognition of Kannada Script
    Rajashekararadhya, S. V.
    Ranjan, P. Vanaja
    [J]. 2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 525 - 528
  • [8] A hybrid feature selection algorithm for the QSAR problem
    Viorel Craciun, Marian
    Cocu, Adina
    Dumitriu, Luminita
    Segal, Cristina
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 1, PROCEEDINGS, 2006, 3991 : 172 - 178
  • [9] Hybrid Particle Swarm Optimization Feature Selection for Crime Classification
    Anuar, Syahid
    Selamat, Ali
    Sallehuddin, Roselina
    [J]. NEW TRENDS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2015, 598 : 101 - 110
  • [10] A Feature Selection Method Based on Hybrid Dung Beetle Optimization Algorithm and Slap Swarm Algorithm
    Liu, Wei
    Ren, Tengteng
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 2979 - 3000