CNN based spatial classification features for clustering offline handwritten mathematical expressions

被引:25
|
作者
Cuong Tuan Nguyen [1 ]
Vu Tran Minh Khuong [1 ]
Hung Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
Clustering images; Offline handwritten; Mathematical expression; CNN; Weakly supervised learning;
D O I
10.1016/j.patrec.2019.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To help human markers mark a large number of answers of handwritten mathematical expressions (HMEs), clustering them makes marking more efficient and reliable. Clustering HMEs, however, faces the problem of extracting both localization and classification representation of mathematical symbols for an HME image and defining the distance between two HME images. First, we propose a method based on Convolutional Neural Networks (CNN) to extract the representations for an HME. Symbols in various scales are located and classified by a combination of features from a multi-scale CNN. We use weakly supervised training combined with symbols attention to enhance localization and classification predictions. Second, we propose a multi-level spatial distance between two representations for clustering HMEs. Experiments on CROHME 2016 and CROHME 2019 dataset show the promising results of 0.99 and 0.96 in purity, respectively. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [41] A clustering-based feature selection framework for handwritten Indic script classification
    Chatterjee, Iman
    Ghosh, Manosij
    Sing, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    EXPERT SYSTEMS, 2019, 36 (06)
  • [42] Geometrical features extraction and KNN based Classification of Handwritten Marathi characters
    Kamble, Parshuram M.
    Hegadi, Ravindra S.
    2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 219 - 222
  • [43] Classification of Mammograms Using Texture and CNN Based Extracted Features
    Debelee, Taye Girma
    Gebreselasie, Abrham
    Schwenker, Friedhelm
    Aminan, Mohammadreza
    Yohannes, Dereje
    JOURNAL OF BIOMIMETICS BIOMATERIALS AND BIOMEDICAL ENGINEERING, 2019, 42 : 79 - 97
  • [44] Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN-RNN networks
    Geetha, R.
    Thilagam, T.
    Padmavathy, T.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17): : 10923 - 10934
  • [45] Stacked Features Based CNN for Rotation Invariant Digit Classification
    Jain, Ayushi
    Subrahmanyam, Gorthi R. K. Sai
    Mishra, Deepak
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 527 - 533
  • [46] CNN Based Features Extraction for Age Estimation and Gender Classification
    Benkaddour, Mohammed Kamel
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (05): : 697 - 703
  • [47] CNN based features extraction for age estimation and gender classification
    Benkaddour M.K.
    Informatica (Slovenia), 2021, 45 (05): : 697 - 703
  • [48] CNN-based features for retrieval and classification of food images
    Ciocca, Gianluigi
    Napoletano, Paolo
    Schettini, Raimondo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 176 : 70 - 77
  • [49] Architectural style classification based on CNN and channel–spatial attention
    Bo Wang
    Sulan Zhang
    Jifu Zhang
    Zhenjiao Cai
    Signal, Image and Video Processing, 2023, 17 : 99 - 107
  • [50] CNN based hyperspectral image classification using unsupervised band selection and structure-preserving spatial features
    Vaddi, Radhesyam
    Manoharan, Prabukumar
    INFRARED PHYSICS & TECHNOLOGY, 2020, 110