CNN based spatial classification features for clustering offline handwritten mathematical expressions

被引:25
|
作者
Cuong Tuan Nguyen [1 ]
Vu Tran Minh Khuong [1 ]
Hung Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
Clustering images; Offline handwritten; Mathematical expression; CNN; Weakly supervised learning;
D O I
10.1016/j.patrec.2019.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To help human markers mark a large number of answers of handwritten mathematical expressions (HMEs), clustering them makes marking more efficient and reliable. Clustering HMEs, however, faces the problem of extracting both localization and classification representation of mathematical symbols for an HME image and defining the distance between two HME images. First, we propose a method based on Convolutional Neural Networks (CNN) to extract the representations for an HME. Symbols in various scales are located and classified by a combination of features from a multi-scale CNN. We use weakly supervised training combined with symbols attention to enhance localization and classification predictions. Second, we propose a multi-level spatial distance between two representations for clustering HMEs. Experiments on CROHME 2016 and CROHME 2019 dataset show the promising results of 0.99 and 0.96 in purity, respectively. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [31] Subexpression and Dominant Symbol Histograms for Spatial Relation Classification in Mathematical Expressions
    Julca-Aguilar, Frank
    Hirata, Nina S. T.
    Mouchere, Harold
    Viard-Gaudin, Christian
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3446 - 3451
  • [32] CNN Classification Based on Global and Local Features
    Zheng, Yufeng
    Huang, Jun
    Chen, Tianwen
    Ou, Yang
    Zhou, Wu
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2019, 2019, 10996
  • [33] A New Design Based-SVM of the CNN Classifier Architecture with Dropout for Offline Arabic Handwritten Recognition
    Elleuch, Mohamed
    Maalej, Rania
    Kherallah, Monji
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 1712 - 1723
  • [34] Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition
    Wang, Zi-Rui
    Du, Jun
    Wang, Jia-Ming
    PATTERN RECOGNITION, 2020, 100
  • [35] A novel methodology for offline English handwritten character recognition using ELBP-based sequential (CNN)
    Humayun, Muniba
    Siddiqi, Raheel
    Uddin, Mueen
    Kandhro, Irfan Ali
    Abdelhaq, Maha
    Alsaqour, Raed
    Neural Computing and Applications, 2024, 36 (30) : 19139 - 19156
  • [36] Recognition of Offline Handwritten Chinese Characters of Amount in Words Based on Integrated Features and HMM
    Liu, Caifeng
    Tian, Xuedong
    Yang, Fang
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2639 - 2642
  • [37] Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition
    Yang, Chen
    Du, Jun
    Zhang, Jianshu
    Wu, Changjie
    Chen, Mingjun
    Wu, JiaJia
    PATTERN RECOGNITION, 2022, 132
  • [38] Hyperspectral image classification using CNN with spectral and spatial features integration
    Vaddi, Radhesyam
    Manoharan, Prabukumar
    INFRARED PHYSICS & TECHNOLOGY, 2020, 107 (107)
  • [39] Eigen Value Based Features for Offline Handwritten Signature Verification Using Neural Network Approach
    Jagtap, Amruta B.
    Hegadi, Ravindra S.
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 39 - 48
  • [40] Mutual Learning Offline Handwritten Mathematical Expression Recognition Based on Multi-Scale Feature Fusion
    Fu P.
    Xu Y.
    Yang H.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (02): : 23 - 31