An Improved Formula Extraction Method of Printed Chinese Layouts Based on Connected Component Run-length Feature

Cited by: 0
Authors
Yang, Fang [1]
Hou, Chunning [1]
Tian, Xuedong [1]
Affiliation
[1] Hebei Univ, Sch Comp Sci & Technol, Baoding, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
formula image; Chinese; formula location; connected component; run-length;
DOI
10.1109/ICVISP.2017.28
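Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;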
Abstract
Mathematical formula extraction is a prerequisite for formula structure analysis, recognition, and retrieval. This paper studies formula extraction from images of printed Chinese scientific and technical documents. It proposes a criterion based on the connected component run-length feature for judging whether a text line contains a formula, and uses that criterion to improve a rule-based formula location method. First, the regularity with which the connected component run-length changes is analyzed across all symbols in a text line. Then a change-rate threshold is set to decide whether the line contains a formula. Finally, the improved formula extraction method is presented. Experimental results on samples collected from printed Chinese scientific and technical documents show that the proposed method is effective at detecting embedded formulae and improves the accuracy of formula location.
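The abstract does not define the run-length feature, the change rate, or the threshold value, so the following Python sketch only illustrates the general idea under stated assumptions and is not the authors' method. It assumes the text line is a binary image (1 = ink), takes the mean horizontal run length of each 8-connected component as the feature, and uses the relative difference between neighbouring components as the change rate. The function names and the default threshold of 0.5 are hypothetical.

```python
from collections import deque


def connected_components(img):
    """Collect 8-connected foreground (1) pixels; returns a list of pixel lists."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            seen[y][x] = True
            queue, pixels = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            comps.append(pixels)
    return comps


def mean_run_length(pixels):
    """Mean length of the horizontal runs of ink inside one component (assumed feature)."""
    rows = {}
    for y, x in pixels:
        rows.setdefault(y, []).append(x)
    runs = []
    for xs in rows.values():
        xs.sort()
        length = 1
        for prev, cur in zip(xs, xs[1:]):
            if cur == prev + 1:
                length += 1
            else:
                runs.append(length)
                length = 1
        runs.append(length)
    return sum(runs) / len(runs)


def line_has_formula(img, threshold=0.5):
    """Flag a line when the feature changes sharply between adjacent components.

    The change-rate definition and the threshold are assumptions for illustration.
    """
    comps = connected_components(img)
    comps.sort(key=lambda p: min(x for _, x in p))  # left-to-right reading order
    feats = [mean_run_length(p) for p in comps]
    rates = [abs(b - a) / max(a, b) for a, b in zip(feats, feats[1:])]
    return any(r > threshold for r in rates), rates
```

Calling `line_has_formula(line_image)` returns a flag together with the list of change rates between adjacent components, so the threshold can be tuned against labelled text lines. A practical system would more likely obtain the components from an image-processing library than from the pure-Python labelling used here.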
Pages: 114-117
Page count: 4