Layout and Perspective Distortion Independent Recognition of Captured Chinese Document Image

被引:3
|
作者
Wang, Yanwei [1 ]
Sun, Yuefang [1 ]
Liu, Changsong [1 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Dept Elect Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
layout independent text line extraction; perspective distortion correction; captured document image recognition system; TEXT DETECTION;
D O I
10.1109/ICDAR.2017.102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduced a layout and perspective distortion independent recognition framework for captured Chinese document image. Under the framework, 1) Conditional random field (CRF) is employed for text line extraction from a global point of view. As the text line extraction is layout independent it could be widely used in different type of document images 2) A text line image based perspective distortion correction method is detailed and used in three different ways. 3) The text line extraction and perspective distortion correction are combined with character recognition to construct a recognition system. On three captured document image datasets, the proposed framework improves the accuracies from 94.03% to 95.20%, 13.01% to 93.71% and 10.63% to 92.68% respectively for different distortion degrees. The experimental results demonstrate that the introduced recognition framework is promising for solving layout and perspective distortion problems in captured document image recognition.
引用
收藏
页码:591 / 596
页数:6
相关论文
共 50 条
  • [1] Camera-captured document image perspective distortion correction using vanishing point detection based on Radon Transform
    Takezawa, Yusuke
    Hasegawa, Makoto
    Tabbone, Salvatore
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3968 - 3974
  • [2] Farsi document image recognition system using word layout signature
    Ergun, Cem
    Norozpour, Sajedeh
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1477 - 1488
  • [3] DOCUMENT IMAGE SEGMENTATION AND LAYOUT ANALYSIS
    SAITOH, T
    YAMAAI, T
    TACHIKAWA, M
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (07) : 778 - 784
  • [4] MC-OCR Challenge: Mobile-Captured Image Document Recognition for Vietnamese Receipts
    Xuan-Son Vu
    Quang-Anh Bui
    Nhu-Van Nguyen
    Thi Tuyet Hai Nguyen
    Thanh Vu
    [J]. 2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 88 - 93
  • [5] Progressive classification scheme for document layout recognition
    Minguillón, J
    Pujol, J
    Zeger, K
    [J]. MATHEMATICAL MODELING, BAYESIAN ESTIMATION, AND INVERSE PROBLEMS, 1999, 3816 : 241 - 251
  • [6] DOCUMENT RECOGNITION SYSTEM WITH LAYOUT STRUCTURE GENERATOR
    MIZUNO, M
    TSUJI, Y
    TANAKA, T
    TANAKA, H
    IWASHITA, M
    TEMMA, T
    [J]. NEC RESEARCH & DEVELOPMENT, 1991, 32 (03): : 430 - 437
  • [7] Document recognition system with layout structure generator
    Mizuno, Masaaki
    Tsuji, Yoshitake
    Tanaka, Toshiyuki
    Tanaka, Haruhiko
    Iwashita, Masao
    Temma, Tsutomu
    [J]. NEC Research and Development, 1991, 32 (03): : 430 - 437
  • [8] Parameter-independent geometric document layout analysis
    Ryu, DS
    Kang, SM
    Lee, SW
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS: APPLICATIONS, ROBOTICS SYSTEMS AND ARCHITECTURES, 2000, : 397 - 400
  • [9] Comparative Semantic Document Layout Analysis for Enhanced Document Image Retrieval
    Jaha, Emad Sami
    [J]. IEEE Access, 2024, 12 : 150451 - 150467
  • [10] Robust perspective rectification of camera-captured document images
    Takezawa, Yusuke
    Hasegawa, Makoto
    Tabbone, Salvatore
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 6, 2017, : 27 - 32