Document Image Rectification in Complex Scene Using Stacked Siamese Networks

被引:3
|
作者
Xu, Zhen [1 ,2 ]
Yin, Fei [2 ,3 ]
Yang, Peipei [2 ,3 ]
Liu, Cheng-Lin [2 ,3 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[2] Inst Automat Chinese Acad Sci, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
3D RECONSTRUCTION; SHAPE;
D O I
10.1109/ICPR56361.2022.9956331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the popularity of digital cameras and smartphones, capturing document images of physical documents for electronic storage has become popular, but the captured document images suffer various deformations. Document image rectification has been studied intensively, but existing methods do not perform sufficiently for document images captured in complex scenes due to the various environmental factors. In this paper, we propose an end-to-end rectification model by stacking 3D and 2D Siamese networks. Three regularization terms are used to enforce 3D reconstruction consistency and 2D texture consistency, respectively. Experimental results on real world datasets demonstrate that the three regularization terms with Siamese networks can significantly improve the rectification performance, and our method performs superiorly compared to state-of-the-art methods.
引用
收藏
页码:1550 / 1556
页数:7
相关论文
共 50 条
  • [31] Heat flux distribution and rectification of complex networks
    Liu, Zonghua
    Wu, Xiang
    Yang, Huijie
    Gupte, Neelima
    Li, Baowen
    NEW JOURNAL OF PHYSICS, 2010, 12
  • [32] Distance Transform Based Active Contour Approach for Document Image Rectification
    Salvi, Dhaval
    Zheng, Kang
    Zhou, Youjie
    Wang, Song
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 757 - 764
  • [33] Am I readable? Transfer learning based document image rectification
    Kumari, Pooja
    Das, Sukhendu
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (03) : 433 - 446
  • [34] Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks
    Das, Arindam
    Roy, Saikat
    Bhattacharya, Ujjwal
    Parui, Swapan K.
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3180 - 3185
  • [35] Multi-view document rectification using boundary
    Tsoi, Yau-Chat
    Brown, Michael S.
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2158 - +
  • [36] The Rectification of Document Images using Text-features
    Sun, Ringming
    Li, Nannan
    Wang, Shengfa
    Ji, Lin
    Wang, Zhenyu
    2017 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2017), 2017, : 223 - 228
  • [37] METEOSAT IMAGE RECTIFICATION USING LANDMARKS
    MAURY, A
    ALENGRIN, G
    BLANCKE, B
    RECHERCHE AEROSPATIALE, 1993, (06): : 67 - 81
  • [38] Stacked generative adversarial networks for image compositing
    Bing Yu
    Youdong Ding
    Zhifeng Xie
    Dongjin Huang
    EURASIP Journal on Image and Video Processing, 2021
  • [39] Stacked generative adversarial networks for image compositing
    Yu, Bing
    Ding, Youdong
    Xie, Zhifeng
    Huang, Dongjin
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2021, 2021 (01)
  • [40] Stacked Attention Networks for Image Question Answering
    Yang, Zichao
    He, Xiaodong
    Gao, Jianfeng
    Deng, Li
    Smola, Alex
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 21 - 29