Document Image Rectification in Complex Scene Using Stacked Siamese Networks

被引:3
|
作者
Xu, Zhen [1 ,2 ]
Yin, Fei [2 ,3 ]
Yang, Peipei [2 ,3 ]
Liu, Cheng-Lin [2 ,3 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[2] Inst Automat Chinese Acad Sci, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
3D RECONSTRUCTION; SHAPE;
D O I
10.1109/ICPR56361.2022.9956331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the popularity of digital cameras and smartphones, capturing document images of physical documents for electronic storage has become popular, but the captured document images suffer various deformations. Document image rectification has been studied intensively, but existing methods do not perform sufficiently for document images captured in complex scenes due to the various environmental factors. In this paper, we propose an end-to-end rectification model by stacking 3D and 2D Siamese networks. Three regularization terms are used to enforce 3D reconstruction consistency and 2D texture consistency, respectively. Experimental results on real world datasets demonstrate that the three regularization terms with Siamese networks can significantly improve the rectification performance, and our method performs superiorly compared to state-of-the-art methods.
引用
收藏
页码:1550 / 1556
页数:7
相关论文
共 50 条
  • [1] Curved Document Image Rectification
    Dhanalakshmy, Dhanya M.
    Menon, Hema P.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 783 - 786
  • [2] Using Siamese capsule networks for remote sensing scene classification
    Zhou, Song
    Zhou, Yong
    Liu, Bing
    REMOTE SENSING LETTERS, 2020, 11 (08) : 757 - 766
  • [3] Snake Image Classification using Siamese Networks
    Abeysinghe, Chamath
    Welivita, Anuradha
    Perera, Indika
    ICGSP '19 - PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING, 2019, : 8 - 12
  • [4] Zero-Shot Sketch-Based Image Retrieval Using StyleGen and Stacked Siamese Neural Networks
    Gopu, Venkata Rama Muni Kumar
    Dunna, Madhavi
    JOURNAL OF IMAGING, 2024, 10 (04)
  • [5] Document image rectification using fuzzy sets and morphological operators
    Lu, SJ
    Chen, BM
    Ko, CC
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2877 - 2880
  • [6] Deep Unrestricted Document Image Rectification
    Feng, Hao
    Liu, Shaokai
    Deng, Jiajun
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6142 - 6154
  • [7] Maximizing steganalysis performance using siamese networks for image
    Fan, Lingyan
    Qiu, Jinxin
    Wang, Zichi
    Wang, Hongbo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76953 - 76962
  • [8] Scene image retrieval with siamese spatial attention pooling
    Ma, Jinyu
    Gu, Xiaodong
    NEUROCOMPUTING, 2020, 412 : 252 - 261
  • [9] Detection of Image Manipulations Using Siamese Convolutional Neural Networks
    Mazumdar, Aniruddha
    Singh, Jaya
    Tomar, Yosha Singh
    Bora, P. K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 226 - 233
  • [10] Geometric Representation Learning for Document Image Rectification
    Feng, Hao
    Zhou, Wengang
    Deng, Jiajun
    Wang, Yuechen
    Li, Houqiang
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 475 - 492