Document Image Rectification in Complex Scene Using Stacked Siamese Networks

被引：3

作者：

Xu, Zhen ^{[1
,2
]}

Yin, Fei ^{[2
,3
]}

Yang, Peipei ^{[2
,3
]}

Liu, Cheng-Lin ^{[2
,3
]}

机构：

[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

[2] Inst Automat Chinese Acad Sci, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

来源：

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2022年

基金：

中国国家自然科学基金;

关键词：

3D RECONSTRUCTION; SHAPE;

D O I：

10.1109/ICPR56361.2022.9956331

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the popularity of digital cameras and smartphones, capturing document images of physical documents for electronic storage has become popular, but the captured document images suffer various deformations. Document image rectification has been studied intensively, but existing methods do not perform sufficiently for document images captured in complex scenes due to the various environmental factors. In this paper, we propose an end-to-end rectification model by stacking 3D and 2D Siamese networks. Three regularization terms are used to enforce 3D reconstruction consistency and 2D texture consistency, respectively. Experimental results on real world datasets demonstrate that the three regularization terms with Siamese networks can significantly improve the rectification performance, and our method performs superiorly compared to state-of-the-art methods.

引用

页码：1550 / 1556

页数：7

共 50 条

[31] Heat flux distribution and rectification of complex networks
Liu, Zonghua
Wu, Xiang
Yang, Huijie
Gupte, Neelima
Li, Baowen
NEW JOURNAL OF PHYSICS, 2010, 12
[32] Distance Transform Based Active Contour Approach for Document Image Rectification
Salvi, Dhaval
Zheng, Kang
Zhou, Youjie
Wang, Song
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 757 - 764
[33] Am I readable? Transfer learning based document image rectification
Kumari, Pooja
Das, Sukhendu
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (03) : 433 - 446
[34] Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks
Das, Arindam
Roy, Saikat
Bhattacharya, Ujjwal
Parui, Swapan K.
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3180 - 3185
[35] Multi-view document rectification using boundary
Tsoi, Yau-Chat
Brown, Michael S.
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2158 - +
[36] The Rectification of Document Images using Text-features
Sun, Ringming
Li, Nannan
Wang, Shengfa
Ji, Lin
Wang, Zhenyu
2017 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2017), 2017, : 223 - 228
[37] METEOSAT IMAGE RECTIFICATION USING LANDMARKS
MAURY, A
ALENGRIN, G
BLANCKE, B
RECHERCHE AEROSPATIALE, 1993, (06): : 67 - 81
[38] Stacked generative adversarial networks for image compositing
Bing Yu
Youdong Ding
Zhifeng Xie
Dongjin Huang
EURASIP Journal on Image and Video Processing, 2021
[39] Stacked generative adversarial networks for image compositing
Yu, Bing
Ding, Youdong
Xie, Zhifeng
Huang, Dongjin
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2021, 2021 (01)
[40] Stacked Attention Networks for Image Question Answering
Yang, Zichao
He, Xiaodong
Gao, Jianfeng
Deng, Li
Smola, Alex
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 21 - 29

← 1 2 3 4 5 →