Mobile Video Capture of Multi-page Documents

被引:7
|
作者
Kumar, Jayant [1 ]
Bala, Raja [2 ]
Ding, Hengzhou [2 ]
Emmett, Phillip [2 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Xerox Res Ctr, Webster, NY USA
关键词
D O I
10.1109/CVPRW.2013.10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a mobile application for capturing images of printed multi-page documents with a smartphone camera. With today's available document capture applications, the user has to carefully capture individual photographs of each page and assemble them into a document, leading to a cumbersome and time consuming user experience. We propose a novel approach of using video to capture multipage documents. Our algorithm automatically selects the best still images corresponding to individual pages of the document from the video. The technique combines video motion analysis, inertial sensor signals, and an image quality (IQ) prediction technique to select the best page images from the video. For the latter, we extend a previous no-reference IQ prediction algorithm to suit the needs of our video application. The algorithm has been implemented on an iPhone 4S. Individual pages are successfully extracted for a wide variety of multi-page documents. OCR analysis shows that the quality of document images produced by our app is comparable to that of standard still captures. At the same time, user studies confirm that in the majority of trials, video capture provides an experience that is faster and more convenient than multiple still captures.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [21] Non-dominated sorting based multi-page photo collage
    Yu Song
    Fan Tang
    Weiming Dong
    Changsheng Xu
    ComputationalVisualMedia, 2022, 8 (02) : 199 - 212
  • [22] Coldmap: Extending SSD Lifetime Exploiting Multi-Page Mapping Information
    Seo, Jaewon
    Koo, Gunjae
    2024 13TH NON-VOLATILE MEMORY SYSTEMS AND APPLICATIONS SYMPOSIUM, NVMSA 2024, 2024, : 13 - 18
  • [23] An Approach to the Segmentation of Multi-page Document Flow Using Binary Classification
    Agin, Onur
    Ulas, Cagdas
    Ahat, Mehmet
    Bekar, Can
    SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
  • [24] Non-dominated sorting based multi-page photo collage
    Yu Song
    Fan Tang
    Weiming Dong
    Changsheng Xu
    Computational Visual Media, 2022, 8 : 199 - 212
  • [25] Non-dominated sorting based multi-page photo collage
    Song, Yu
    Tang, Fan
    Dong, Weiming
    Xu, Changsheng
    COMPUTATIONAL VISUAL MEDIA, 2022, 8 (02) : 199 - 212
  • [27] XWRAPComposer: A multi-page data extraction service for bio-computing applications
    Liu, L
    Zhang, JJ
    Han, W
    Pu, C
    Caverlee, J
    Park, S
    Critchlow, T
    Coleman, M
    Buttler, D
    2005 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, VOL 1, PROCEEDINGS, 2005, : 271 - 278
  • [28] DYNAMIC HIERARCHICAL DICTIONARY DESIGN FOR MULTI-PAGE BINARY DOCUMENT IMAGE COMPRESSION
    Guo, Yandong
    Depalov, Dejan
    Bauer, Peter
    Bradburn, Brent
    Allebach, Jan P.
    Bouman, Charles A.
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2294 - 2298
  • [29] Point feature label placement for multi-page maps on small-screen devices
    Gedicke, Sven
    Jabrayilov, Adalat
    Niedermann, Benjamin
    Mutzel, Petra
    Haunert, Jan-Henrik
    COMPUTERS & GRAPHICS-UK, 2021, 100 : 66 - 78
  • [30] A multi-page cell architecture for high-speed programming multi-level NAND flash memories
    Takeuchi, K
    Tanaka, T
    Tanzawa, T
    1997 SYMPOSIUM ON VLSI CIRCUITS: DIGEST OF TECHNICAL PAPERS, 1997, : 67 - 68