Mobile Video Capture of Multi-page Documents

Cited by: 7
Authors
Kumar, Jayant [1]
Bala, Raja [2]
Ding, Hengzhou [2]
Emmett, Phillip [2]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Xerox Res Ctr, Webster, NY USA
DOI
10.1109/CVPRW.2013.10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a mobile application for capturing images of printed multi-page documents with a smartphone camera. With currently available document capture applications, the user has to carefully photograph each page individually and assemble the photographs into a document, which makes for a cumbersome and time-consuming experience. We propose a novel approach that uses video to capture multi-page documents. Our algorithm automatically selects, from the video, the best still image for each page of the document by combining video motion analysis, inertial sensor signals, and an image quality (IQ) prediction technique. For the latter, we extend a previous no-reference IQ prediction algorithm to suit the needs of our video application. The algorithm has been implemented on an iPhone 4S. Individual pages are successfully extracted for a wide variety of multi-page documents. OCR analysis shows that the quality of document images produced by our app is comparable to that of standard still captures. At the same time, user studies confirm that in the majority of trials, video capture provides an experience that is faster and more convenient than multiple still captures.
Pages: 35-40
Page count: 6