LDRNet: Enabling Real-Time Document Localization on Mobile Devices

Cited by: 0
Authors
Wu, Han [1]
Qian, Holland [2]
Wu, Huaming [3]
van Moorsel, Aad [4]
Affiliations
[1] Newcastle Univ, Newcastle Upon Tyne, Tyne & Wear, England
[2] Tencent, Shenzhen, Peoples R China
[3] Tianjin Univ, Tianjin, Peoples R China
[4] Univ Birmingham, Birmingham, W Midlands, England
Keywords
Document localization; Real time; Mobile devices;
DOI
10.1007/978-3-031-23618-1_42
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Modern online services often require mobile devices to convert paper-based information, e.g., passports and ownership documents, into its digital counterpart. This process relies on Document Localization (DL) technology to detect the outline of a document within a photograph. In recent years, demand for real-time DL in live video has grown, especially in financial services. However, existing machine-learning approaches to DL cannot be easily applied due to the large size of the underlying models and the associated long inference time. In this paper, we propose a lightweight DL model, LDRNet, to localize documents in real-time video captured on mobile devices. On the basis of a lightweight backbone neural network, we design three prediction branches for LDRNet: (1) corner point prediction; (2) line border prediction; and (3) document classification. To improve the accuracy, we design novel supplementary targets, the equal-division points, and use a new loss function named Line Loss. We compare the performance of LDRNet with other popular approaches to localization of general documents on a number of datasets. The experimental results show that LDRNet takes significantly less inference time while still achieving comparable accuracy.
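The abstract names equal-division points as supplementary targets but does not define them in this record. As a rough illustration only, the sketch below assumes they are the interior points that split each border between two adjacent corners into equal parts. The function names `equal_division_points` and `border_targets` and the parameter `n_parts` are hypothetical, and nothing here reproduces the paper's Line Loss.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def equal_division_points(a: Point, b: Point, n_parts: int) -> List[Point]:
    """Return the n_parts - 1 interior points that split segment a-b into equal parts.

    Assumption: this is one plausible reading of "equal-division points";
    the record does not give the paper's exact definition.
    """
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    return [
        (a[0] + (b[0] - a[0]) * k / n_parts, a[1] + (b[1] - a[1]) * k / n_parts)
        for k in range(1, n_parts)
    ]


def border_targets(corners: List[Point], n_parts: int) -> List[List[Point]]:
    """For each border of a polygon given by ordered corners, list its division points."""
    # Pair every corner with the next one, wrapping around to close the outline.
    edges = zip(corners, corners[1:] + corners[:1])
    return [equal_division_points(a, b, n_parts) for a, b in edges]


# Hypothetical document outline: an axis-aligned 4 x 8 rectangle.
corners = [(0.0, 0.0), (4.0, 0.0), (4.0, 8.0), (0.0, 8.0)]
targets = border_targets(corners, n_parts=4)
```

With `n_parts=4`, each of the four borders contributes three extra points, so a model trained this way would regress 12 border points in addition to the 4 corners.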
Pages: 618-629
Page count: 12