LDRNet: Enabling Real-Time Document Localization on Mobile Devices

被引:0
|
作者
Wu, Han [1 ]
Qian, Holland [2 ]
Wu, Huaming [3 ]
van Moorsel, Aad [4 ]
机构
[1] Newcastle Univ, Newcastle Upon Tyne, Tyne & Wear, England
[2] Tencent, Shenzhen, Peoples R China
[3] Tianjin Univ, Tianjin, Peoples R China
[4] Univ Birmingham, Birmingham, W Midlands, England
关键词
Document localization; Real time; Mobile devices;
D O I
10.1007/978-3-031-23618-1_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern online services often require mobile devices to convert paper-based information into its digital counterpart, e.g., passport, ownership documents, etc. This process relies on Document Localization (DL) technology to detect the outline of a document within a photograph. In recent years, increased demand for real-time DL in live video has emerged, especially in financial services. However, existing machinelearning approaches to DL cannot be easily applied due to the large size of the underlying models and the associated long inference time. In this paper, we propose a lightweight DL model, LDRNet, to localize documents in real-time video captured on mobile devices. On the basis of a lightweight backbone neural network, we design three prediction branches for LDRNet: (1) corner points prediction; (2) line borders prediction and (3) document classification. To improve the accuracy, we design novel supplementary targets, the equal-division points, and use a new loss function named Line Loss. We compare the performance of LDRNet with other popular approaches on localization for general documents in a number of datasets. The experimental results show that LDRNet takes significantly less inference time, while still achieving comparable accuracy.
引用
收藏
页码:618 / 629
页数:12
相关论文
共 50 条
  • [11] Enabling Real-time AI Inference on Mobile Devices via GPU-CPU Collaborative Execution
    Li, Hao
    Ng, Joseph K.
    Abdelzaher, Tarek
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA 2022), 2022, : 195 - 204
  • [12] Real-time indoor staircase detection on mobile devices
    Ciobanu, Andrei
    Morar, Anca
    Moldoveanu, Florica
    Petrescu, Lucian
    Ferche, Oana
    Moldoveanu, Alin
    2017 21ST INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS), 2017, : 287 - 293
  • [13] Smart and real-time image dehazing on mobile devices
    Yucel Cimtay
    Journal of Real-Time Image Processing, 2021, 18 : 2063 - 2072
  • [14] Smart and real-time image dehazing on mobile devices
    Cimtay, Yucel
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (06) : 2063 - 2072
  • [15] Real-time Wireless ECG Biometrics with Mobile Devices
    Derawi, Mohammad
    Voitenko, Iurii
    Endrerud, Pal Erik
    2014 INTERNATIONAL CONFERENCE ON MEDICAL BIOMETRICS (ICMB 2014), 2014, : 151 - 156
  • [16] Real-Time Facial Affective Computing on Mobile Devices
    Guo, Yuanyuan
    Xia, Yifan
    Wang, Jing
    Yu, Hui
    Chen, Rung-Ching
    SENSORS, 2020, 20 (03)
  • [17] HIGH QUALITY REAL-TIME PANORAMA ON MOBILE DEVICES
    Bajpai, Pankaj
    Upadhyay, Akshay
    Jana, Sandeep
    Kim, Jaehyun
    Bandlamudi, Vamsee Kalyan
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [18] Real-time deep hair matting on mobile devices
    Levinshtein, Alex
    Chang, Cheng
    Phung, Edmund
    Kezele, Irina
    Guo, Wenzhangzhi
    Aarabi, Parham
    2018 15TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2018, : 1 - 7
  • [19] Real-time double JPEG forensics for mobile devices
    Aanchal Agarwal
    Abhinav Gupta
    Journal of Real-Time Image Processing, 2022, 19 : 727 - 737
  • [20] Real-Time Head Pose Estimation on Mobile Devices
    Cheng, Zhengxin
    Bai, Fangyu
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT I, 2017, 10116 : 599 - 609