LDRNet: Enabling Real-Time Document Localization on Mobile Devices

被引:0
|
作者
Wu, Han [1 ]
Qian, Holland [2 ]
Wu, Huaming [3 ]
van Moorsel, Aad [4 ]
机构
[1] Newcastle Univ, Newcastle Upon Tyne, Tyne & Wear, England
[2] Tencent, Shenzhen, Peoples R China
[3] Tianjin Univ, Tianjin, Peoples R China
[4] Univ Birmingham, Birmingham, W Midlands, England
关键词
Document localization; Real time; Mobile devices;
D O I
10.1007/978-3-031-23618-1_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern online services often require mobile devices to convert paper-based information into its digital counterpart, e.g., passport, ownership documents, etc. This process relies on Document Localization (DL) technology to detect the outline of a document within a photograph. In recent years, increased demand for real-time DL in live video has emerged, especially in financial services. However, existing machinelearning approaches to DL cannot be easily applied due to the large size of the underlying models and the associated long inference time. In this paper, we propose a lightweight DL model, LDRNet, to localize documents in real-time video captured on mobile devices. On the basis of a lightweight backbone neural network, we design three prediction branches for LDRNet: (1) corner points prediction; (2) line borders prediction and (3) document classification. To improve the accuracy, we design novel supplementary targets, the equal-division points, and use a new loss function named Line Loss. We compare the performance of LDRNet with other popular approaches on localization for general documents in a number of datasets. The experimental results show that LDRNet takes significantly less inference time, while still achieving comparable accuracy.
引用
收藏
页码:618 / 629
页数:12
相关论文
共 50 条
  • [31] Enabling real-time analytics
    Gonzales, Michael L.
    DB2 Magazine, 2006, 11 (03): : 21 - 22
  • [32] Enabling the real-time enterprise
    Skeen, D
    BYTE, 1998, 23 (01): : 51 - 52
  • [33] A Framework for Real-Time Localization in Constrained Devices Connected to the IoT
    Nnaemeka, Asogwa Emmanuel
    Macharia, Ngari Crisphine
    Bajpai, Ambar
    Telagam, Nagarjuna
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [34] Real-time Document Localization in Natural Images by Recursive Application of a CNN
    Javed, Khurram
    Shafait, Faisal
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 105 - 110
  • [35] A Real-Time GNSS/PDR Navigation System for Mobile Devices
    Basso, Michele
    Martinelli, Alessio
    Morosi, Simone
    Sera, Fabrizio
    REMOTE SENSING, 2021, 13 (08)
  • [36] Real-time image processing for augmented reality on mobile devices
    Zhihan Lv
    Jaime Lloret
    Houbing Song
    Journal of Real-Time Image Processing, 2021, 18 : 245 - 248
  • [37] Real-Time Scoring of an Oral Reading Assessment on Mobile Devices
    Cheng, Jian
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1621 - 1625
  • [38] Real-time Interactive Learning Through Smart Mobile Devices
    Fok, Wilton
    Lam, Jacqueline
    2012 SIXTH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC), 2012,
  • [39] Representing scenes for real-time context classification on mobile devices
    Farinella, G. M.
    Ravi, D.
    Tomaselli, V.
    Guarnera, M.
    Battiato, S.
    PATTERN RECOGNITION, 2015, 48 (04) : 1086 - 1100
  • [40] Pelee: A Real-Time Object Detection System on Mobile Devices
    Wang, Robert J.
    Li, Xiang
    Ling, Charles X.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31