Fast End-to-End Deep Learning Identity Document Detection, Classification and Cropping

被引:3
|
作者
Chiron, Guillaume [1 ]
Arrestier, Florian [1 ]
Awal, Ahmad Montaser [1 ]
机构
[1] Res Dept AriadNEXT, Cesson Sevigne, France
关键词
ID documents segmentation; Deeplearning; Classification;
D O I
10.1007/978-3-030-86337-1_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing use of Know Your Customer online services generates a massive flow of dematerialised personal Identity Documents under variable capturing conditions and qualities (e.g. webcam, smartphone, scan, or even handcrafted pdfs). IDs are designed, depending on their issuing country/model, with a specific layout (i.e. background, photo(s), fixed/variable text fields) along with various anti-fraud features (e.g. checksums, Optical Variable Devices) which are non-trivial to analyse. This paper tackles the problem of detecting, classifying, and aligning captured documents onto their reference model. This task is essential in the process of document reading and fraud verification. However, due to the high variation of capture conditions and models' layout, classical handcrafted approaches require deep knowledge of documents and hence are hard to maintain. A modular approach using a fully multi-stage deep learning based approach is proposed in this work. The proposed approach allows to accurately classify the document and estimates its quadrilateral (localization). As opposed to approaches relying on a single end-to-end network, the proposed modular framework offers more flexibility and a potential for future incremental learning. All networks used in this work are derivatives of recent state-of-the-art ones. Experiments show the superiority of the proposed approach in terms of speed while maintaining good accuracy, both on the MIDV-500 academic dataset and on an industrial based dataset compared to hand crafted solutions.
引用
收藏
页码:333 / 347
页数:15
相关论文
共 50 条
  • [21] DeepQCD: An end-to-end deep learning approach to quickest change detection
    Kurt, Mehmet Necip
    Zheng, Jiaohao
    Yilmaz, Yasin
    Wang, Xiaodong
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (18):
  • [22] DeepOrigin: End-to-End Deep Learning for Detection of New Malware Families
    Cordonsky, Ilay
    Rosenberg, Ishai
    Sicard, Guillaume
    David, Eli
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [23] An End-to-End Deep Learning Framework for Fault Detection in Marine Machinery
    Rigas, Spyros
    Tzouveli, Paraskevi
    Kollias, Stefanos
    [J]. SENSORS, 2024, 24 (16)
  • [24] FluentNet: End-to-End Detection of Stuttered Speech Disfluencies With Deep Learning
    Kourkounakis, Tedd
    Hajavi, Amirhossein
    Etemad, Ali
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2986 - 2999
  • [25] AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
    Thanh-Toan Do
    Anh Nguyen
    Reid, Ian
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 5882 - 5889
  • [26] End-to-end deep learning classification of vocal pathology using stacked vowels
    Liu, George S.
    Hodges, Jordan M.
    Yu, Jingzhi
    Sung, C. Kwang
    Erickson-DiRenzo, Elizabeth
    Doyle, Philip C.
    [J]. LARYNGOSCOPE INVESTIGATIVE OTOLARYNGOLOGY, 2023, 8 (05): : 1312 - 1318
  • [27] Skin Lesion Primary Morphology Classification With End-To-End Deep Learning Network
    Polevaya, Tatyana
    Ravodin, Roman
    Filchenkov, Andrey
    [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 247 - 250
  • [28] End-to-end encrypted network traffic classification method based on deep learning
    Tian Shiming
    Gong Feixiang
    Mo Shuang
    Li Meng
    Wu Wenrui
    Xiao Ding
    [J]. The Journal of China Universities of Posts and Telecommunications, 2020, 27 (03) : 21 - 30
  • [29] An End-to-End Deep Learning Architecture for Classification of Malware's Binary Content
    Gibert, Daniel
    Mateu, Carles
    Planes, Jordi
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 383 - 391
  • [30] End-to-End Soccer Video Scene and Event Classification with Deep Transfer Learning
    Hong, Yuxi
    Ling, Chen
    Ye, Zuochang
    [J]. 2018 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV2018), 2018,