Fast End-to-End Deep Learning Identity Document Detection, Classification and Cropping

被引:3
|
作者
Chiron, Guillaume [1 ]
Arrestier, Florian [1 ]
Awal, Ahmad Montaser [1 ]
机构
[1] Res Dept AriadNEXT, Cesson Sevigne, France
关键词
ID documents segmentation; Deeplearning; Classification;
D O I
10.1007/978-3-030-86337-1_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing use of Know Your Customer online services generates a massive flow of dematerialised personal Identity Documents under variable capturing conditions and qualities (e.g. webcam, smartphone, scan, or even handcrafted pdfs). IDs are designed, depending on their issuing country/model, with a specific layout (i.e. background, photo(s), fixed/variable text fields) along with various anti-fraud features (e.g. checksums, Optical Variable Devices) which are non-trivial to analyse. This paper tackles the problem of detecting, classifying, and aligning captured documents onto their reference model. This task is essential in the process of document reading and fraud verification. However, due to the high variation of capture conditions and models' layout, classical handcrafted approaches require deep knowledge of documents and hence are hard to maintain. A modular approach using a fully multi-stage deep learning based approach is proposed in this work. The proposed approach allows to accurately classify the document and estimates its quadrilateral (localization). As opposed to approaches relying on a single end-to-end network, the proposed modular framework offers more flexibility and a potential for future incremental learning. All networks used in this work are derivatives of recent state-of-the-art ones. Experiments show the superiority of the proposed approach in terms of speed while maintaining good accuracy, both on the MIDV-500 academic dataset and on an industrial based dataset compared to hand crafted solutions.
引用
收藏
页码:333 / 347
页数:15
相关论文
共 50 条
  • [1] An End-to-End Deep Learning Architecture for Graph Classification
    Zhang, Muhan
    Cui, Zhicheng
    Neumann, Marion
    Chen, Yixin
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4438 - 4445
  • [2] End-to-end Multimodel Deep Learning for Malware Classification
    Snow, Elijah
    Alam, Mahbubul
    Glandon, Alexander
    Iftekharuddin, Khan
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] Automated Classification Using End-to-End Deep Learning
    Jaipurkar, Shobhit Sandeep
    Jie, Wang
    Zeng, Zeng
    Gee, Teo Sin
    Veeravalli, Bharadwaj
    Chua, Matthew
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 706 - 709
  • [4] An End-to-End Deep Learning System for Hop Classification
    Castro, Pedro
    Moreira, Gladston
    Luz, Eduardo
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2022, 20 (03) : 430 - 442
  • [5] End-To-End Deep-Learning-Based Tamil Handwritten Document Recognition and Classification Model
    Vinotheni, C.
    Pandian, S. Lakshmana
    [J]. IEEE ACCESS, 2023, 11 : 43195 - 43204
  • [6] An End-to-End Detection Method for WebShell with Deep Learning
    Qi, Longchen
    Kong, Rui
    Lu, Yang
    Zhuang, Honglin
    [J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 660 - 665
  • [7] Fast ultrasonic imaging using end-to-end deep learning
    Pilikos, Georgios
    Horchens, Lars
    Batenburg, Kees Joost
    van Leeuwen, Tristan
    Lucka, Felix
    [J]. PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2020,
  • [8] An End-to-End Deep Learning Method for Voltage Sag Classification
    Turovic, Radovan
    Dragan, Dinu
    Gojic, Gorana
    Petrovic, Veljko B.
    Gajic, Dusan B.
    Stanisavljevic, Aleksandar M.
    Katic, Vladimir A.
    [J]. ENERGIES, 2022, 15 (08)
  • [9] An efficient end-to-end deep learning architecture for activity classification
    Amel Ben Mahjoub
    Mohamed Atri
    [J]. Analog Integrated Circuits and Signal Processing, 2019, 99 : 23 - 32
  • [10] An end-to-end deep learning approach for Raman spectroscopy classification
    Zhou, Mengfei
    Hu, Yinchao
    Wang, Ruizhen
    Guo, Tian
    Yu, Qiqing
    Xia, Luyue
    Sun, Xiaofang
    [J]. JOURNAL OF CHEMOMETRICS, 2023, 37 (02)