Deep Neural Networks for Page Stream Segmentation and Classification

Authors
Gallo, Ignazio [1]
Noce, Lucia [1]
Zamberletti, Alessandro [1]
Calefati, Alessandro [1]
Affiliations
[1] Univ Insubria, Dept Theoret & Appl Sci DiSTA, Via Mazzini 5, Varese, Italy
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In this manuscript we propose a novel method for joint page stream segmentation and multi-page document classification. The end goal is to classify a stream of pages as belonging to different classes of documents. We take advantage of the recent state-of-the-art results achieved using deep architectures in related fields such as document image classification, and we adopt similar models to obtain satisfactory classification accuracy and low computational complexity. Our contribution is twofold: first, visual features are extracted from the processed documents automatically by the chosen Convolutional Neural Network; second, the predictions of the same network are further refined by an additional deep model, which processes them in a classic sliding-window manner to help find and correct classification errors made by the first network. The proposed pipeline has been evaluated on a publicly available dataset composed of more than half a million multi-page documents collected by an online loan comparison company, showing excellent results and high efficiency.
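The two-stage idea in the abstract (per-page predictions from a first network, then refinement over a sliding window of neighbouring pages) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the window radius, and the example probabilities are hypothetical, and the second stage is replaced by a plain average over the window, which is only a stand-in for the additional deep model the authors describe.

```python
from itertools import groupby
from typing import List, Sequence, Tuple

Probs = Sequence[float]  # per-page class probabilities from the first network


def sliding_windows(page_probs: Sequence[Probs], radius: int) -> List[List[Probs]]:
    """Collect, for each page, the predictions of its neighbours.

    Indices outside the stream are clamped, so edge pages are repeated.
    """
    n = len(page_probs)
    windows = []
    for i in range(n):
        window = []
        for j in range(i - radius, i + radius + 1):
            k = min(max(j, 0), n - 1)  # clamp to the valid page range
            window.append(page_probs[k])
        windows.append(window)
    return windows


def refine(page_probs: Sequence[Probs], radius: int = 1) -> List[int]:
    """Relabel each page from its window.

    ASSUMPTION: averaging is a stand-in for the second deep model,
    which in the paper is a trained network, not a fixed rule.
    """
    labels = []
    for window in sliding_windows(page_probs, radius):
        num_classes = len(window[0])
        mean = [sum(p[c] for p in window) / len(window) for c in range(num_classes)]
        labels.append(max(range(num_classes), key=mean.__getitem__))
    return labels


def segment(labels: Sequence[int]) -> List[Tuple[int, int, int]]:
    """Group consecutive equal labels into (label, start, end), end exclusive."""
    segments = []
    start = 0
    for label, run in groupby(labels):
        length = len(list(run))
        segments.append((label, start, start + length))
        start += length
    return segments


# Hypothetical stream of six pages and two document classes.
page_probs = [
    [0.90, 0.10],
    [0.80, 0.20],
    [0.40, 0.60],  # isolated disagreement with its neighbours
    [0.95, 0.05],
    [0.20, 0.80],
    [0.10, 0.90],
]
refined = refine(page_probs, radius=1)
print(refined)           # [0, 0, 0, 0, 1, 1]
print(segment(refined))  # [(0, 0, 4), (1, 4, 6)]
```

Note that splitting the stream wherever the label changes cannot separate two consecutive documents of the same class; how the original method handles that case is not stated in the abstract.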
Pages: 127-133
Number of pages: 7