Pre-training transformer with dual-branch context content module for table detection in document images

被引:0
|
作者
Li, Yongzhi [1 ]
Zhang, Pengle [1 ]
Sun, Meng [2 ]
Huang, Jin [1 ,3 ]
He, Ruhan [1 ,3 ]
机构
[1] School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan,430064, China
[2] School of Computer Science, South-Central Minzu University, Wuhan,430064, China
[3] Hubei Provincial Engineering Research Center for Intelligent Textile and Fashion, Wuhan Textile University, Wuhan,430064, China
来源
Virtual Reality and Intelligent Hardware | 2024年 / 6卷 / 05期
关键词
Deformable convolution - Dilated convolution - Document image analysis - Document images - Features extraction - Features fusions - Pre-training - Shape and size - Table detection - Transformer;
D O I
10.1016/j.vrih.2024.06.003
中图分类号
学科分类号
摘要
33
引用
收藏
页码:408 / 420
相关论文
共 47 条
  • [31] Pre-training using pseudo images and fine-tuning using real images for nighttime traffic Sign Detection
    Yamamoto M.
    Ohashi G.
    IEEJ Transactions on Electronics, Information and Systems, 2021, 141 (09) : 969 - 976
  • [32] Dual-Branch Multiscale Optimization Network for Enhancing Low-Light Images in Rail Transit Obstacle Detection and Segmentation
    Liu, Qi
    He, Deqiang
    Zhang, Mingchao
    Wu, Jinxin
    IEEE SENSORS JOURNAL, 2025, 25 (03) : 5697 - 5710
  • [33] ASI-DBNet: An Adaptive Sparse Interactive ResNet-Vision Transformer Dual-Branch Network for the Grading of Brain Cancer Histopathological Images
    Xiaoli Zhou
    Chaowei Tang
    Pan Huang
    Sukun Tian
    Francesco Mercaldo
    Antonella Santone
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 15 - 31
  • [34] ASI-DBNet: An Adaptive Sparse Interactive ResNet-Vision Transformer Dual-Branch Network for the Grading of Brain Cancer Histopathological Images
    Zhou, Xiaoli
    Tang, Chaowei
    Huang, Pan
    Tian, Sukun
    Mercaldo, Francesco
    Santone, Antonella
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (01) : 15 - 31
  • [35] A CNN- and Transformer-Based Dual-Branch Network for Change Detection with Cross-Layer Feature Fusion and Edge Constraints
    Wang, Xiaofeng
    Guo, Zhongyu
    Feng, Ruyi
    REMOTE SENSING, 2024, 16 (14)
  • [36] A method for freshness detection of pork using two-dimensional correlation spectroscopy images combined with dual-branch deep learning
    Sun, Jun
    Cheng, Jiehong
    Xu, Min
    Yao, Kunshan
    JOURNAL OF FOOD COMPOSITION AND ANALYSIS, 2024, 129
  • [37] Snow Detection in Gaofen-1 Multi-Spectral Images Based on Swin-Transformer and U-Shaped Dual-Branch Encoder Structure Network with Geographic Information
    Wu, Yue
    Shi, Chunxiang
    Shen, Runping
    Gu, Xiang
    Tie, Ruian
    Ge, Lingling
    Sun, Shuai
    REMOTE SENSING, 2024, 16 (17)
  • [38] MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
    Car, Pengfei
    Song, Yan
    Li, Kang
    Song, Haoyu
    McLoughlin, Ian
    INTERSPEECH 2024, 2024, : 557 - 561
  • [39] A Lightweight Dual-Branch Network for Building Change Detection in Remote Sensing Images Integrating Cross-Scale Coupling and Boundary Constraint
    Dai, Yanshuai
    Shen, Li
    Wang, Yong
    Liu, Shichuan
    Li, Zhilin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [40] Self-supervised pseudo multi-class pre-training for unsupervised anomaly detection and segmentation in medical images
    Tian, Yu
    Liu, Fengbei
    Pang, Guansong
    Chen, Yuanhong
    Liu, Yuyuan
    Verjans, Johan W.
    Singh, Rajvinder
    Carneiro, Gustavo
    MEDICAL IMAGE ANALYSIS, 2023, 90