TOD-Net: An end-to-end transformer-based object detection network

被引:2
|
作者
Sirisha, Museboyina [1 ]
Sudha, S. V. [1 ]
机构
[1] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522237, Andhra Pradesh, India
关键词
Object detection; Network learning; Feature representation; Transformer; Predictor module; Feature analysis; Local features; Scaling; Semantic analysis; Network layer;
D O I
10.1016/j.compeleceng.2023.108695
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various object detection approaches using a learning model intends to learn the semantic and multi-scaling information to attain superior object saliency. This research employs a transformer-based network framework for object detection (TOD -Net) for object detection. It is composed of encoders, decoders, and transformer and predictor module. The predictor model bridges the connectivity between the encoder and the transformer module and offers better insight into the transformer module's local measures. Here, feature extraction is performed to measure the local features and establishes dense modeling by analyzing local features. The model gives broader knowledge of local and global features. Python programming was used to experiment with the MS COCO dataset (Microsoft Common Objects in Context) where the experimentation gives better results over existing models. In contrast to existing methods, the proposed method achieves 68.7% precision and 4% accuracy. The proposed model outperforms different prevailing ap-proaches and establishes a better trade-off.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] TOD-NET: TRANSFORMER-BASED NEURAL NETWORK FOR TINY OBJECT DETECTION IN SPERM MICROSCOPIC VIDEOS
    Zhang, J.
    Zou, S.
    Li, C.
    Yao, Y.
    Rahaman, M.
    Qian, W.
    Sun, H.
    Grzegorzek, M.
    Wang, G.
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [2] Transformer-based End-to-End Object Detection in Aerial Images
    Vo, Nguyen D.
    Le, Nguyen
    Ngo, Giang
    Doan, Du
    Le, Do
    Nguyen, Khang
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 1072 - 1079
  • [3] TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction
    Saeed, Numan
    Sobirov, Ikboljon
    Al Majzoub, Roba
    Yaqub, Mohammad
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 319 - 329
  • [4] Transformer-based end-to-end scene text recognition
    Zhu, Xinghao
    Zhang, Zhi
    [J]. PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 1691 - 1695
  • [5] End-to-End Asbestos Roof Detection on Orthophotos Using Transformer-Based YOLO Deep Neural Network
    Pace, Cesare Davide
    Bria, Alessandro
    Focareta, Mariano
    Lozupone, Gabriele
    Marrocco, Claudio
    Meoli, Giuseppe
    Molinara, Mario
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 232 - 244
  • [6] End-to-End lightweight Transformer-Based neural network for grasp detection towards fruit robotic handling
    Guo, Congmin
    Zhu, Chenhao
    Liu, Yuchen
    Huang, Renjun
    Cao, Boyuan
    Zhu, Qingzhen
    Zhang, Ranxin
    Zhang, Baohua
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 221
  • [7] SRDD: a lightweight end-to-end object detection with transformer
    Zhu, Yuan
    Xia, Qingyuan
    Jin, Wen
    [J]. CONNECTION SCIENCE, 2022, 34 (01) : 2448 - 2465
  • [8] Rotated-DETR: an End-to-End Transformer-based Oriented Object Detector for Aerial Images
    Kim, Jinbeom
    Lee, Giljun
    Kim, Taejune
    Woo, Simon S.
    [J]. 38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1248 - 1255
  • [9] Transformer-Based End-to-End Anatomical and Functional Image Fusion
    Zhang, Jing
    Liu, Aiping
    Wang, Dan
    Liu, Yu
    Wang, Z. Jane
    Chen, Xun
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71 : 1 - 1
  • [10] A Transformer-Based End-to-End Automatic Speech Recognition Algorithm
    Dong, Fang
    Qian, Yiyang
    Wang, Tianlei
    Liu, Peng
    Cao, Jiuwen
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1592 - 1596