Transfer Learning for Object Detection using State-of-the-Art Deep Neural Networks

Cited by: 0
Authors
Talukdar, J. [1]
Gupta, S. [2]
Rajpura, P. S. [3]
Hegde, R. S. [3]
Affiliations
[1] Nirma Univ, Dept Elect & Commun Engn, Ahmadabad, Gujarat, India
[2] BITS Pilani, Dept Comp Engn & Informat Sci, Hyderabad Campus, Hyderabad, Telangana, India
[3] Indian Inst Technol, High Performance Comp Lab, Gandhinagar, Gujarat, India
Keywords
Transfer learning; Computer Vision; Deep Neural Networks; Synthetic Datasets; Artificial Intelligence;
DOI
Not available
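Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809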
Abstract
Transfer learning through the use of synthetic images and pretrained convolutional neural networks offers a promising approach to improving the object detection performance of deep neural networks. In this paper, we explore different strategies for generating synthetic datasets and subsequently refining them to achieve better object detection accuracy, measured as mean average precision (mAP), when training state-of-the-art deep neural networks, focusing on the detection of packed food products in a refrigerator. We develop novel techniques such as dynamic stacking, pseudo-random placement, variable object pose, and distractor noise, which not only diversify the synthetic data but also improve the overall object detection mAP by more than 40%. The synthetic images, generated using the Blender-Python API, are clustered in a variety of configurations to reflect the diversity of real scenes. These datasets are then used to train TensorFlow implementations of state-of-the-art deep neural networks such as Faster-RCNN, R-FCN, and SSD, and their performance is tested on real scenes. The object detection performance of the various deep CNN architectures is also compared, with Faster-RCNN proving to be the most suitable choice, achieving the highest mAP of 70.67.
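The abstract names pseudo-random placement, variable object pose, and distractor noise but does not describe how they are implemented. The following is only a minimal sketch of one way such a layout step could work, written in plain Python rather than against the Blender-Python API. The function names (`boxes_overlap`, `place_objects`, `annotated_only`), the rejection-sampling strategy, the size range, and the product labels are all assumptions made for illustration and are not taken from the paper.

```python
import random


def boxes_overlap(a, b):
    """Return True if two axis-aligned (x, y, w, h) boxes intersect."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def place_objects(labels, scene_w, scene_h, size_range=(40, 120),
                  n_distractors=2, max_tries=200, seed=None):
    """Hypothetical layout step: place labelled objects and unlabelled
    distractors at pseudo-random, non-overlapping positions.

    Uses simple rejection sampling (an assumption, not the paper's method):
    draw a candidate box, keep it only if it overlaps nothing placed so far.
    """
    rng = random.Random(seed)  # seeded generator makes the layout reproducible
    placed = []
    names = list(labels) + ["distractor"] * n_distractors
    for name in names:
        for _ in range(max_tries):
            w = rng.randint(*size_range)
            h = rng.randint(*size_range)
            x = rng.randint(0, scene_w - w)
            y = rng.randint(0, scene_h - h)
            box = (x, y, w, h)
            if not any(boxes_overlap(box, p["box"]) for p in placed):
                placed.append({
                    "label": name,
                    "box": box,
                    # variable pose, reduced here to a single random yaw angle
                    "yaw_deg": rng.uniform(0.0, 360.0),
                })
                break  # this object is placed; move on to the next one
    return placed


def annotated_only(placed):
    """Distractors appear in the scene but get no ground-truth annotation."""
    return [p for p in placed if p["label"] != "distractor"]


if __name__ == "__main__":
    scene = place_objects(["milk", "juice", "butter"], 640, 480, seed=0)
    for item in annotated_only(scene):
        print(item["label"], item["box"], round(item["yaw_deg"], 1))
```

In a real pipeline the resulting positions and angles would be handed to the renderer, and only the non-distractor boxes would be written out as training labels, so the detector learns to ignore clutter.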
Pages: 78-83
Page count: 6