Pedestrian Counting using Deep Models Trained on Synthetically Generated Images

被引:2
|
作者
Ghosh, Sanjukta [1 ,2 ]
Amon, Peter [2 ]
Hutter, Andreas [2 ]
Kaup, Andre [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Multimedia Commun & Signal Proc, Erlangen, Germany
[2] Siemens Corp Technol, Sensing & Ind Imaging, Munich, Germany
关键词
Pedestrian Counting; Deep Learning; Convolutional Neural Networks; Synthetic Images; Transfer Learning; Cross Entropy Cost Function; Squared Error Cost Function;
D O I
10.5220/0006132600860097
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counting pedestrians in surveillance applications is a common scenario. However, it is often challenging to obtain sufficient annotated training data, especially so for creating models using deep learning which require a large amount of training data. To address this problem, this paper explores the possibility of training a deep convolutional neural network (CNN) entirely from synthetically generated images for the purpose of counting pedestrians. Nuances of transfer learning are exploited to train models from a base model trained for image classification. A direct approach and a hierarchical approach are used during training to enhance the capability of the model for counting higher number of pedestrians. The trained models are then tested on natural images of completely different scenes captured by different acquisition systems not experienced by the model during training. Furthermore, the effectiveness of the cross entropy cost function and the squared error cost function are evaluated and analyzed for the scenario where a model is trained entirely using synthetic images. The performance of the trained model for the test images from the target site can be improved by fine-tuning using the image of the background of the target site.
引用
收藏
页码:86 / 97
页数:12
相关论文
共 50 条
  • [1] Can Segmentation Models Be Trained with Fully Synthetically Generated Data?
    Fernandez, Virginia
    Pinaya, Walter Hugo Lopez
    Borges, Pedro
    Tudosiu, Petru-Daniel
    Graham, Mark S.
    Vercauteren, Tom
    Cardoso, M. Jorge
    [J]. SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2022, 2022, 13570 : 79 - 90
  • [2] Deep Learning Model for Static Ocular Torsion Detection Using Synthetically Generated Fundus Images
    Wang, Chen
    Bai, Yunong
    Tsang, Ashley
    Bian, Yuhan
    Gou, Yifan
    Lin, Yan X.
    Zhao, Matthew
    Wei, Tony Y.
    Desman, Jacob M.
    Taylor, Casey Overby
    Greenstein, Joseph L.
    Otero-Millan, Jorge
    Liu, Tin Yan Alvin
    Kheradmand, Amir
    Zee, David S.
    Green, Kemar E.
    [J]. TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2023, 12 (01):
  • [3] A Deep Learning Framework for Segmenting Brain Tumors Using MRI and Synthetically Generated CT Images
    Islam, Kh Tohidul
    Wijewickrema, Sudanthi
    O'Leary, Stephen
    [J]. SENSORS, 2022, 22 (02)
  • [4] RELIABLE PEDESTRIAN DETECTION USING A DEEP NEURAL NETWORK TRAINED ON PEDESTRIAN COUNTS
    Ghosh, Sanjukta
    Amon, Peter
    Hutter, Andreas
    Kaup, Andre
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 685 - 689
  • [5] Using Adversarial Images to Assess the Robustness of Deep Learning Models Trained on Diagnostic Images in Oncology
    Joel, Marina Z.
    Umrao, Sachin
    Chang, Enoch
    Choi, Rachel
    Yang, Daniel X.
    Duncan, James S.
    Omuro, Antonio
    Herbst, Roy
    Krumholz, Harlan M.
    Aneja, Sanjay
    [J]. JCO CLINICAL CANCER INFORMATICS, 2022, 6
  • [6] First-person reading activity recognition by deep learning with synthetically generated images
    Yuta Segawa
    Kazuhiko Kawamoto
    Kazushi Okamoto
    [J]. EURASIP Journal on Image and Video Processing, 2018
  • [7] First-person reading activity recognition by deep learning with synthetically generated images
    Segawa, Yuta
    Kawamoto, Kazuhiko
    Okamoto, Kazushi
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
  • [8] MixedPeds: Pedestrian Detection in Unannotated Videos Using Synthetically Generated Human-Agents for Training
    Cheung, Ernest
    Wong, Anson
    Bera, Aniket
    Manocha, Dinesh
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6738 - 6747
  • [9] Correction of AFM data artifacts using a convolutional neural network trained with synthetically generated data
    Kocur, Viktor
    Hegrova, Veronika
    Patocka, Marek
    Neuman, Jan
    Herout, Adam
    [J]. ULTRAMICROSCOPY, 2023, 246
  • [10] Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
    Lorenz, Peter
    Durall, Ricard L.
    Keuper, Janis
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 448 - 459