Pedestrian Counting using Deep Models Trained on Synthetically Generated Images

被引：2

作者：

Ghosh, Sanjukta ^{[1
,2
]}

Amon, Peter ^{[2
]}

Hutter, Andreas ^{[2
]}

Kaup, Andre ^{[1
]}

机构：

[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Multimedia Commun & Signal Proc, Erlangen, Germany

[2] Siemens Corp Technol, Sensing & Ind Imaging, Munich, Germany

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5 | 2017年

关键词：

Pedestrian Counting; Deep Learning; Convolutional Neural Networks; Synthetic Images; Transfer Learning; Cross Entropy Cost Function; Squared Error Cost Function;

D O I：

10.5220/0006132600860097

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Counting pedestrians in surveillance applications is a common scenario. However, it is often challenging to obtain sufficient annotated training data, especially so for creating models using deep learning which require a large amount of training data. To address this problem, this paper explores the possibility of training a deep convolutional neural network (CNN) entirely from synthetically generated images for the purpose of counting pedestrians. Nuances of transfer learning are exploited to train models from a base model trained for image classification. A direct approach and a hierarchical approach are used during training to enhance the capability of the model for counting higher number of pedestrians. The trained models are then tested on natural images of completely different scenes captured by different acquisition systems not experienced by the model during training. Furthermore, the effectiveness of the cross entropy cost function and the squared error cost function are evaluated and analyzed for the scenario where a model is trained entirely using synthetic images. The performance of the trained model for the test images from the target site can be improved by fine-tuning using the image of the background of the target site.

引用

页码：86 / 97

页数：12

共 50 条

[1] Can Segmentation Models Be Trained with Fully Synthetically Generated Data?
Fernandez, Virginia
Pinaya, Walter Hugo Lopez
Borges, Pedro
Tudosiu, Petru-Daniel
Graham, Mark S.
Vercauteren, Tom
Cardoso, M. Jorge
[J]. SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2022, 2022, 13570 : 79 - 90
[2] Deep Learning Model for Static Ocular Torsion Detection Using Synthetically Generated Fundus Images
Wang, Chen
Bai, Yunong
Tsang, Ashley
Bian, Yuhan
Gou, Yifan
Lin, Yan X.
Zhao, Matthew
Wei, Tony Y.
Desman, Jacob M.
Taylor, Casey Overby
Greenstein, Joseph L.
Otero-Millan, Jorge
Liu, Tin Yan Alvin
Kheradmand, Amir
Zee, David S.
Green, Kemar E.
[J]. TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2023, 12 (01):
[3] A Deep Learning Framework for Segmenting Brain Tumors Using MRI and Synthetically Generated CT Images
Islam, Kh Tohidul
Wijewickrema, Sudanthi
O'Leary, Stephen
[J]. SENSORS, 2022, 22 (02)
[4] RELIABLE PEDESTRIAN DETECTION USING A DEEP NEURAL NETWORK TRAINED ON PEDESTRIAN COUNTS
Ghosh, Sanjukta
Amon, Peter
Hutter, Andreas
Kaup, Andre
[J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 685 - 689
[5] Using Adversarial Images to Assess the Robustness of Deep Learning Models Trained on Diagnostic Images in Oncology
Joel, Marina Z.
Umrao, Sachin
Chang, Enoch
Choi, Rachel
Yang, Daniel X.
Duncan, James S.
Omuro, Antonio
Herbst, Roy
Krumholz, Harlan M.
Aneja, Sanjay
[J]. JCO CLINICAL CANCER INFORMATICS, 2022, 6
[6] First-person reading activity recognition by deep learning with synthetically generated images
Yuta Segawa
Kazuhiko Kawamoto
Kazushi Okamoto
[J]. EURASIP Journal on Image and Video Processing, 2018
[7] First-person reading activity recognition by deep learning with synthetically generated images
Segawa, Yuta
Kawamoto, Kazuhiko
Okamoto, Kazushi
[J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
[8] MixedPeds: Pedestrian Detection in Unannotated Videos Using Synthetically Generated Human-Agents for Training
Cheung, Ernest
Wong, Anson
Bera, Aniket
Manocha, Dinesh
[J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6738 - 6747
[9] Correction of AFM data artifacts using a convolutional neural network trained with synthetically generated data
Kocur, Viktor
Hegrova, Veronika
Patocka, Marek
Neuman, Jan
Herout, Adam
[J]. ULTRAMICROSCOPY, 2023, 246
[10] Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
Lorenz, Peter
Durall, Ricard L.
Keuper, Janis
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 448 - 459

← 1 2 3 4 5 →