EVENT AND ACTIVITY RECOGNITION IN AERIAL VIDEOS USING DEEP NEURAL NETWORKS AND A NEW DATASET

被引:3
|
作者
Mou, Lichao [1 ,2 ]
Hua, Yuansheng [1 ,2 ]
Jin, Pu [3 ]
Zhu, Xiao Xiang [1 ,2 ]
机构
[1] German Aerosp Ctr DLR, Remote Sensing Technol Inst IMF, Cologne, Germany
[2] Tech Univ Munich TUM, Signal Proc Earth Observat SiPEO, Munich, Germany
[3] Tech Univ Munich TUM, Munich, Germany
关键词
Unmanned aerial vehicle (UAV) video; deep learning; event recognition; activity recognition;
D O I
10.1109/IGARSS39084.2020.9324182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unmanned aerial vehicles (UAVs) are now widespread available. Yet the more UAVs there are in the skies, the more video data they create. It is unrealistic for humans to screen such big data and understand their contents. Hence methodological research on UAV video content understanding is of great importance. In this paper, we introduce a novel task of event recognition in unconstrained aerial videos in the remote sensing community and present a dataset for this task. Organized in a rich semantic taxonomy, the proposed dataset covers a wide range of events involving diverse environments and scales. We report results of plenty of deep networks in two ways: single-frame classification and video classification. The dataset and trained models can be downloaded from https://1cmou.github.i0/ERA_Dataset/.
引用
收藏
页码:952 / 955
页数:4
相关论文
共 50 条
  • [31] Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition
    Takahashi, Naoya
    Gygli, Michael
    Pfister, Beat
    Van Goole, Luc
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2982 - 2986
  • [32] Event Recognition on Images by Fine-Tuning of Deep Neural Networks
    Yudin, Dmitry
    Zeno, Bassel
    PROCEEDINGS OF THE SECOND INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'17), VOL 1, 2018, 679 : 479 - 487
  • [33] Diving deep into human action recognition in aerial videos: A survey
    Kapoor, Surbhi
    Sharma, Akashdeep
    Verma, Amandeep
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [34] Activity Recognition for Locomotion and Transportation Dataset Using Deep Learning
    Naseeb, Chan
    Al Saeedi, Bilal
    UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 329 - 334
  • [35] Direct Aerial Visual Geolocalization Using Deep Neural Networks
    Harvey, Winthrop
    Rainwater, Chase
    Cothren, Jackson
    REMOTE SENSING, 2021, 13 (19)
  • [36] Emotion Recognition from Face Dataset Using Deep Neural Nets
    Das, Deepjoy
    Chakrabarty, Alok
    PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2016,
  • [37] Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
    Liu, Xiao
    Yang, Xudong
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 251 - 262
  • [38] Violence Detection in Videos using Deep Recurrent and Convolutional Neural Networks
    Traore, Abdarahmane
    Akhloufi, Moulay A.
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 154 - 159
  • [39] Facial Action Unit Detection Using Deep Neural Networks in Videos
    Akay, Simge
    Arica, Nafiz
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [40] Detection of tampered real time videos using deep neural networks
    Koshy L.
    Shyry S.P.
    Neural Computing and Applications, 2025, 37 (11) : 7691 - 7703