Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment

被引:19
|
作者
Kuang, Qi [1 ]
Jin, Xin [2 ]
Zhao, Qinping [1 ]
Zhou, Bin [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Beijing Elect Sci & Technol Inst, Beijing 100070, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Aesthetic quality assessment; aerial video aesthetic; deep multimodality learning; OBSTACLE AVOIDANCE; CLASSIFICATION; CATEGORIZATION; GENERATION; PHOTO;
D O I
10.1109/TMM.2019.2960656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the growing number of unmanned aerial vehicles (UAVs) and aerial videos, there is a paucity of studies focusing on the aesthetics of aerial videos that can provide valuable information for improving the aesthetic quality of aerial photography. In this article, we present a method of deep multimodality learning for UAV video aesthetic quality assessment. More specifically, a multistream framework is designed to exploit aesthetic attributes from multiple modalities, including spatial appearance, drone camera motion, and scene structure. A novel specially designed motion stream network is proposed for this new multistream framework. We construct a dataset with 6,000 UAV video shots captured by drone cameras. Our model can judge whether a UAV video was shot by professional photographers or amateurs together with the scene type classification. The experimental results reveal that our method outperforms the video classification methods and traditional SVM-based methods for video aesthetics. In addition, we present three application examples of UAV video grading, professional segment detection and aesthetic-based UAV path planning using the proposed method.
引用
收藏
页码:2623 / 2634
页数:12
相关论文
共 50 条
  • [31] A deep learning approach for quality enhancement of surveillance video
    Ding, Dandan
    Tong, Junchao
    Kong, Lingyi
    [J]. JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 24 (03) : 304 - 314
  • [32] Photo Aesthetic Quality Assessment via Label Distribution Learning
    Zhang, Xiaowei
    Gao, Fei
    Huang, Di
    Tan, Min
    Yu, Jun
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 1467 - 1470
  • [33] Deep Learning for Video-Based Assessment in Surgery
    Yanik, Erim
    Schwaitzberg, Steven
    De, Suvranu
    [J]. JAMA SURGERY, 2024, 159 (08) : 957 - 958
  • [34] Hierarchical aesthetic quality assessment using deep convolutional neural networks
    Kao, Yueying
    Huang, Kaiqi
    Maybank, Steve
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 500 - 510
  • [35] Analyzing Multimodality of Video for User Engagement Assessment
    Salim, Fahim A.
    Haider, Fasih
    Conlan, Owen
    Luz, Saturnino
    Campbell, Nick
    [J]. ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 287 - 290
  • [36] Radiometric Quality Assessment of Video Sequences Acquired from UAV Photogrammetric Systems
    Wierzbicki, Damian
    Fryskowska, Anna
    [J]. 10TH INTERNATIONAL CONFERENCE ENVIRONMENTAL ENGINEERING (10TH ICEE), 2017,
  • [37] Blind Video Quality Assessment for Ultra-High-Definition Video Based on Super-Resolution and Deep Reinforcement Learning
    Ying, Zefeng
    Pan, Da
    Shi, Ping
    [J]. SENSORS, 2023, 23 (03)
  • [38] Optimal Trajectory Learning for UAV-BS Video Provisioning System: A Deep Reinforcement Learning Approach
    Kwon, Dohyun
    Kim, Joongheon
    [J]. 33RD INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2019), 2019, : 372 - 374
  • [39] Quality Assessment with Deep Learning for Imaging Applications
    Voronin, V.
    Zelensky, A.
    Zhdanova, M.
    Semenishchev, E.
    Frantc, V
    Siryakov, A.
    [J]. MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100
  • [40] Meat Quality Assessment based on Deep Learning
    Ulucan, Oguzhan
    Karakaya, Diclehan
    Turkan, Mehmet
    [J]. 2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 62 - 66