Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment

被引:19
|
作者
Kuang, Qi [1 ]
Jin, Xin [2 ]
Zhao, Qinping [1 ]
Zhou, Bin [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Beijing Elect Sci & Technol Inst, Beijing 100070, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Aesthetic quality assessment; aerial video aesthetic; deep multimodality learning; OBSTACLE AVOIDANCE; CLASSIFICATION; CATEGORIZATION; GENERATION; PHOTO;
D O I
10.1109/TMM.2019.2960656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the growing number of unmanned aerial vehicles (UAVs) and aerial videos, there is a paucity of studies focusing on the aesthetics of aerial videos that can provide valuable information for improving the aesthetic quality of aerial photography. In this article, we present a method of deep multimodality learning for UAV video aesthetic quality assessment. More specifically, a multistream framework is designed to exploit aesthetic attributes from multiple modalities, including spatial appearance, drone camera motion, and scene structure. A novel specially designed motion stream network is proposed for this new multistream framework. We construct a dataset with 6,000 UAV video shots captured by drone cameras. Our model can judge whether a UAV video was shot by professional photographers or amateurs together with the scene type classification. The experimental results reveal that our method outperforms the video classification methods and traditional SVM-based methods for video aesthetics. In addition, we present three application examples of UAV video grading, professional segment detection and aesthetic-based UAV path planning using the proposed method.
引用
收藏
页码:2623 / 2634
页数:12
相关论文
共 50 条
  • [1] Deep Learning for Quality Assessment in Live Video Streaming
    Vega, Maria Torres
    Mocanu, Decebal Constantin
    Famaey, Jeroen
    Stavrou, Stavros
    Liotta, Antonio
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (06) : 736 - 740
  • [2] A Deep Learning Methodology for Automatic Assessment of Portrait Image Aesthetic Quality
    Wettayakorn, Poom
    Traivijitkhun, Siripong
    Phetchai, Ponpat
    Tuarob, Suppawong
    [J]. 2018 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2018, : 129 - 134
  • [3] A deep learning framework for quality assessment and restoration in video endoscopy
    Ali, Sharib
    Zhou, Felix
    Bailey, Adam
    Braden, Barbara
    East, James E.
    Lu, Xin
    Rittscher, Jens
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 68
  • [4] Deep Learning for Image/Video Compression and Visual Quality Assessment
    Pan, Zhaoqing
    Jeon, Byeungwoo
    Ling, Nam
    Peng, Bo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42483 - 42483
  • [5] Deep Learning for Image/Video Compression and Visual Quality Assessment
    [J]. Multimedia Tools and Applications, 2022, 81 : 42483 - 42483
  • [6] Query-Dependent Aesthetic Model With Deep Learning for Photo Quality Assessment
    Tian, Xinmei
    Dong, Zhe
    Yang, Kuiyuan
    Mei, Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 2035 - 2048
  • [7] Deep Aesthetic Quality Assessment With Semantic Information
    Kao, Yueying
    He, Ran
    Huang, Kaiqi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (03) : 1482 - 1495
  • [8] Research Progress on the Aesthetic Quality Assessment of Complex Layout Images Based on Deep Learning
    Pu, Yumei
    Liu, Danfei
    Chen, Siyuan
    Zhong, Yunfei
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [9] Deep learning based hierarchical classifier for weapon stock aesthetic quality control assessment
    Manuel Vargas, Victor
    Antonio Gutierrez, Pedro
    Rosati, Riccardo
    Romeo, Luca
    Frontoni, Emanuele
    Hervas-Martinez, Cesar
    [J]. COMPUTERS IN INDUSTRY, 2023, 144
  • [10] Deep Learning and Video Quality Analysis
    Topiwala, P.
    Krishnan, M.
    Dai, W.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137