Best Frame Selection in a Short Video

被引:0
|
作者
Ren, Jian [1 ,4 ]
Shen, Xiaohui [2 ,4 ]
Lin, Zhe [3 ]
Mech, Radomir [3 ]
机构
[1] Snap Inc, Santa Monica, CA 90405 USA
[2] ByteDance AI Lab, Beijing, Peoples R China
[3] Adobe Res, San Jose, CA USA
[4] Adobe, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People usually take short videos to record meaningful moments in their lives. However, selecting the most representative frame, which not only has high image visual quality but also captures video content, from a short video to share or keep is a time-consuming process for one may need to manually go through all the frames in a video to make a decision. In this paper, we introduce the problem of the best frame selection in a short video and aim to solve it automatically. Towards this end, we collect and will release a diverse large-scale short video dataset that includes 11, 000 videos shoot in our daily life. All videos are assumed to be short (e.g., a few seconds) and each video has human-annotated of the best frame. Then we introduce a deep convolutional neural network (CNN) based approach with ranking objective to automatically pick the best frame from frame sequences extracted via short videos. Additionally, we propose new evaluation metrics, especially for the best frame selection. In experiments, we show our approach outperforms various other methods significantly.
引用
收藏
页码:3201 / 3210
页数:10
相关论文
共 50 条
  • [31] HYBRID DISTRIBUTED VIDEO CODING WITH FRAME LEVEL CODING MODE SELECTION
    Chiu, Chieh-Chuan
    Chien, Shao-Yi
    Lee, Chia-Han
    Somayazulu, V. Srinivasa
    Chen, Yen-Kuang
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1561 - 1564
  • [32] Multimodal emotion recognition based on peak frame selection from video
    Zhalehpour, Sara
    Akhtar, Zahid
    Erdem, Cigdem Eroglu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (05) : 827 - 834
  • [33] New Multi-reference Frame Selection for Multiview Video Coding
    Si, Yuehou
    Yu, Mei
    Peng, Zongju
    Jiang, Gangyi
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 39 - 42
  • [34] TEMPORALLY CONSISTENT KEY FRAME SELECTION FROM VIDEO FOR FACE RECOGNITION
    Saeed, Usman
    Dugelay, Jean-Luc
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1311 - 1315
  • [35] Effective Video Data Retrieval Using Image Key Frame Selection
    Saravanan, D.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 145 - 155
  • [36] Secure video steganography using key frame and region selection technique
    Roselinkiruba R.
    Saranya Jothi C.
    Tamil Thendral M.
    Hemalatha R.
    International Journal of Information Technology, 2023, 15 (3) : 1299 - 1308
  • [37] Adaptive Lagrange multiplier selection for intra-frame video coding
    Li, Xiang
    Oertel, Norbert
    Kaup, Andre
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 3643 - +
  • [38] Frame Selection for Producing Recipe with Pictures from an Execution Video of a Recipe
    Nishimura, Taichi
    Hashimoto, Atsushi
    Yamakata, Yoko
    Mori, Shinsuke
    CEA'19: PROCEEDINGS OF THE 11TH WORKSHOP ON MULTIMEDIA FOR COOKING AND EATING ACTIVITIES, 2019, : 9 - 16
  • [39] Multiview video coding method with adaptive selection of reference frame modes
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2007, 9 (1132-1137):
  • [40] Video Summarization Using a Key Frame Selection Based on Shot Segmentation
    Widiarto, Wisnu
    Yuniarno, Eko Mulyanto
    Hariadi, Mochamad
    2015 International Conference on Science in Information Technology (ICSITech), 2015, : 207 - 212