Automatic Text-based Clip Composition for Video News

被引:0
|
作者
Quandt, Dennis [1 ]
Altmeyer, Philipp [1 ]
Ruppel, Wolfgang [1 ]
Narroschke, Matthias [1 ]
机构
[1] RheinMain Univ Appl Sci, Wiesbaden, Germany
关键词
News Clip Editing; AI Video Editing; Computational Cinematography; Text-based Clip Sequencing;
D O I
10.1145/3665026.3665042
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
News broadcasters must produce engaging video clips quicker than ever to ensure their successful positioning in the market. This is due, in part, to the growing number of news sources and changes in media consumption amongst target audiences. This evolution has amplified the need to quickly produce news clips, a requirement that remains at odds with the traditionally manual and time-consuming video editing processes. Besides advances in automating video news production, current systems are yet to meet the sufficient automation level and quality standards required for professional news broadcasting. Addressing this gap, we propose a novel transformer-based framework for automatically composing news clips to streamline the editing process. Our framework is predicated on a vision-language feature embedding mechanism and a cross-attention transformer architecture designed to generate multi-shot news clips semantically coherent with the editorial text and stylistically consistent with professional editing benchmarks. Our framework composes news clips with a length of 2 minutes from source material ranging from 20 minutes to 2 hours in less than 5 minutes using a single GPU. In our user study, target groups with different experience levels rated the generated videos on a 6-point Likert scale. Users rated the news clips generated by our framework with an average score of 4.13 and the manually edited news clips with an average score of 4.58.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 50 条
  • [31] Semi-Automatic News Video Annotation Framework for Arabic Text
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    2014 4TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2014, : 227 - 232
  • [32] Style-A-Video: Agile Diffusion for Arbitrary Text-Based Video Style Transfer
    Huang, Nisha
    Zhang, Yuxin
    Dong, Weiming
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1494 - 1498
  • [33] Automatic video clip and mixing based on semantic sentence matching
    Zixi Jia
    Jiao Li
    Zhengjun Du
    Jingyu Ru
    Yating Wang
    Chengdong Wu
    Yutong Zhang
    Shuangjiang Yu
    Zhou Wang
    Changsheng Sun
    Ao Lyu
    Applied Intelligence, 2023, 53 : 2133 - 2146
  • [34] Automatic video clip and mixing based on semantic sentence matching
    Jia, Zixi
    Li, Jiao
    Du, Zhengjun
    Ru, Jingyu
    Wang, Yating
    Wu, Chengdong
    Zhang, Yutong
    Yu, Shuangjiang
    Wang, Zhou
    Sun, Changsheng
    Lyu, Ao
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2133 - 2146
  • [35] A comparison of video and text-based questionnaire methods for testing older adults
    Sleik, RJ
    Brown, LA
    Wong, IE
    Bocksnick, J
    RESEARCH QUARTERLY FOR EXERCISE AND SPORT, 2002, 73 (02) : 219 - 224
  • [36] Text-based informatics
    Valdes-Perez, RE
    SCIENTIST, 1998, 12 (14): : 10 - 10
  • [37] A comparison of preservice teacher perceptions of instructor video and text-based feedback
    Erik Kormos
    SN Social Sciences, 2 (8):
  • [38] Text-based Question Difficulty Prediction: A Systematic Review of Automatic Approaches
    Alkhuzaey, Samah
    Grasso, Floriana
    Payne, Terry R.
    Tamma, Valentina
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2023, 34 (3) : 862 - 914
  • [39] Towards automatic text-based estimation of depression through symptom prediction
    Milintsevich, Kirill
    Sirts, Kairit
    Dias, Gael
    BRAIN INFORMATICS, 2023, 10 (01)
  • [40] Automatic Detection and Recognition of Text-Based Traffic Signs from Images
    Oliveira, G.
    Silva, F.
    Pereira, D.
    Almeida, L.
    Artero, A.
    Bonora, A.
    de Albuquerque, V.
    IEEE LATIN AMERICA TRANSACTIONS, 2018, 16 (12) : 2947 - 2953