AVscript: Accessible Video Editing with Audio-Visual Scripts

被引:8
|
作者
Huh, Mina [1 ]
Yang, Saelyne [2 ]
Peng, Yi-Hao [3 ]
Chen, Xiang 'Anthony' [4 ]
Kim, Young-Ho [5 ]
Pavel, Amy [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[3] Carnegie Mellon Univ, Pittsburgh, PA USA
[4] Univ Calif Los Angeles, Los Angeles, CA USA
[5] NAVER AI Lab, Bundangdong, South Korea
关键词
video; authoring tools; accessibility;
D O I
10.1145/3544548.3581494
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sighted and blind and low vision (BLV) creators alike use videos to communicate with broad audiences. Yet, video editing remains inaccessible to BLV creators. Our formative study revealed that current video editing tools make it difficult to access the visual content, assess the visual quality, and efciently navigate the timeline. We present AVscript, an accessible text-based video editor. AVscript enables users to edit their video using a script that embeds the video's visual content, visual errors (e.g., dark or blurred footage), and speech. Users can also efficiently navigate between scenes and visual errors or locate objects in the frame or spoken words of interest. A comparison study (N=12) showed that AVscript signifcantly lowered BLV creators' mental demands while increasing confidence and independence in video editing. We further demonstrate the potential of AVscript through an exploratory study (N=3) where BLV creators edited their own footage.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Audio-Visual Glance Network for Efficient Video Recognition
    Nugroho, Muhammad Adi
    Woo, Sangmin
    Lee, Sumin
    Kim, Changick
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10116 - 10125
  • [22] Audio-visual speaker recognition for video broadcast news
    Maison, B
    Neti, C
    Senior, A
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79
  • [23] Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video
    Bolles, Robert
    Burns, J. Brian
    Graciarena, Martin
    Kathol, Andreas
    Lawson, Aaron
    McLaren, Mitchell
    Mensink, Thomas
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1907 - 1914
  • [24] Audio-visual synchrony for detection of monologues in video archives
    Iyengar, G
    Nock, HJ
    Neti, C
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 329 - 332
  • [25] Audio-Visual Speaker Recognition for Video Broadcast News
    Benoît Maison
    Chalapathy Neti
    Andrew Senior
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 71 - 79
  • [26] Audio-visual event recognition in surveillance video sequences
    Cristani, Marco
    Bicego, Manuele
    Murino, Vittorio
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (02) : 257 - 267
  • [27] An audio-visual distance for audio-visual speech vector quantization
    Girin, L
    Foucher, E
    Feng, G
    [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528
  • [28] Catching audio-visual mice:: The extrapolation of audio-visual speed
    Hofbauer, MM
    Wuerger, SM
    Meyer, GF
    Röhrbein, F
    Schill, K
    Zetzsche, C
    [J]. PERCEPTION, 2003, 32 : 96 - 96
  • [29] Identification of story units in audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 363 - 367
  • [30] Educators on the internet. Development and editing of audio-visual materials for teaching
    Sotomayor Baca, Angelica
    [J]. EDUCATIO SIGLO XXI, 2012, 30 (02): : 473 - 476