AVscript: Accessible Video Editing with Audio-Visual Scripts

被引：8

作者：

Huh, Mina ^{[1
]}

Yang, Saelyne ^{[2
]}

Peng, Yi-Hao ^{[3
]}

Chen, Xiang 'Anthony' ^{[4
]}

Kim, Young-Ho ^{[5
]}

Pavel, Amy ^{[1
]}

机构：

[1] Univ Texas Austin, Austin, TX 78712 USA

[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[3] Carnegie Mellon Univ, Pittsburgh, PA USA

[4] Univ Calif Los Angeles, Los Angeles, CA USA

[5] NAVER AI Lab, Bundangdong, South Korea

来源：

PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023) | 2023年

关键词：

video; authoring tools; accessibility;

D O I：

10.1145/3544548.3581494

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Sighted and blind and low vision (BLV) creators alike use videos to communicate with broad audiences. Yet, video editing remains inaccessible to BLV creators. Our formative study revealed that current video editing tools make it difficult to access the visual content, assess the visual quality, and efciently navigate the timeline. We present AVscript, an accessible text-based video editor. AVscript enables users to edit their video using a script that embeds the video's visual content, visual errors (e.g., dark or blurred footage), and speech. Users can also efficiently navigate between scenes and visual errors or locate objects in the frame or spoken words of interest. A comparison study (N=12) showed that AVscript signifcantly lowered BLV creators' mental demands while increasing confidence and independence in video editing. We further demonstrate the potential of AVscript through an exploratory study (N=3) where BLV creators edited their own footage.

引用

页数：17

共 50 条

[21] Audio-Visual Glance Network for Efficient Video Recognition
Nugroho, Muhammad Adi
Woo, Sangmin
Lee, Sumin
Kim, Changick
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10116 - 10125
[22] Audio-visual speaker recognition for video broadcast news
Maison, B
Neti, C
Senior, A
[J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79
[23] Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video
Bolles, Robert
Burns, J. Brian
Graciarena, Martin
Kathol, Andreas
Lawson, Aaron
McLaren, Mitchell
Mensink, Thomas
[J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1907 - 1914
[24] Audio-visual synchrony for detection of monologues in video archives
Iyengar, G
Nock, HJ
Neti, C
[J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 329 - 332
[25] Audio-Visual Speaker Recognition for Video Broadcast News
Benoît Maison
Chalapathy Neti
Andrew Senior
[J]. Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 71 - 79
[26] Audio-visual event recognition in surveillance video sequences
Cristani, Marco
Bicego, Manuele
Murino, Vittorio
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (02) : 257 - 267
[27] An audio-visual distance for audio-visual speech vector quantization
Girin, L
Foucher, E
Feng, G
[J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528
[28] Catching audio-visual mice:: The extrapolation of audio-visual speed
Hofbauer, MM
Wuerger, SM
Meyer, GF
Röhrbein, F
Schill, K
Zetzsche, C
[J]. PERCEPTION, 2003, 32 : 96 - 96
[29] Identification of story units in audio-visual sequences by joint audio and video processing
Saraceno, C
Leonardi, R
[J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 363 - 367
[30] Educators on the internet. Development and editing of audio-visual materials for teaching
Sotomayor Baca, Angelica
[J]. EDUCATIO SIGLO XXI, 2012, 30 (02): : 473 - 476

← 1 2 3 4 5 →