AVscript: Accessible Video Editing with Audio-Visual Scripts

Cited by: 8

Authors:

Huh, Mina [1]

Yang, Saelyne [2]

Peng, Yi-Hao [3]

Chen, Xiang 'Anthony' [4]

Kim, Young-Ho [5]

Pavel, Amy [1]

Affiliations:

[1] Univ Texas Austin, Austin, TX 78712 USA

[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[3] Carnegie Mellon Univ, Pittsburgh, PA USA

[4] Univ Calif Los Angeles, Los Angeles, CA USA

[5] NAVER AI Lab, Bundangdong, South Korea

Source:

PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023) | 2023

Keywords:

video; authoring tools; accessibility;

DOI:

10.1145/3544548.3581494

Chinese Library Classification (CLC):

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Sighted and blind and low vision (BLV) creators alike use videos to communicate with broad audiences. Yet, video editing remains inaccessible to BLV creators. Our formative study revealed that current video editing tools make it difficult to access the visual content, assess the visual quality, and efficiently navigate the timeline. We present AVscript, an accessible text-based video editor. AVscript enables users to edit their video using a script that embeds the video's visual content, visual errors (e.g., dark or blurred footage), and speech. Users can also efficiently navigate between scenes and visual errors or locate objects in the frame or spoken words of interest. A comparison study (N=12) showed that AVscript significantly lowered BLV creators' mental demands while increasing confidence and independence in video editing. We further demonstrate the potential of AVscript through an exploratory study (N=3) where BLV creators edited their own footage.


Pages: 17
