AVscript: Accessible Video Editing with Audio-Visual Scripts

被引:8
|
作者
Huh, Mina [1 ]
Yang, Saelyne [2 ]
Peng, Yi-Hao [3 ]
Chen, Xiang 'Anthony' [4 ]
Kim, Young-Ho [5 ]
Pavel, Amy [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[3] Carnegie Mellon Univ, Pittsburgh, PA USA
[4] Univ Calif Los Angeles, Los Angeles, CA USA
[5] NAVER AI Lab, Bundangdong, South Korea
关键词
video; authoring tools; accessibility;
D O I
10.1145/3544548.3581494
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sighted and blind and low vision (BLV) creators alike use videos to communicate with broad audiences. Yet, video editing remains inaccessible to BLV creators. Our formative study revealed that current video editing tools make it difficult to access the visual content, assess the visual quality, and efciently navigate the timeline. We present AVscript, an accessible text-based video editor. AVscript enables users to edit their video using a script that embeds the video's visual content, visual errors (e.g., dark or blurred footage), and speech. Users can also efficiently navigate between scenes and visual errors or locate objects in the frame or spoken words of interest. A comparison study (N=12) showed that AVscript signifcantly lowered BLV creators' mental demands while increasing confidence and independence in video editing. We further demonstrate the potential of AVscript through an exploratory study (N=3) where BLV creators edited their own footage.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] AUDIO-VISUAL PERCEPTION OF OMNIDIRECTIONAL VIDEO FOR VIRTUAL REALITY APPLICATIONS
    Chao, Fang-Yi
    Ozcinar, Cagri
    Wang, Chen
    Zerman, Emin
    Zhang, Lu
    Hamidouche, Wassim
    Deforges, Olivier
    Smolic, Aljosa
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [42] Discovering joint audio-visual codewords for video event detection
    Jhuo, I-Hong
    Ye, Guangnan
    Gao, Shenghua
    Liu, Dong
    Jiang, Yu-Gang
    Lee, D. T.
    Chang, Shih-Fu
    [J]. MACHINE VISION AND APPLICATIONS, 2014, 25 (01) : 33 - 47
  • [43] Efficient video coding based on audio-visual focus of attention
    Lee, Jong-Seok
    De Simone, Francesca
    Ebrahimi, Touradj
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2011, 22 (08) : 704 - 711
  • [44] Audio-visual large-scale video copy detection
    Liu, Yang
    Xu, Changsheng
    Lu, Hanqing
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3803 - 3816
  • [45] VidQ: Video Query Using Optimized Audio-Visual Processing
    Felemban, Noor
    Mehmeti, Fidan
    Porta, Thomas F.
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (03) : 1338 - 1352
  • [46] Speaker dependent video indexing based on audio-visual interaction
    Tsekeridou, S
    Pitas, I
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 358 - 362
  • [47] Attention-Based Audio-Visual Fusion for Video Summarization
    Fang, Yinghong
    Zhang, Junpeng
    Lu, Cewu
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
  • [48] Audio-visual patient leaflets: making information accessible to women with limited English
    Harrison, R.
    Mckenzie, C.
    [J]. BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2021, 128 : 221 - 221
  • [49] AUDIO-VISUAL EDUCATION
    Brickman, William W.
    [J]. SCHOOL AND SOCIETY, 1948, 67 (1739): : 320 - 326
  • [50] Audio-Visual Objects
    Kubovy M.
    Schutz M.
    [J]. Review of Philosophy and Psychology, 2010, 1 (1) : 41 - 61