Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing

被引:3
|
作者
Brannon, William [1 ]
Virkar, Yogesh [2 ]
Thompson, Brian [2 ]
机构
[1] MIT Media Lab, Cambridge, MA USA
[2] AWS AI Labs, Seattle, WA 98101 USA
关键词
TRANSLATION;
D O I
10.1162/tacl_a_00551
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate how humans perform the task of dubbing video content from one language into another, leveraging a novel corpus of 319.57 hours of video from 54 professionally produced titles. This is the first such large-scale study we are aware of. The results challenge a number of assumptions commonly made in both qualitative literature on human dubbing and machine-learning literature on automatic dubbing, arguing for the importance of vocal naturalness and translation quality over commonly emphasized isometric (character length) and lip-sync constraints, and for a more qualified view of the importance of isochronic (timing) constraints. We also find substantial influence of the source-side audio on human dubs through channels other than the words of the translation, pointing to the need for research on ways to preserve speech characteristics, as well as transfer of semantic properties such as emphasis and emotion, in automatic dubbing systems.
引用
收藏
页码:419 / 435
页数:17
相关论文
共 50 条
  • [41] Population coverage and nonresponse bias in a large-scale human exposure study
    Paul Mosquin
    Roy Whitmore
    Cindy Suerken
    Jim Quackenboss
    Journal of Exposure Science & Environmental Epidemiology, 2005, 15 : 431 - 438
  • [42] Population coverage and nonresponse bias in a large-scale human exposure study
    Mosquin, P
    Whitmore, R
    Cindy, SB
    Quackenboss, J
    JOURNAL OF EXPOSURE ANALYSIS AND ENVIRONMENTAL EPIDEMIOLOGY, 2005, 15 (05): : 431 - 438
  • [43] THE USE OF XIPAMIDE IN ANTIHYPERTENSIVE THERAPY IN GENERAL-PRACTICE - A LARGE-SCALE MULTICENTER STUDY
    BAEHRE, M
    GRIMSHAW, JJ
    CURRENT THERAPEUTIC RESEARCH-CLINICAL AND EXPERIMENTAL, 1988, 44 (05): : 737 - 743
  • [44] Comparative study of rock support system design practice for large-scale underground excavations
    Cai, M
    Kaiser, PK
    Uno, H
    Tasaka, Y
    PACIFIC ROCKS 2000: ROCK AROUND THE RIM, 2000, : 1027 - 1034
  • [45] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Li, Wei
    Li, Qingan
    Ming, Yunlong
    Dai, Weijiao
    Ying, Shi
    Yuan, Mengting
    EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (02)
  • [46] An empirical study of the effectiveness of IR-based bug localization for large-scale industrial projects
    Wei Li
    Qingan Li
    Yunlong Ming
    Weijiao Dai
    Shi Ying
    Mengting Yuan
    Empirical Software Engineering, 2022, 27
  • [47] Reporting in large-scale agile organizations: insights and recommendations from a case study in software development
    Schuell, Moritz
    Hofmann, Peter
    Philipp, Pascal
    Urbach, Nils
    INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2023, 21 (03) : 571 - 601
  • [48] Predictors of Literacy Development in Adulthood: Insights from a Large-scale, Two-wave Study
    Wicht, Alexandra
    Rammstedt, Beatrice
    Lechner, Clemens M.
    SCIENTIFIC STUDIES OF READING, 2021, 25 (01) : 84 - 92
  • [49] Reporting in large-scale agile organizations: insights and recommendations from a case study in software development
    Moritz Schüll
    Peter Hofmann
    Pascal Philipp
    Nils Urbach
    Information Systems and e-Business Management, 2023, 21 : 571 - 601
  • [50] Cognitive Processing Speed and Loneliness in Stroke Survivors: Insights from a Large-Scale Cohort Study
    Byrne, Christopher
    Coetzer, Rudi
    Ramsey, Richard
    ARCHIVES OF CLINICAL NEUROPSYCHOLOGY, 2024,