IMPROVING MULTIPLE-CROWD-SOURCED TRANSCRIPTIONS USING A SPEECH RECOGNISER

被引:0
|
作者
van Dalen, R. C. [1 ]
Knill, K. M. [1 ]
Tsiakoulis, P. [1 ]
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England
关键词
Automatic speech recognition; crowd-sourcing; transcription combination;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a method to produce high-quality transcriptions of speech data from only two crowd-sourced transcriptions. These transcriptions, produced cheaply by people on the Internet, for example through Amazon Mechanical Turk, are often of low quality. Often, multiple crowd-sourced transcriptions are combined to form one transcription of higher quality. However, the state of the art is to use essentially a form of majority voting, which requires at least three transcriptions for each utterance. This paper shows how to refine this approach to work with only two transcriptions. It then introduces a method that uses a speech recogniser (bootstrapped on a simple combination scheme) to combine transcriptions. When only two crowd-sourced transcriptions are available, on a noisy data set this improves the word error rate to gold-standard transcriptions by 21% relative.
引用
下载
收藏
页码:4709 / 4713
页数:5
相关论文
共 50 条
  • [41] Decision Support System for Agriculture Industry using Crowd Sourced Predictive Analytics
    Remya, S.
    Sasikala, R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 310 - 318
  • [42] CSTD-Telugu Corpus: Crowd-Sourced Approach for Large-Scale Speech data collection
    Mirishkar, Ganesh S.
    Raju, Vishnu Vidyadhara V.
    Naroju, Meher Dinesh
    Maity, Sudhamay
    Yalla, Prakash
    Vuppala, Anil Kumar
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 511 - 517
  • [43] Improving crowd labeling using Stackelberg models
    Wenjun Yang
    Chaoqun Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1825 - 1838
  • [44] Improving crowd labeling using Stackelberg models
    Yang, Wenjun
    Li, Chaoqun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1825 - 1838
  • [45] Worker Selection in Crowd-sourced Platforms using Non-dominated Sorting
    Mishra, Sumit
    Yadav, Akash
    Sairam, Ashok Singh
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 41 - 45
  • [46] Visual vehicle tracking through noise and occlusions using crowd-sourced maps
    Suraj, M. S.
    Grimmett, Hugo
    Platinsky, Lukas
    Ondruska, Peter
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 4531 - 4538
  • [47] Film Ties: Crowd-sourced Teaching of Cinematography Using Intelligent Example Galleries
    Bares, William
    Schwartz, Donald
    PROCEEDINGS OF THE 19TH ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING COMPANION, 2016, : 229 - 232
  • [48] Comparing Generalisation Using Crowd-Sourced vs Expert Labels for Galaxies Classification
    Variawa, M. Z.
    van Zyl, T. L.
    Woolway, M.
    2020 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2020), 2020, : 158 - 162
  • [49] Augmenting Crowd-Sourced 3D Reconstructions using Semantic Detections
    Price, True
    Schonberger, Johannes L.
    Wei, Zhen
    Pollefeys, Marc
    Frahm, Jan-Michael
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1926 - 1935
  • [50] Forecasting Mobile Transmission Reliability Using Crowd-Sourced Cellular Coverage Data
    Martin, Michelle C.
    Kwan, Anne M.
    Forte, Eric J.
    Zhang, Stan F.
    Patek, Stephen D.
    2014 SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (SIEDS), 2014,