IMPROVING MULTIPLE-CROWD-SOURCED TRANSCRIPTIONS USING A SPEECH RECOGNISER

Times cited: 0
Authors
van Dalen, R. C. [1]
Knill, K. M. [1]
Tsiakoulis, P. [1]
Gales, M. J. F. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England
Keywords
Automatic speech recognition; crowd-sourcing; transcription combination
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper introduces a method to produce high-quality transcriptions of speech data from only two crowd-sourced transcriptions. These transcriptions, produced cheaply by people on the Internet, for example through Amazon Mechanical Turk, are often of low quality. Often, multiple crowd-sourced transcriptions are combined to form one transcription of higher quality. However, the state of the art is to use essentially a form of majority voting, which requires at least three transcriptions for each utterance. This paper shows how to refine this approach to work with only two transcriptions. It then introduces a method that uses a speech recogniser (bootstrapped on a simple combination scheme) to combine transcriptions. When only two crowd-sourced transcriptions are available, on a noisy data set this improves the word error rate to gold-standard transcriptions by 21% relative.
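The abstract reports its result as a relative improvement in word error rate (WER) against gold-standard transcriptions. As a minimal sketch of how such a figure is typically computed, assuming the standard word-level edit-distance definition of WER (the record does not describe the paper's scoring setup), the following may help. The function names and the example numbers are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline: float, improved: float) -> float:
    """Relative improvement of `improved` over `baseline` (both error rates)."""
    return (baseline - improved) / baseline


# Illustrative values only (not results from the paper).
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
gain = relative_reduction(0.20, 0.158)
```

Under this definition, a drop in WER from a hypothetical 20.0% to 15.8% would count as a 21% relative improvement, even though the absolute change is only 4.2 percentage points.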
Pages: 4709-4713
Page count: 5