Deriving conversation-based features from unlabeled speech for discriminative language modeling

Authors
Karakos, D.
Roark, B.
Shafran, I.
Sagae, K.
Lehr, M.
Prud'hommeaux, E.
Xu, P.
Glenn, N.
Khudanpur, S.
Saraclar, M.
Bikel, D.
Dredze, M.
Callison-Burch, C.
Cao, Y.
Hall, K.
Hasler, E.
Koehn, P.
Lopez, A.
Post, M.
Riley, D.
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The perceptron algorithm was used in [1] to estimate discriminative language models that correct errors in the output of ASR systems. In its simplest version, the algorithm increases the weight of n-gram features that appear in the correct (oracle) hypothesis and decreases the weight of n-gram features that appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be used successfully in a semi-supervised learning (SSL) framework, where only limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2]: a graph is built based on the proximity of unlabeled conversations and is then used to propagate confidences (in the form of features) to the labeled data, on which the perceptron trains a discriminative model. The novelty of our approach is that the confidence "flows" from the unlabeled data to the labeled data, and not vice versa, as is traditionally done in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eval04f demonstrate the effectiveness of the proposed approach.
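The simplest perceptron update described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`ngram_features`, `score`, `perceptron_update`), the unigram-plus-bigram feature set, the learning rate, and the example sentences are all assumptions made for illustration, and the 1-best hypothesis is taken as given rather than produced by a recognizer.

```python
from collections import Counter

def ngram_features(words, n_max=2):
    """Count every n-gram of length 1..n_max in a token list (hypothetical feature set)."""
    feats = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(words) - n + 1):
            feats[" ".join(words[i:i + n])] += 1
    return feats

def score(weights, feats):
    """Linear model score: sum of weight * count over the n-gram features."""
    return sum(weights.get(f, 0.0) * c for f, c in feats.items())

def perceptron_update(weights, oracle, one_best, lr=1.0):
    """Raise weights of n-grams in the oracle hypothesis, lower those in the 1-best."""
    if oracle == one_best:
        return  # the 1-best is already correct, so nothing changes
    for f, c in ngram_features(oracle).items():
        weights[f] = weights.get(f, 0.0) + lr * c
    for f, c in ngram_features(one_best).items():
        weights[f] = weights.get(f, 0.0) - lr * c

# Illustrative (made-up) hypotheses, not data from the paper.
weights = {}
oracle = "i want to recognize speech".split()
one_best = "i want to wreck a nice beach".split()
perceptron_update(weights, oracle, one_best)
```

After this single update, n-grams shared by both hypotheses (such as "i want") cancel to zero, n-grams found only in the oracle gain weight, and n-grams found only in the 1-best lose weight, so the oracle now scores higher than the erroneous hypothesis under `score`.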
Pages: 202-205
Page count: 4