Deriving conversation-based features from unlabeled speech for discriminative language modeling

被引:0
|
作者
Karakos, D.
Roark, B.
Shafran, I.
Sagae, K.
Lehr, M.
Prud'hommeaux, E.
Xu, P.
Glenn, N.
Khudanpur, S.
Saraclar, M.
Bikel, D.
Dredze, M.
Callison-Burch, C.
Cao, Y.
Hall, K.
Hasler, E.
Koehn, P.
Lopez, A.
Post, M.
Riley, D.
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perceptron algorithm was used in [1] to estimate discriminative language models which correct errors in the output of ASR systems. In its simplest version, the algorithm simply increases the weight of n-gram features which appear in the correct (oracle) hypothesis and decreases the weight of n-gram features which appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be successfully used in a semi-supervised learning (SSL) framework, where limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2] in the sense that a graph is built based on proximity of unlabeled conversations, and then it is used to propagate confidences (in the form of features) to the labeled data, based on which perceptron trains a discriminative model. The novelty of our approach lies in the fact that the confidence "flows" from the unlabeled data to the labeled data, and not vice-versa, as is done traditionally in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eva104f demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [31] Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
    Zhang, Kexin
    Liu, Yunxiang
    IEEE ACCESS, 2023, 11 : 56336 - 56343
  • [32] Speech recognition system in high noise background based on discriminative learning of environmental features
    Lu, Cheng-Guo
    Han, Ji-Qing
    Wang, Cheng-Fa
    Zhang, Lei
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2003, 35 (02): : 134 - 137
  • [33] Improving Minority Language Speech Recognition Based on Distinctive Features
    Fu, Tong
    Gao, Shaojun
    Wu, Xihong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 411 - 420
  • [34] From communication disorders research to conversation-based interventions for adults with aphasia: an online resource for clinicians and people with aphasia
    Beckley, F.
    Sirman, N.
    Little, L.
    Mahon, M.
    Maxim, J.
    Edwards, S.
    Best, W.
    Johnson, F.
    Newton, C.
    Beeke, S.
    INTERNATIONAL JOURNAL OF STROKE, 2012, 7 : 26 - 26
  • [35] LEARNING DISCRIMINATIVE FEATURES FROM SPECTROGRAMS USING CENTER LOSS FOR SPEECH EMOTION RECOGNITION
    Dai, Dongyang
    Wu, Zhiyong
    Li, Runnan
    Wu, Xixin
    Jia, Jia
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7405 - 7409
  • [36] Features extraction, modeling and training strategies in continuous speech recognition for Romanian language
    Dumitru, CO
    Gavat, I
    Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 1425 - 1428
  • [37] Designing and evaluating interaction as conversation: A modeling language based on semiotic engineering
    Barbosa, SDJ
    de Paula, MG
    INTERACTIVE SYSTEMS: DESIGN, SPECIFICATION, AND VERIFICATION, 2003, 2844 : 16 - 33
  • [38] Language Identification From Speech Features Using SVM and LDA
    Anjana, J. S.
    Poorna, S. S.
    2018 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2018,
  • [39] MT-BASED ARTIFICIAL HYPOTHESIS GENERATION FOR UNSUPERVISED DISCRIMINATIVE LANGUAGE MODELING
    Dikici, Erinc
    Saraclar, Murat
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1401 - 1405
  • [40] Research on Language Evolution and Language Diversity Based on Chinese Speech Pitch Deviation Features
    Lang J.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)