Deriving conversation-based features from unlabeled speech for discriminative language modeling

被引:0
|
作者
Karakos, D.
Roark, B.
Shafran, I.
Sagae, K.
Lehr, M.
Prud'hommeaux, E.
Xu, P.
Glenn, N.
Khudanpur, S.
Saraclar, M.
Bikel, D.
Dredze, M.
Callison-Burch, C.
Cao, Y.
Hall, K.
Hasler, E.
Koehn, P.
Lopez, A.
Post, M.
Riley, D.
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perceptron algorithm was used in [1] to estimate discriminative language models which correct errors in the output of ASR systems. In its simplest version, the algorithm simply increases the weight of n-gram features which appear in the correct (oracle) hypothesis and decreases the weight of n-gram features which appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be successfully used in a semi-supervised learning (SSL) framework, where limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2] in the sense that a graph is built based on proximity of unlabeled conversations, and then it is used to propagate confidences (in the form of features) to the labeled data, based on which perceptron trains a discriminative model. The novelty of our approach lies in the fact that the confidence "flows" from the unlabeled data to the labeled data, and not vice-versa, as is done traditionally in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eva104f demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [21] A Margin-based Discriminative Modeling Approach for Extractive Speech Summarization
    Liu, Shih-Hung
    Chen, Kuan-Yu
    Chen, Berlin
    Jan, Ea-Ee
    Wang, Hsin-Min
    Yen, Hsu-Chun
    Hsu, Wen-Lian
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [22] Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language
    Narayanan, Shrikanth
    Georgiou, Panayiotis G.
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1203 - 1233
  • [23] Lexicon optimization based on discriminative learning for automatic speech recognition of agglutinative language
    Ablimit, Mijit
    Kawahara, Tatsuya
    Hamdulla, Askar
    SPEECH COMMUNICATION, 2014, 60 : 78 - 87
  • [24] MODIFICATIONS OF SPEECH LANGUAGE LEVELS BY PRESERVICE EDUCATORS BASED ON LANGUAGE FEATURES
    SCHLOSS, PJ
    SMITH, MA
    BERNSTEIN, M
    JOURNAL OF COMMUNICATION DISORDERS, 1990, 23 (02) : 89 - 96
  • [25] Discriminative features based on modified log magnitude spectrum for playback speech detection
    Yang, Jichen
    Xu, Longting
    Ren, Bo
    Ji, Yunyun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)
  • [26] Discriminative features based on modified log magnitude spectrum for playback speech detection
    Jichen Yang
    Longting Xu
    Bo Ren
    Yunyun Ji
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [27] A speech recognition algorithm based on the features of Croatian language
    Peic, R
    PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 613 - 618
  • [28] An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features
    Weng, Shi-Yan
    Lo, Tien-Hong
    Chen, Berlin
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 316 - 320
  • [29] ON DERIVING LONGER FINGERPRINTS FROM FEATURES BASED ON PROJECTIONS
    Radhakrishnan, Regunathan
    Bauer, Claus
    Jiang, Wenyu
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1359 - 1363
  • [30] Impact of web based language modeling on speech understanding
    Sarikaya, R
    Kuo, HKJ
    Gao, YQ
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 268 - 271