Deriving conversation-based features from unlabeled speech for discriminative language modeling

被引:0
|
作者
Karakos, D.
Roark, B.
Shafran, I.
Sagae, K.
Lehr, M.
Prud'hommeaux, E.
Xu, P.
Glenn, N.
Khudanpur, S.
Saraclar, M.
Bikel, D.
Dredze, M.
Callison-Burch, C.
Cao, Y.
Hall, K.
Hasler, E.
Koehn, P.
Lopez, A.
Post, M.
Riley, D.
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perceptron algorithm was used in [1] to estimate discriminative language models which correct errors in the output of ASR systems. In its simplest version, the algorithm simply increases the weight of n-gram features which appear in the correct (oracle) hypothesis and decreases the weight of n-gram features which appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be successfully used in a semi-supervised learning (SSL) framework, where limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2] in the sense that a graph is built based on proximity of unlabeled conversations, and then it is used to propagate confidences (in the form of features) to the labeled data, based on which perceptron trains a discriminative model. The novelty of our approach lies in the fact that the confidence "flows" from the unlabeled data to the labeled data, and not vice-versa, as is done traditionally in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eva104f demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [1] Learner Modeling in Conversation-Based Assessment
    Zapata-Rivera, Diego
    Forsyth, Carol M.
    ADAPTIVE INSTRUCTIONAL SYSTEMS, AIS 2022, 2022, 13332 : 73 - 83
  • [2] Conversation-based natural language interface to relational databases
    Owda, Majdi
    Bandar, Zuhair
    Crockett, Keeley
    PROCEEDING OF THE 2007 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS, 2007, : 363 - 367
  • [3] Can Conversation-Based Intervention Using Speech-Generating Devices Improve Language in Children With Partially Intelligible Speech?
    Luckins, Jessie M.
    Clarke, Michael T.
    COMMUNICATION DISORDERS QUARTERLY, 2021, 42 (03) : 131 - 144
  • [4] DISCRIMINATIVE LANGUAGE MODELING FOR SPEECH RECOGNITION WITH RELEVANCE INFORMATION
    Chen, Berlin
    Liu, Jia-Wen
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [5] A Decade of Discriminative Language Modeling for Automatic Speech Recognition
    Saraclar, Murat
    Dikici, Erinc
    Arisoy, Ebru
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 11 - 22
  • [6] SOCIAL VALENCE IN CHILDREN WITH SPECIFIC LANGUAGE IMPAIRMENT DURING IMITATION-BASED AND CONVERSATION-BASED LANGUAGE INTERVENTION
    HALEY, KL
    CAMARATA, SM
    NELSON, KE
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1994, 37 (02): : 378 - 388
  • [7] Empowering Personalized Learning through a Conversation-based Tutoring System with Student Modeling
    Park, Minju
    Kim, Sojung
    Lee, Seunghyun
    Kwon, Soonwoo
    Kim, Kyuseok
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [8] Simulated Dialogues with Virtual Agents: Effects of Agent Features in Conversation-Based Assessments
    Sparks, Jesse R.
    Zapata-Rivera, Diego
    Lehman, Blair
    James, Kofi
    Steinberg, Jonathan
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT II, 2018, 10948 : 469 - 474
  • [9] Discriminative Language Modeling With Linguistic and Statistically Derived Features
    Arisoy, Ebru
    Saraclar, Murat
    Roark, Brian
    Shafran, Izhak
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 540 - 550
  • [10] Discriminative auditory-based features for robust speech recognition
    Mak, BKW
    Tam, YC
    Li, PQ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 27 - 36