Deriving conversation-based features from unlabeled speech for discriminative language modeling

Cited by: 0

Authors:

Karakos, D.

Roark, B.

Shafran, I.

Sagae, K.

Lehr, M.

Prud'hommeaux, E.

Xu, P.

Glenn, N.

Khudanpur, S.

Saraclar, M.

Bikel, D.

Dredze, M.

Callison-Burch, C.

Cao, Y.

Hall, K.

Hasler, E.

Koehn, P.

Lopez, A.

Post, M.

Riley, D.


Source:

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012


DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The perceptron algorithm was used in [1] to estimate discriminative language models that correct errors in the output of ASR systems. In its simplest version, the algorithm increases the weight of n-gram features that appear in the correct (oracle) hypothesis and decreases the weight of n-gram features that appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be used successfully in a semi-supervised learning (SSL) framework, where only limited amounts of labeled data are available. Our framework resembles graph-based label propagation [2] in that a graph is built based on the proximity of unlabeled conversations and is then used to propagate confidences (in the form of features) to the labeled data, on which the perceptron trains a discriminative model. The novelty of our approach is that the confidence "flows" from the unlabeled data to the labeled data, and not vice versa, as is traditional in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eval04f demonstrate the effectiveness of the proposed approach.
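The simplest perceptron update described in the abstract can be illustrated with a minimal Python sketch. This is an illustration, not the authors' implementation: the function names (`ngram_features`, `score`, `perceptron_update`), the n-gram order, the learning rate, and the toy hypotheses are all assumptions made here, and the sketch ignores N-best rescoring, weight averaging, and the semi-supervised conversation features that the paper adds.

```python
from collections import Counter


def ngram_features(words, max_order=2):
    """Count every n-gram of order 1..max_order in a token list."""
    feats = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(words) - n + 1):
            feats[tuple(words[i:i + n])] += 1
    return feats


def score(weights, feats):
    """Linear model score: dot product of weights and feature counts."""
    return sum(weights.get(f, 0.0) * c for f, c in feats.items())


def perceptron_update(weights, oracle, one_best, lr=1.0):
    """Raise weights of oracle n-grams and lower those of 1-best n-grams.

    No update is made when the 1-best hypothesis already equals the oracle.
    `lr` is a hypothetical learning rate, not a value from the paper.
    """
    if oracle == one_best:
        return weights
    for f, c in ngram_features(oracle).items():
        weights[f] = weights.get(f, 0.0) + lr * c
    for f, c in ngram_features(one_best).items():
        weights[f] = weights.get(f, 0.0) - lr * c
    return weights


# Toy example (invented data): the recognizer's 1-best has one wrong word.
weights = {}
oracle = "the cat sat".split()
one_best = "the cat sad".split()
perceptron_update(weights, oracle, one_best)
```

N-grams shared by both hypotheses cancel out, so only the n-grams that distinguish the oracle from the 1-best hypothesis end up with nonzero weight; after the update the oracle scores higher than the erroneous hypothesis under `score`.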


Pages: 202-205

Page count: 4
