Deriving conversation-based features from unlabeled speech for discriminative language modeling

被引：0

作者：

Karakos, D.

Roark, B.

Shafran, I.

Sagae, K.

Lehr, M.

Prud'hommeaux, E.

Xu, P.

Glenn, N.

Khudanpur, S.

Saraclar, M.

Bikel, D.

Dredze, M.

Callison-Burch, C.

Cao, Y.

Hall, K.

Hasler, E.

Koehn, P.

Lopez, A.

Post, M.

Riley, D.

机构：

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The perceptron algorithm was used in [1] to estimate discriminative language models which correct errors in the output of ASR systems. In its simplest version, the algorithm simply increases the weight of n-gram features which appear in the correct (oracle) hypothesis and decreases the weight of n-gram features which appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be successfully used in a semi-supervised learning (SSL) framework, where limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2] in the sense that a graph is built based on proximity of unlabeled conversations, and then it is used to propagate confidences (in the form of features) to the labeled data, based on which perceptron trains a discriminative model. The novelty of our approach lies in the fact that the confidence "flows" from the unlabeled data to the labeled data, and not vice-versa, as is done traditionally in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eva104f demonstrate the effectiveness of the proposed approach.

引用

页码：202 / 205

页数：4

共 50 条

[21] A Margin-based Discriminative Modeling Approach for Extractive Speech Summarization
Liu, Shih-Hung
Chen, Kuan-Yu
Chen, Berlin
Jan, Ea-Ee
Wang, Hsin-Min
Yen, Hsu-Chun
Hsu, Wen-Lian
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
[22] Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language
Narayanan, Shrikanth
Georgiou, Panayiotis G.
PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1203 - 1233
[23] Lexicon optimization based on discriminative learning for automatic speech recognition of agglutinative language
Ablimit, Mijit
Kawahara, Tatsuya
Hamdulla, Askar
SPEECH COMMUNICATION, 2014, 60 : 78 - 87
[24] MODIFICATIONS OF SPEECH LANGUAGE LEVELS BY PRESERVICE EDUCATORS BASED ON LANGUAGE FEATURES
SCHLOSS, PJ
SMITH, MA
BERNSTEIN, M
JOURNAL OF COMMUNICATION DISORDERS, 1990, 23 (02) : 89 - 96
[25] Discriminative features based on modified log magnitude spectrum for playback speech detection
Yang, Jichen
Xu, Longting
Ren, Bo
Ji, Yunyun
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)
[26] Discriminative features based on modified log magnitude spectrum for playback speech detection
Jichen Yang
Longting Xu
Bo Ren
Yunyun Ji
EURASIP Journal on Audio, Speech, and Music Processing, 2020
[27] A speech recognition algorithm based on the features of Croatian language
Peic, R
PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 613 - 618
[28] An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features
Weng, Shi-Yan
Lo, Tien-Hong
Chen, Berlin
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 316 - 320
[29] ON DERIVING LONGER FINGERPRINTS FROM FEATURES BASED ON PROJECTIONS
Radhakrishnan, Regunathan
Bauer, Claus
Jiang, Wenyu
2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1359 - 1363
[30] Impact of web based language modeling on speech understanding
Sarikaya, R
Kuo, HKJ
Gao, YQ
2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 268 - 271

← 1 2 3 4 5 →