Deriving conversation-based features from unlabeled speech for discriminative language modeling

被引：0

作者：

Karakos, D.

Roark, B.

Shafran, I.

Sagae, K.

Lehr, M.

Prud'hommeaux, E.

Xu, P.

Glenn, N.

Khudanpur, S.

Saraclar, M.

Bikel, D.

Dredze, M.

Callison-Burch, C.

Cao, Y.

Hall, K.

Hasler, E.

Koehn, P.

Lopez, A.

Post, M.

Riley, D.

机构：

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The perceptron algorithm was used in [1] to estimate discriminative language models which correct errors in the output of ASR systems. In its simplest version, the algorithm simply increases the weight of n-gram features which appear in the correct (oracle) hypothesis and decreases the weight of n-gram features which appear in the 1-best hypothesis. In this paper, we show that the perceptron algorithm can be successfully used in a semi-supervised learning (SSL) framework, where limited amounts of labeled data are available. Our framework has some similarities to graph-based label propagation [2] in the sense that a graph is built based on proximity of unlabeled conversations, and then it is used to propagate confidences (in the form of features) to the labeled data, based on which perceptron trains a discriminative model. The novelty of our approach lies in the fact that the confidence "flows" from the unlabeled data to the labeled data, and not vice-versa, as is done traditionally in SSL. Experiments conducted at the 2011 CLSP Summer Workshop on the conversational telephone speech corpora Dev04f and Eva104f demonstrate the effectiveness of the proposed approach.

引用

页码：202 / 205

页数：4

共 50 条

[31] Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
Zhang, Kexin
Liu, Yunxiang
IEEE ACCESS, 2023, 11 : 56336 - 56343
[32] Speech recognition system in high noise background based on discriminative learning of environmental features
Lu, Cheng-Guo
Han, Ji-Qing
Wang, Cheng-Fa
Zhang, Lei
Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2003, 35 (02): : 134 - 137
[33] Improving Minority Language Speech Recognition Based on Distinctive Features
Fu, Tong
Gao, Shaojun
Wu, Xihong
INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 411 - 420
[34] From communication disorders research to conversation-based interventions for adults with aphasia: an online resource for clinicians and people with aphasia
Beckley, F.
Sirman, N.
Little, L.
Mahon, M.
Maxim, J.
Edwards, S.
Best, W.
Johnson, F.
Newton, C.
Beeke, S.
INTERNATIONAL JOURNAL OF STROKE, 2012, 7 : 26 - 26
[35] LEARNING DISCRIMINATIVE FEATURES FROM SPECTROGRAMS USING CENTER LOSS FOR SPEECH EMOTION RECOGNITION
Dai, Dongyang
Wu, Zhiyong
Li, Runnan
Wu, Xixin
Jia, Jia
Meng, Helen
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7405 - 7409
[36] Features extraction, modeling and training strategies in continuous speech recognition for Romanian language
Dumitru, CO
Gavat, I
Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 1425 - 1428
[37] Designing and evaluating interaction as conversation: A modeling language based on semiotic engineering
Barbosa, SDJ
de Paula, MG
INTERACTIVE SYSTEMS: DESIGN, SPECIFICATION, AND VERIFICATION, 2003, 2844 : 16 - 33
[38] Language Identification From Speech Features Using SVM and LDA
Anjana, J. S.
Poorna, S. S.
2018 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2018,
[39] MT-BASED ARTIFICIAL HYPOTHESIS GENERATION FOR UNSUPERVISED DISCRIMINATIVE LANGUAGE MODELING
Dikici, Erinc
Saraclar, Murat
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1401 - 1405
[40] Research on Language Evolution and Language Diversity Based on Chinese Speech Pitch Deviation Features
Lang J.
Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)

← 1 2 3 4 5 →