MLP emulation of N-gram models as a first step to connectionist language modeling

Cited: 0
|
Authors
Castro, MJ [1 ]
Prat, F [1 ]
Casacuberta, F [1 ]
Institution
[1] Univ Politecn Valencia, Dept Sistemes Informat & Computacio, Valencia, Spain
Keywords
DOI
Not available
CLC number (Chinese Library Classification)
TP18 [Artificial intelligence theory];
Discipline codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In problems such as automatic speech recognition and machine translation, where the system response must be a sentence in a given language, language models are employed to improve system performance. These language models are usually N-gram models (for instance, bigram or trigram models), which are estimated from large text databases using the occurrence frequencies of these N-grams. In 1989, Nakamura and Shikano empirically showed how multilayer perceptrons can emulate the predictive capabilities of trigram models while offering additional generalization. Our paper discusses Nakamura and Shikano's work, provides new empirical evidence on the ability of multilayer perceptrons to emulate N-gram models, and proposes new directions for extending neural network-based language models. The experimental work presented here compares connectionist phonological bigram models with a conventional one using several measures, including recognition performance in a Spanish acoustic-phonetic decoding task.
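As background to the N-gram estimation the abstract describes, the sketch below shows a conventional bigram model estimated by relative frequency of occurrence counts. This is an illustrative example only, not the authors' code or data; the sentence-boundary markers `<s>`/`</s>` are a common convention assumed here.

```python
from collections import Counter

def bigram_probs(sentences):
    """Estimate bigram probabilities P(w2 | w1) by relative frequency.

    `sentences` is a list of token lists; <s> and </s> mark sentence
    boundaries (an assumed convention, not from the paper).
    """
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        tokens = ["<s>"] + list(sent) + ["</s>"]
        # Count each history word w1 and each adjacent pair (w1, w2).
        unigrams.update(tokens[:-1])
        bigrams.update(zip(tokens[:-1], tokens[1:]))
    # P(w2 | w1) = count(w1, w2) / count(w1)
    return {pair: c / unigrams[pair[0]] for pair, c in bigrams.items()}

# Toy corpus: "the" is followed by "cat" and "dog" once each.
probs = bigram_probs([["the", "cat"], ["the", "dog"]])
# probs[("the", "cat")] == 0.5; probs[("<s>", "the")] == 1.0
```

A multilayer perceptron trained to predict the next symbol from the previous N-1 symbols approximates these same conditional probabilities at its outputs, which is the emulation the paper studies.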
Pages: 910 - 915
Page count: 6
Related papers
50 results
  • [1] Modeling actions of PubMed users with n-gram language models
    Lin, Jimmy
    Wilbur, W. John
    INFORMATION RETRIEVAL, 2009, 12 (04): : 487 - 503
  • [3] On compressing n-gram language models
    Hirsimaki, Teemu
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 949 - 952
  • [4] Discriminative n-gram language modeling
    Roark, Brian
    Saraclar, Murat
    Collins, Michael
    COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02): : 373 - 392
  • [5] MIXTURE OF MIXTURE N-GRAM LANGUAGE MODELS
    Sak, Hasim
    Allauzen, Cyril
    Nakajima, Kaisuke
    Beaufays, Francoise
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 31 - 36
  • [6] Perplexity of n-Gram and Dependency Language Models
    Popel, Martin
    Marecek, David
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 173 - 180
  • [7] Discriminative N-gram Language Modeling for Turkish
    Arisoy, Ebru
    Roark, Brian
    Shafran, Izhak
    Saraclar, Murat
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 825 - +
  • [8] Bayesian learning of n-gram statistical language modeling
    Bai, Shuanhu
    Li, Haizhou
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1045 - 1048
  • [9] Profile based compression of n-gram language models
    Olsen, Jesper
    Oria, Daniela
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1041 - 1044
  • [10] Improved N-gram Phonotactic Models For Language Recognition
    BenZeghiba, Mohamed Faouzi
    Gauvain, Jean-Luc
    Lamel, Lori
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2718 - 2721