Neural-Network Lexical Translation for Cross-lingual IR from Text and Speech

被引：12

作者：

Zbib, Rabih ^{[1
]}

Zhao, Lingjun ^{[1
]}

Karakos, Damianos ^{[1
]}

Hartmann, William ^{[1
]}

DeYoung, Jay ^{[1
,2
]}

Huang, Zhongqiang ^{[1
,3
]}

Jiang, Zhuolin ^{[1
]}

Rivkin, Noah ^{[4
]}

Zhang, Le ^{[1
]}

Schwartz, Richard ^{[1
]}

Makhoul, John ^{[1
]}

机构：

[1] Raytheon BBN Technol, Cambridge, MA 02138 USA

[2] Northeastern Univ, Boston, MA 02115 USA

[3] Alibaba Technol, Hangzhou, Zhejiang, Peoples R China

[4] Franklin W Olin Coll Engn, Newton, MA USA

来源：

PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年

关键词：

Cross-lingual information retrieval; speech recognition; machine translation; probabilistic modeling; neural networks;

D O I：

10.1145/3331184.3331222

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a neural network model to estimate word translation probabilities for Cross-Lingual Information Retrieval (CLIR). The model estimates better probabilities for word translations than automatic word alignments alone, and generalizes to unseen source-target word pairs. We further improve the lexical neural translation model (and subsequently CLIR), by incorporating source word context, and by encoding the character sequences of input source words to generate translations of out-of-vocabulary words. To be effective, neural network models typically need training on large amounts of data labeled directly on the final task, in this case relevance to queries. In contrast, our approach only requires parallel data to train the translation model, and uses an unsupervised model to compute CLIR relevance scores. We report results on the retrieval of text and speech documents from three morphologically complex languages with limited training data resources (Swahili, Tagalog, and Somali) and short English queries. Despite training on only about 2M words of parallel training data for each language, we obtain neural network translation models that are very effective for this task. We also obtain further improvements using (i) a modified relevance model, which uses the probability of occurrence of a translation of each query term in the source document, and (ii) confusion networks (instead of 1-best output) that encode multiple transcription alternatives in the output of an Automatic Speech Recognition (ASR) system. We achieve overall MAP relative improvements of up to 24% on Swahili, 50% on Tagalog, and 39% on Somali over the baseline probabilistic model, and larger improvements over monolingual retrieval from machine translation output.

引用

页码：645 / 654

页数：10

共 50 条

[1] Exploring Neural Translation Models for Cross-Lingual Text Similarity
Seki, Kazuhiro
[J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1591 - 1594
[2] Cross-lingual text similarity exploiting neural machine translation models
Seki, Kazuhiro
[J]. JOURNAL OF INFORMATION SCIENCE, 2021, 47 (03) : 404 - 418
[3] Cross-lingual Dialog Model for Speech to Speech Translation
Ettelaie, Emil
Georgiou, Panayiotis G.
Narayanan, Shrikanth
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1173 - 1176
[4] Cross-lingual Text Classification with Heterogeneous Graph Neural Network
Wang, Ziyun
Liu, Xuan
Yang, Peiji
Liu, Shixing
Wang, Zhisheng
[J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 612 - 620
[5] Cross-Lingual Speech-to-Text Summarization
Pontes, Elvys Linhares
Gonzalez-Gallardo, Carlos-Emiliano
Torres-Moreno, Juan-Manuel
Huet, Stephane
[J]. MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 385 - 395
[6] Cross-Lingual Text Classification with Model Translation and Document Translation
Moh, Teng-Sheng
Zhang, Zhang
[J]. PROCEEDINGS OF THE 50TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE, 2012,
[7] Text-To-Speech with cross-lingual Neural Network-based grapheme-to-phoneme models
Gonzalvo, Xavi
Podsiadlo, Monika
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 765 - 769
[8] Cross-Lingual Neural Network Speech Synthesis Based on Multiple Embeddings
Nosek, Tijana, V
Suzic, Sinisa B.
Pekar, Darko J.
Obradovic, Radovan J.
Secujski, Milan S.
Delic, Vlado D.
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 110 - 120
[9] Cross-Lingual Korean Speech-to-Text Summarization
Yoon, HyoJeon
Dinh Tuyen Hoang
Ngoc Thanh Nguyen
Hwang, Dosam
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 198 - 206
[10] A Study of Neural Matching Models for Cross-lingual IR
Yu, Puxuan
Allan, James
[J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1637 - 1640

← 1 2 3 4 5 →