I-VECTOR BASED LANGUAGE MODELING FOR SPOKEN DOCUMENT RETRIEVAL

被引：0

作者：

Chen, Kuan-Yu ^{[1
]}

Lee, Hung-Shin ^{[1
]}

Wang, Hsin-Min ^{[1
]}

Chen, Berlin

Chen, Hsin-Hsi

机构：

[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Spoken document retrieval; i-vector; language modeling; inductive; transductive; SPEAKER; MATRIX;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to reduce a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, and then numbers of well-developed post-processing techniques (such as probabilistic linear discriminative analysis, PLDA) can be readily and effectively used. However, to our best knowledge, there is no research up to date on applying the i-vector framework for SDR or information retrieval (IR). In this paper, we make a step forward to formulate an i-vector based language modeling (IVLM) framework for SDR. Furthermore, we evaluate the proposed IVLM framework with both inductive and transductive learning strategies. We also exploit multi-levels of index features, including word-and subword-level units, in concert with the proposed framework. The results of SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the performance merits of our proposed framework when compared to several existing approaches.

引用

页数：5

共 50 条

[1] I-VECTOR BASED LANGUAGE MODELING FOR QUERY REPRESENTATION
Chen, Kuan-Yu
Wang, Hsin-Min
Chen, Berlin
Chen, Hsin-His
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5211 - 5215
[2] ESSENCE VECTOR-BASED QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
Chen, Kuan-Yu
Liu, Shih-Hung
Chen, Berlin
Wang, Hsin-Min
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6274 - 6278
[3] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
Yen, Li-Phen
Wu, Zhen-Yu
Chen, Kuan-Yu
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
[4] Supervised I-vector modeling for language and accent recognition
Ramoji, Shreyas
Ganapathy, Sriram
[J]. COMPUTER SPEECH AND LANGUAGE, 2020, 60
[5] Regularization of neural network model with distance metric learning for i-vector based spoken language identification
Lu, Xugang
Shen, Peng
Tsao, Yu
Kawai, Hisashi
[J]. COMPUTER SPEECH AND LANGUAGE, 2017, 44 : 48 - 60
[6] Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval
Chen, Ying-Wen
Chen, Kuan-Yu
Wang, Hsin-Min
Chen, Berlin
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2889 - 2893
[7] A LOCALITY-PRESERVING ESSENCE VECTOR MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
Chen, Kuan-Yu
Liu, Shih-Hung
Chen, Berlin
Wang, Hsin-Min
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5665 - 5669
[8] I-vector features and deep neural network modeling for language recognition
Wang, Wei
Song, Wenjie
Chen, Chen
Zhang, Zhaoxin
Xin, Yi
[J]. 2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 36 - 43
[9] i-vector representation based on bottleneck features for language identification
Song, Yan
Jiang, Bing
Bao, YeBo
Wei, Si
Dai, Li-Rong
[J]. ELECTRONICS LETTERS, 2013, 49 (24) : 1569 - +
[10] Scalable I-vector Concatenation for PLDA based Language Identification System
Irtza, Saad
Bavattichalil, Haris
Sethu, Vidhyasaharan
Ambikairajah, Eliathamby
[J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1182 - 1185

← 1 2 3 4 5 →