Syllable-based Chinese text/spoken document retrieval using text/speech queries

被引:7
|
作者
Bai, BR
Chen, BL
Wang, HM [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
[3] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
关键词
information retrieval; text document retrieval; spoken document retrieval; speech query; multi-modality; syllable lattice; speech recognition; Mandarin Chinese;
D O I
10.1142/S0218001400000398
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In light of the rapid growth of Chinese information resources on the Internet, this study investigates a novel approach that deals with the problem of Chinese text and spoken document retrieval using both text and speech queries. By properly utilizing the monosyllabic structure of the Chinese language, the proposed approach estimates the statistical similarity between the text/speech queries and the text/spoken documents at the phonetic level using the syllable-based statistical information. The investigation successfully implemented a prototype system with an interface supporting some user-friendly functions and the initial test results demonstrate the feasibility of the proposed approach.
引用
收藏
页码:603 / 616
页数:14
相关论文
共 50 条
  • [1] Development of syllable-based text to speech synthesis system in Bengali
    Narendra, N.
    Rao, K.
    Ghosh, Krishnendu
    Vempada, Ramu
    Maity, Sudhamay
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 167 - 181
  • [2] Development of Concatenative Syllable-Based Text to Speech Synthesis System for Tamil
    Sudhakar, B.
    Bensraj, R.
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY ALGORITHMS IN ENGINEERING SYSTEMS, VOL 1, 2015, 324 : 585 - 592
  • [3] Video Retrieval Using Textual Queries and Spoken Text
    Sukhadeo, Bere Sachin
    Subhash, Rajpure Amol
    [J]. TECHNO-SOCIETAL 2018: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SOCIETAL APPLICATIONS - VOL 1, 2020, : 67 - 75
  • [4] Improved Syllable-Based Text to Speech Synthesis for Tone Language Systems
    Ekpenyong, Moses
    Udoh, EmemObong
    Udosen, Escor
    Urua, Eno-Abasi
    [J]. HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 3 - 15
  • [5] Syllable-based relevance feedback techniques for Mandarin voice record retrieval using speech queries
    Lee, LS
    Bai, BR
    Chien, LF
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1459 - 1462
  • [6] Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing
    Ichikawa, Ken
    Tsuge, Satoru
    Kitaoka, Norihide
    Takeda, Kazuya
    Kita, Kenji
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [7] Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese
    Wang, HM
    [J]. SPEECH COMMUNICATION, 2000, 32 (1-2) : 49 - 60
  • [8] Genetic Algorithms in Syllable-Based Text Compression
    Kuthan, Tomas
    Lansky, Jan
    [J]. DATESO 2007 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 7TH ANNUAL INTERNATIONAL WORKSHOP, 2007, 235 : 21 - 34
  • [9] A Syllable-Based Technique for Uyghur Text Compression
    Abliz, Wayit
    Wu, Hao
    Maimaiti, Maihemuti
    Wushouer, Jiamila
    Abiderexiti, Kahaerjiang
    Yibulayin, Tuergen
    Wumaier, Aishan
    [J]. INFORMATION, 2020, 11 (03)
  • [10] A Novel Text-to-Speech Synthesis System Using Syllable-Based HMM for Tamil Language
    Manoharan, J. Samuel
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 305 - 314