Domain Specific Audio Indexing Using Linguistic Information

Authors
Pandey, L. [1]
Nathwani, K. [1]
Kaur, S. [1]
Husain, I. [1]
Pathak, R. [1]
Singh, G. [1]
Tiwari, S. [1]
Hegde, Rajesh M. [1]
Affiliation
[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, a novel methodology for indexing domain specific audio archives using linguistic information present in the speech signal is discussed. The audio indexing system is phone based and can work under limited training data conditions. A training data set that captures the linguistic information within the Hindi language at the syllable level is first developed. A reduced phone set is then derived from the super syllabic set of the Hindi language. The system is then bootstrapped at the phone level with domain specific data. The audio indexing itself is then performed using a novel sliding phone protocol technique. The performance of such an audio indexing system is then evaluated on Indian parliament speech and read news. The proposed bootstrapping method with sliding phone search provides reasonable improvements in phone recognition accuracy and in search retrieval efficiency when compared to conventional methods.
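
The abstract does not describe how the sliding phone search works internally. As a rough illustration only, the sketch below shows one plausible reading: a query phone sequence is slid across the recognized phone stream, and each window is scored by edit distance so that a small number of recognition errors is tolerated. The function names, the `max_errors` threshold, the edit-distance scoring, and the phone labels are all assumptions made for this example, not details taken from the paper.

```python
from typing import List, Tuple


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def sliding_phone_search(stream: List[str], query: List[str],
                         max_errors: int = 1) -> List[Tuple[int, int]]:
    """Slide a query-length window over the phone stream.

    Returns (start_index, distance) for every window whose edit distance
    to the query is at most max_errors (a hypothetical tolerance).
    """
    if not query:
        return []
    hits = []
    n = len(query)
    for start in range(len(stream) - n + 1):
        d = edit_distance(stream[start:start + n], query)
        if d <= max_errors:
            hits.append((start, d))
    return hits


# Illustrative phone labels only; not the paper's reduced Hindi phone set.
stream = ["s", "a", "n", "s", "a", "d", "k", "a"]
query = ["s", "a", "d"]
print(sliding_phone_search(stream, query))  # [(0, 1), (3, 0)]
```

In this toy run, the window at index 3 matches the query exactly, while the window at index 0 is accepted with one substitution. A real system would presumably map hit positions back to time stamps in the audio archive.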
Pages: 364-369
Number of pages: 6