Domain Specific Audio Indexing Using Linguistic Information

Authors
Pandey, L. [1]
Nathwani, K. [1]
Kaur, S. [1]
Husain, I. [1]
Pathak, R. [1]
Singh, G. [1]
Tiwari, S. [1]
Hegde, Rajesh M. [1]
Affiliation
[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, a novel methodology for indexing domain-specific audio archives using the linguistic information present in the speech signal is discussed. The audio indexing system is phone based and can work under limited training data conditions. A training data set that captures the linguistic information within the Hindi language at the syllable level is first developed. A reduced phone set is then derived from the super syllabic set of the Hindi language. The system is then bootstrapped at the phone level with domain-specific data. The audio indexing itself is then performed using a novel sliding phone protocol technique. The performance of this audio indexing system is then evaluated on Indian parliament speech and read news. Compared to conventional methods, the proposed bootstrapping method with sliding phone search provides reasonable improvements in phone recognition accuracy and in search retrieval efficiency.
Pages: 364-369
Page count: 6