Domain Specific Audio Indexing Using Linguistic Information

被引：0

作者：

Pandey, L. ^{[1
]}

Nathwani, K. ^{[1
]}

Kaur, S. ^{[1
]}

Husain, I. ^{[1
]}

Pathak, R. ^{[1
]}

Singh, G. ^{[1
]}

Tiwari, S. ^{[1
]}

Hegde, Rajesh M. ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India

来源：

2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2014年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper a novel methodology for indexing domain specific audio archives using linguistic information present in the speech signal is discussed. The audio indexing system is phone based and can work under limited training data conditions. A training data set that captures the linguistic information within Hindi language at the syllable level is first developed. A reduced phone set is then derived from the super syllabic set of the Hindi language. The system is then bootstrapped at the phone level with domain specific data. The audio indexing itself is then performed using a novel sliding phone protocol technique. The performance of such a audio indexing system is then evaluated for Indian parliament speech and read news. The proposed bootstrapping method with sliding phone search provides reasonable improvements in phone recognition accuracy and in terms of search retrieval efficiency when compared to conventional methods.

引用

页码：364 / 369

页数：6

共 50 条

[31] State of the art in audio indexing
Carre, Matthieu
Philippe, Pierrick
Annales des Telecommunications/Annals of Telecommunications, 2000, 55 (9-10): : 507 - 525
[32] Speech processing for audio indexing
Lamel, Lori
Gauvain, Jean-Luc
ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 4 - 15
[33] Access to bilingual information using specific ontologies of the biomedical domain
Carrero Garcia, Francisco
Gomez Hidalgo, Jose Maria
de Buenaga Rodriguez, Manuel
Mata, Jacinto
Mana Lopez, Manuel
PROCESAMIENTO DEL LENGUAJE NATURAL, 2007, (38): : 107 - 117
[34] USING DOMAIN SPECIFIC LANGUAGES IN THE BUILDING INFORMATION MODELLING WORKFLOW
Fernando, Ruwan
Steel, James
Drogemuller, Robin
PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED ARCHITECTURAL DESIGN RESEARCH IN ASIA (CAADRIA 2011): CIRCUIT BENDING, BREAKING AND MENDING, 2011, : 731 - 740
[35] Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval
Mueller, Christof
Gurevych, Iryna
EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 219 - 226
[36] AWtoolbox: Characterizing Audio Information Using Audio Words
Yeh, Chin-Chia Michael
Jao, Ping-Keng
Yang, Yi-Hsuan
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 809 - 812
[37] A simplified Latent Semantic Indexing approach for multi-linguistic information retrieval
Liu, Y
Lu, HM
Lu, ZX
Wang, P
PACLIC 17: LANGUAGE, INFORMATION AND COMPUTATION, PROCEEDINGS, 2003, : 69 - 79
[38] Use of Audio Stories as Linguistic Treatment in Preschoolers with Specific Language Impairment
Niebuhr-Siebert, S.
Ritterfeld, U.
SPRACHE-STIMME-GEHOR, 2012, 36 (01): : 18 - 24
[39] Domain-specific Noisy Query Correction using Linguistic Network Community Detection
Patil, Sangameshwar
WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 126 - 127
[40] Nonexclusive Audio Segmentation and Indexing as a Pre-processor for Audio Information Mining A universal architecture and feature space selection
Li, Francis F.
2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1593 - 1597

← 1 2 3 4 5 →