Employing heterogeneous information in a multi-stream framework

被引:0
|
作者
Christensen, H [1 ]
Lindberg, B [1 ]
Andersen, O [1 ]
机构
[1] Univ Aalborg, Ctr PersonKommun, DK-9220 Aalborg, Denmark
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A multi-stream speech recogniser is based on the combination of multiple feature streams each containing complementary information. In the past, multi-stream research has typically focused on systems that use a single feature extraction method. This heritage from conventional speech recognisers is an unnecessary restriction and both psychoacoustic and phonetic knowledge strongly motivate the use of heterogeneous features. In this paper we investigate how heterogeneous processing can be used in two different multi-stream configurations: first, a system where each stream handles a different frequency region of the speech (a multi-band recogniser) and, second a multi-stream recogniser where each stream handles the full frequency region. For each type of system we compare the performance using both homogeneous and heterogeneous processing. We demonstrate that the use of heterogeneous information significantly improves the clean speech recognition performance motivating us to continue exploring more specifically designed stream processing.
引用
收藏
页码:1571 / 1574
页数:4
相关论文
共 50 条
  • [1] Research on Grey Modeling for Multi-stream Information
    Liu, Xin
    Dai, Jin
    Zhou, Weijie
    [J]. JOURNAL OF GREY SYSTEM, 2016, 28 (04): : 127 - 137
  • [2] Multi-stream ASR trained with heterogeneous reverberant environments
    Shire, ML
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 253 - 256
  • [3] Effective Multi-stream Joining in Apache Samza Framework
    Zhuang, Zhenyun
    Feng, Tao
    Pan, Yi
    Ramachandra, Haricharan
    Sridharan, Badri
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 267 - 274
  • [4] Incorporating phonetic knowledge into a multi-stream HMM framework
    Norouzian, Atta
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    [J]. 2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 1632 - +
  • [5] Optimal Traffic Allocation for Multi-Stream Aggregation in Heterogeneous Networks
    Avramova, Andrijana Popovska
    Iversen, Villy Baek
    [J]. 2015 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2015,
  • [6] An Optimization Framework for Scheduling and Resource Allocation in Multi-Stream Heterogeneous MIMO-OFDMA Wireless Networks
    Danobeitia, Borja
    Femenias, Guillem
    Riera-Palou, Felip
    [J]. 2012 IFIP WIRELESS DAYS (WD), 2012,
  • [7] Multi-stream inflation
    Li, Miao
    Wang, Yi
    [J]. JOURNAL OF COSMOLOGY AND ASTROPARTICLE PHYSICS, 2009, (07):
  • [8] Multi-stream MPA
    Bestler, Caitlin
    [J]. 2005 IEEE International Conference on Cluster Computing (CLUSTER), 2006, : 623 - 628
  • [9] Multi-stream Deep Learning Framework for Automated Presentation Assessment
    Li, Junnan
    Wong, Yongkang
    Kankanhalli, Mohan S.
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 222 - 225
  • [10] A Multi-Stream Sequence Learning Framework for Human Interaction Recognition
    Haroon, Umair
    Ullah, Amin
    Hussain, Tanveer
    Ullah, Waseem
    Sajjad, Muhammad
    Muhammad, Khan
    Lee, Mi Young
    Baik, Sung Wook
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (03) : 435 - 444