Segment-Level Pyramid Match Kernels For The Classification of Varying Length Patterns of Speech Using SVMs

被引:0
|
作者
Gupta, Shikha [1 ]
Dileep, A. D. [1 ]
Thenkanidiyoor, Veena [2 ]
机构
[1] Indian Inst Technol Mandi, Sch Comp & Elect Engn, Mandi, HP, India
[2] Natl Inst Technol Goa, Dept Comp Sci & Engn, Ponda, Goa, India
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Classification of long duration speech, represented as varying length sets of feature vectors using support vector machine (SVM) requires a suitable kernel. In this paper we propose a novel segment-level pyramid match kernel (SLPMK) for the classification of varying length patterns of long duration speech represented as sets of feature vectors. This kernel is designed by partitioning the speech signal into increasingly finer segments and matching the corresponding segments. We study the performance of the SVM-based classifiers using the proposed SLPMKs for speech emotion recognition and speaker identification and compare with that of the SVM-based classifiers using other dynamic kernels.
引用
收藏
页码:2030 / 2034
页数:5
相关论文
共 12 条
  • [1] Emotion Recognition from Varying Length Patterns of Speech using CNN-based Segment-Level Pyramid Match Kernel based SVMs
    Gupta, Shikha
    De, Kishalaya
    Dinesh, Dileep Aroor
    Thenkanidiyoor, Veena
    2019 25TH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2019,
  • [2] Segment-level probabilistic sequence kernel and segment-level pyramid match kernel based extreme learning machine for classification of varying length patterns of speech
    Shikha Gupta
    Ahmed Karanath
    Kansul Mahrifa
    A. D. Dileep
    Veena Thenkanidiyoor
    International Journal of Speech Technology, 2019, 22 : 231 - 249
  • [3] Segment-level probabilistic sequence kernel and segment-level pyramid match kernel based extreme learning machine for classification of varying length patterns of speech
    Gupta, Shikha
    Karanath, Ahmed
    Mahrifa, Kansul
    Dileep, A. D.
    Thenkanidiyoor, Veena
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 231 - 249
  • [4] Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech
    Gupta, Shikha
    Thenkanidiyoor, Veena
    Dinesh, Dileep Aroor
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 321 - 328
  • [5] HMM BASED PYRAMID MATCH KERNEL FOR CLASSIFICATION OF SEQUENTIAL PATTERNS OF SPEECH USING SUPPORT VECTOR MACHINES
    Dileep, A. D.
    Sekhar, C. Chandra
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3562 - 3566
  • [6] A Family of Example-Specific Density based Matching Kernels for Classification of Varying Length Patterns of Speech using Support Vector Machines
    Sachdev, Abhijeet
    Dileep, A. D.
    Thenkanidiyoor, Veena
    2015 SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MACHINE INTELLIGENCE (ISCMI), 2015, : 121 - 125
  • [7] Relation Classification Using Segment-Level Attention-based CNN and Dependency-based RNN
    Tran, Van-Hien
    Phi, Van-Thuy
    Shindo, Hiroyuki
    Matsumoto, Yuji
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2793 - 2798
  • [8] Example-Specific Density Based Matching Kernel for Classification of Varying Length Patterns of Speech Using Support Vector Machines
    Sachdev, Abhijeet
    Dileep, A. D.
    Thenkanidiyoor, Veena
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 177 - 184
  • [9] GMM-Based Intermediate Matching Kernel for Classification of Varying Length Patterns of Long Duration Speech Using Support Vector Machines
    Dileep, Aroor Dinesh
    Sekhar, Chellu Chandra
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) : 1421 - 1432
  • [10] Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines
    Dileep, A. D.
    Sekhar, C. Chandra
    SPEECH COMMUNICATION, 2014, 57 : 126 - 143