SI-NET: MULTI-SCALE CONTEXT-AWARE CONVOLUTIONAL BLOCK FOR SPEAKER VERIFICATION

被引:6
|
作者
Li, Zhuo [1 ,2 ]
Fang, Ce [1 ,2 ]
Xiao, Runqiu [1 ,2 ]
Wang, Wenchao [1 ,2 ]
Yan, Yonghong [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Xinjiang Key Lab Minor Speech & Language Informat, Urumqi, Peoples R China
关键词
speaker verification; Split-Integration; multi-scale features; dynamic integration; at a granular level;
D O I
10.1109/ASRU51503.2021.9688119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Utilizing multi-scale information adequately is essential for building a high-performance speaker verification (SV) system. Biological research shows that the human auditory system employs a multi-timescale processing mode to extract information and has a mechanism of integrating multi-scale information to encode sound information. Inspired by this, we propose a novel block, named Split-Integration (SI), to explore multi-scale context-aware feature learning at a granular level for speaker verification. Our model involves a pair of operations, (i) multi-scale split, which is designed to imitate the multi-timescale processing mode, extracting multi-scale features by grouping and stacking different sizes of filters, and (ii) dynamic integration, which aims at reflecting analogy with the fusion mechanism, introducing KL divergence to measure the complementarily between multi-scale features such that the model fully integrates multi-scale features and produces better speaker-discriminative representation. Experiments are conducted on Voxceleb and Speakers in the Wild(SITW) datasets. Results demonstrate that our approach achieves a relative 10%-20% improvement on equal error rate (EER) over a strong baseline in the SV task.
引用
收藏
页码:220 / 227
页数:8
相关论文
共 50 条
  • [21] Multi-Scale Structure Perception and Global Context-Aware Method for Small-Scale Pedestrian Detection
    Gao, Hao
    Huang, Shucheng
    Li, Mingxing
    Li, Tian
    [J]. IEEE ACCESS, 2024, 12 : 76392 - 76403
  • [22] Multi-Branch Convolutional Network for Context-Aware Recommendation
    Guo, Wei
    Zhang, Can
    Guo, Huifeng
    Tang, Ruiming
    He, Xiuqiang
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1709 - 1712
  • [23] FREQUENCY AND MULTI-SCALE SELECTIVE KERNEL ATTENTION FOR SPEAKER VERIFICATION
    Mun, Sung Hwan
    Jung, Jee-Weon
    Han, Min Hyun
    Kim, Nam Soo
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 548 - 554
  • [24] Global context-aware feature modulation networks for unified multi-scale super-resolution
    Zhang, Dacheng
    Lei, Weimin
    Zhang, Wei
    Chen, Xinyi
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [25] Fusing multi-scale context-aware information representation for automatic in-field pest detection and recognition
    Wang, Fangyuan
    Wang, Rujing
    Xie, Chengjun
    Yang, Po
    Liu, Liu
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 169 (169)
  • [26] Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography
    Rangarajan, Krithika
    Gupta, Aman
    Dasgupta, Saptarshi
    Marri, Uday
    Gupta, Arun Kumar
    Hari, Smriti
    Banerjee, Subhashis
    Arora, Chetan
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [27] Multi-scale attention context-aware network for detection and localization of image splicingEfficient and robust identification network
    Ruyong Ren
    Shaozhang Niu
    Junfeng Jin
    Jiwei Zhang
    Hua Ren
    Xiaojie Zhao
    [J]. Applied Intelligence, 2023, 53 : 18219 - 18238
  • [28] Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography
    Krithika Rangarajan
    Aman Gupta
    Saptarshi Dasgupta
    Uday Marri
    Arun Kumar Gupta
    Smriti Hari
    Subhashis Banerjee
    Chetan Arora
    [J]. Scientific Reports, 12
  • [29] Scale- and Context-Aware Convolutional Non-Intrusive Load Monitoring
    Chen, Kunjin
    Zhang, Yu
    Wang, Qin
    Hu, Jun
    Fan, Hang
    He, Jinliang
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (03) : 2362 - 2373
  • [30] MSCAReg-Net: Multi-scale complexity-aware convolutional neural network for deformable image registration
    Yu, Hu
    Zheng, Qiang
    Hu, Fang
    Ma, Chaoqing
    Wang, Shuo
    Wang, Shuai
    [J]. IET IMAGE PROCESSING, 2024, 18 (04) : 839 - 855