SI-NET: MULTI-SCALE CONTEXT-AWARE CONVOLUTIONAL BLOCK FOR SPEAKER VERIFICATION

被引:6
|
作者
Li, Zhuo [1 ,2 ]
Fang, Ce [1 ,2 ]
Xiao, Runqiu [1 ,2 ]
Wang, Wenchao [1 ,2 ]
Yan, Yonghong [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Xinjiang Key Lab Minor Speech & Language Informat, Urumqi, Peoples R China
关键词
speaker verification; Split-Integration; multi-scale features; dynamic integration; at a granular level;
D O I
10.1109/ASRU51503.2021.9688119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Utilizing multi-scale information adequately is essential for building a high-performance speaker verification (SV) system. Biological research shows that the human auditory system employs a multi-timescale processing mode to extract information and has a mechanism of integrating multi-scale information to encode sound information. Inspired by this, we propose a novel block, named Split-Integration (SI), to explore multi-scale context-aware feature learning at a granular level for speaker verification. Our model involves a pair of operations, (i) multi-scale split, which is designed to imitate the multi-timescale processing mode, extracting multi-scale features by grouping and stacking different sizes of filters, and (ii) dynamic integration, which aims at reflecting analogy with the fusion mechanism, introducing KL divergence to measure the complementarily between multi-scale features such that the model fully integrates multi-scale features and produces better speaker-discriminative representation. Experiments are conducted on Voxceleb and Speakers in the Wild(SITW) datasets. Results demonstrate that our approach achieves a relative 10%-20% improvement on equal error rate (EER) over a strong baseline in the SV task.
引用
收藏
页码:220 / 227
页数:8
相关论文
共 50 条
  • [31] Multi-scale attention context-aware network for detection and localization of image splicing Efficient and robust identification network
    Ren, Ruyong
    Niu, Shaozhang
    Jin, Junfeng
    Zhang, Jiwei
    Ren, Hua
    Zhao, Xiaojie
    [J]. APPLIED INTELLIGENCE, 2023, 53 (15) : 18219 - 18238
  • [32] Multi-Scale and Context-Aware Framework for Flood Segmentation in Post-Disaster High Resolution Aerial Images
    Khan, Sultan Daud
    Basalamah, Saleh
    [J]. REMOTE SENSING, 2023, 15 (08)
  • [33] Progressive Context-Aware Aggregation Network Combining Multi-Scale and Multi-Level Dense Reconstruction for Building Change Detection
    Xu, Chuan
    Ye, Zhaoyi
    Mei, Liye
    Yang, Wei
    Hou, Yingying
    Shen, Sen
    Ouyang, Wei
    Ye, Zhiwei
    [J]. REMOTE SENSING, 2023, 15 (08)
  • [34] MAGE: Multi-scale Context-aware Interaction based on Multi-granularity Embedding for Chinese Medical Question Answer Matching
    Wang, Meiling
    He, Xiaohai
    Liu, Yan
    Qing, Linbo
    Zhang, Zhao
    Chen, Honggang
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 228
  • [35] MCRD-Net: An unsupervised dense network with multi-scale convolutional block attention for multi-focus image fusion
    Zhou, Ding
    Jin, Xin
    Jiang, Qian
    Cai, Li
    Lee, Shin-jye
    Yao, Shaowen
    [J]. IET IMAGE PROCESSING, 2022, 16 (06) : 1558 - 1574
  • [36] Knowledge Graph Completion by Context-Aware Convolutional Learning with Multi-Hop Neighborhoods
    Oh, Byungkook
    Seo, Seungmin
    Lee, Kyong-Ho
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 257 - 266
  • [37] MULTI-SCALE CONTEXT-AWARE R-CNN FOR FEW-SHOT OBJECT DETECTION IN REMOTE SENSING IMAGES
    Su, Haozheng
    You, Yanan
    Meng, Gang
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1908 - 1911
  • [38] A multi-scale context-aware and batch-independent lightweight network for green tide extraction from SAR images
    Xu, Mingming
    Zhu, Xiaofang
    Liu, Yanfen
    Liu, Shanwei
    Sheng, Hui
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (13) : 4474 - 4499
  • [39] Occluded prohibited object detection in X-ray images with global Context-aware Multi-Scale feature Aggregation
    Ma, Chunjie
    Zhuo, Li
    Li, Jiafeng
    Zhang, Yutong
    Zhang, Jing
    [J]. NEUROCOMPUTING, 2023, 519 : 1 - 16
  • [40] Multi-scale deep context convolutional neural networks for semantic segmentation
    Zhou, Quan
    Yang, Wenbing
    Gao, Guangwei
    Ou, Weihua
    Lu, Huimin
    Chen, Jie
    Latecki, Longin Jan
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 555 - 570