A high-performance speech BioHashing retrieval algorithm based on audio segmentation

Cited by: 1
Authors
Huang, Yi-Bo [1]
Chen, De-Huai [1]
Hua, Bo-Run [1]
Zhang, Qiu-Yu [2]
Affiliations
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China
[2] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
Source
COMPUTER SPEECH AND LANGUAGE | 2023 / Volume 83
Funding
National Natural Science Foundation of China
Keywords
High-performance speech retrieval; Biometric template; BioHashing; Audio segmentation; Hash reconstruction; QUANTIZATION;
DOI
10.1016/j.csl.2023.101551
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As one of the research hotspots in the field of speech recognition, content-based speech retrieval algorithms can detect speech with the same content features, which improves computer intelligence while reducing labor costs, and they have therefore been widely used. Although most current speech content retrieval algorithms achieve excellent performance on small-scale retrieval tasks, their performance drops sharply when the speech data occupy a large storage space and contain highly redundant content. To solve these problems, this paper proposes a high-performance speech BioHashing retrieval algorithm based on audio segmentation. The algorithm is divided into an offline preprocessing phase and an online retrieval phase. The offline preprocessing phase converts the speech data into BioHashing sequences that carry speech content characteristics. First, the Power-Normalized Cepstral Coefficients (PNCC) features of the speech data are extracted, and biometric templates with single mapping keys are constructed from the PNCC features to obtain the BioHashing sequences. Then, the original speech is sliced into short-time audio segments by the proposed audio segmentation algorithm, and a hash reconstruction operation is performed on the BioHashing sequences to obtain the reconstructed hash sequences used for online retrieval. The online retrieval phase responds to users' query requests: it finds the hash index that matches the query hash sequence in the BioHashing index table and returns to the user, as the retrieval result, the entry whose standardized edit distance (SED) value is closest to 1. The experimental results show that the reconstructed hash sequences obtained after removing the silent redundant segments have better robustness and discrimination.
Moreover, the algorithm achieves 100% retrieval accuracy for the original speech clips with an average retrieval time of only 0.0157 s, which shows that it has good retrieval performance and can meet the needs of speech retrieval in various environments.
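The two phases described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the template construction, the segmentation rule, or the exact SED formula. The sketch assumes a generic BioHashing step (key-seeded random projection followed by sign thresholding, without the orthonormalization that BioHashing schemes usually apply) and assumes that the SED behaves like a similarity score, `1 - edit_distance / max_length`, so that the best match is the entry closest to 1. The names `biohash`, `similarity`, `retrieve`, and the toy feature vectors are hypothetical; real input would be PNCC features.

```python
import random

def biohash(features, key, n_bits=32):
    """Project a feature vector onto key-seeded random directions and binarize.
    Assumption: a generic BioHashing step, not the paper's template design."""
    rng = random.Random(key)  # the key stands in for the single mapping key
    bits = []
    for _ in range(n_bits):
        direction = [rng.gauss(0.0, 1.0) for _ in features]
        projection = sum(d * f for d, f in zip(direction, features))
        bits.append("1" if projection > 0 else "0")
    return "".join(bits)

def edit_distance(a, b):
    """Levenshtein distance computed with a rolling dynamic-programming row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]

def similarity(a, b):
    """Assumed SED-style score in [0, 1]; 1 means identical hash sequences."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest

def retrieve(query_hash, index_table):
    """Return the index entry whose score against the query is closest to 1."""
    return max(index_table, key=lambda name: similarity(query_hash, index_table[name]))

# Offline phase: build a toy hash index (hypothetical feature vectors).
index_table = {
    "clip_a": biohash([0.2, -1.3, 0.7, 0.1], key=42),
    "clip_b": biohash([-0.9, 0.4, -0.2, 1.5], key=42),
}

# Online phase: hash the query with the same key and look it up.
query = biohash([0.2, -1.3, 0.7, 0.1], key=42)
print(retrieve(query, index_table))
```

A linear scan over the index is used here only for brevity; the abstract's reported retrieval time refers to the authors' own index table, not to this sketch.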
Pages: 15