A Saliency-Based Auditory Attention Model with Applications to Unsupervised Prominent Syllable Detection in Speech

被引:0
|
作者
Kalinli, Ozlem [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ So Calif, Dept Elect Engn Syst, SAIL, Los Angeles, CA 90089 USA
关键词
auditory attention; auditory saliency map; prominent syllable detection; attention model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A bottom-up or saliency driven attention allows the brain to detect nonspecific conspicuous targets in cluttered scenes before fully processing and recognizing the targets. Here, a novel biologically plausible auditory saliency map is presented to model such saliency based auditory attention. Multi-scale auditory features are extracted based on the processing stages in the central auditory system, and they are combined into a single master saliency map. The usefulness of the proposed auditory saliency map in detecting the prominent syllable and word locations in speech is tested in an unsupervised manner. When evaluated with broadcast news-style read speech using the BU Radio News Corpus, the model achieves 75.9 % accuracy at the syllable level, and 78.1 % accuracy at word level. These results compare well to results reported on human performance.
引用
收藏
页码:2452 / 2455
页数:4
相关论文
共 50 条
  • [1] A Saliency-based Attention LSTM Model for Cognitive Load Classification from Speech
    Gallardo-Antolin, Ascension
    Montero, Juan M.
    [J]. INTERSPEECH 2019, 2019, : 216 - 220
  • [2] SALIENCY-BASED UNSUPERVISED IMAGE MATTING
    Tan, Guanghua
    Qi, Jun
    Gao, Chunming
    Chen, Jin
    Zhuo, Liyuan
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (04)
  • [3] Saliency-based dual-attention network for unsupervised video object segmentation
    Guifang Zhang
    Hon-Cheng Wong
    [J]. The Journal of Supercomputing, 2024, 80 (4) : 4996 - 5010
  • [4] Saliency-based dual-attention network for unsupervised video object segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4996 - 5010
  • [5] A Saliency-based Unsupervised Method for Angiectasia Detection in Endoscopic Video Frames
    Deeba, Farah
    Mohammed, Shahed K.
    Bui, Francis M.
    Wahid, Khan A.
    [J]. JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2018, 38 (02) : 325 - 335
  • [6] Saliency-based bit plane detection for network applications
    Kaljahi, Maryam Asadzadeh
    Shivakumara, Palaiahnakote
    Hakak, Saqib
    Idris, Mohd Yamani Idna
    Anisi, Mohammad Hossein
    Rajan, Deepu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (25-26) : 18495 - 18513
  • [7] A Saliency-based Unsupervised Method for Angiectasia Detection in Endoscopic Video Frames
    Farah Deeba
    Shahed K. Mohammed
    Francis M. Bui
    Khan A. Wahid
    [J]. Journal of Medical and Biological Engineering, 2018, 38 : 325 - 335
  • [8] Saliency-based bit plane detection for network applications
    Maryam Asadzadeh Kaljahi
    Palaiahnakote Shivakumara
    Saqib Hakak
    Mohd Yamani Idna Idris
    Mohammad Hossein Anisi
    Deepu Rajan
    [J]. Multimedia Tools and Applications, 2020, 79 : 18495 - 18513
  • [9] A model of saliency-based visual attention for rapid scene analysis
    Itti, L
    Koch, C
    Niebur, E
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) : 1254 - 1259
  • [10] Saliency-Based Spatiotemporal Attention for Video Captioning
    Chen, Yangyu
    Zhang, Weigang
    Wang, Shuhui
    Li, Liang
    Huang, Qingming
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,