Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery

被引:0
|
作者
Ozaki, Ryo [1 ]
Taniguchi, Tadahiro [2 ]
机构
[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu, Shiga, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu, Shiga, Japan
关键词
nonparametric Bayesian double articulation analyzer; unsupervised word segmentation; word discovery;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.
引用
收藏
页码:238 / 244
页数:7
相关论文
共 50 条
  • [1] Double Articulation Analyzer With Prosody for Unsupervised Word and Phone Discovery
    Okuda, Yasuaki
    Ozaki, Ryo
    Komura, Soichiro
    Taniguchi, Tadahiro
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1335 - 1347
  • [2] Double articulation analyzer with deep sparse autoencoder for unsupervised word discovery from speech signals
    Taniguchi, Tadahiro
    Nakashima, Ryo
    Liu, Hailong
    Nagasaka, Shogo
    [J]. ADVANCED ROBOTICS, 2016, 30 (11-12) : 770 - 783
  • [3] Unsupervised Phoneme and Word Discovery From Multiple Speakers Using Double Articulation Analyzer and Neural Network With Parametric Bias
    Nakashima, Ryo
    Ozaki, Ryo
    Taniguchi, Tadahiro
    [J]. FRONTIERS IN ROBOTICS AND AI, 2019, 6
  • [4] Nonparametric Bayesian Double Articulation Analyzer for Direct Language Acquisition From Continuous Speech Signals
    Taniguchi, Tadahiro
    Nagasaka, Shogo
    Nakashima, Ryo
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2016, 8 (03) : 171 - 185
  • [5] Prediction of Next Contextual Changing Point of Driving Behavior Using Unsupervised Bayesian Double Articulation Analyzer
    Nagasaka, Shogo
    Taniguchi, Tadahiro
    Hitomi, Kentarou
    Takenaka, Kazuhito
    Bando, Takashi
    [J]. 2014 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2014, : 930 - 937
  • [6] Unsupervised Multimodal Word Discovery Based on Double Articulation Analysis With Co-Occurrence Cues
    Taniguchi, Akira
    Murakami, Hiroaki
    Ozaki, Ryo
    Taniguchi, Tadahiro
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (04) : 1825 - 1840
  • [7] Semiotic Prediction of Driving Behavior using Unsupervised Double Articulation Analyzer
    Taniguchi, Tadahiro
    Nagasaka, Shogo
    Hitomi, Kentarou
    Chandrasiri, Naiwala P.
    Bando, Takashi
    [J]. 2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2012, : 849 - 854
  • [8] ITERATIVE BAYESIAN WORD SEGMENTATION FOR UNSUPERVISED VOCABULARY DISCOVERY FROM PHONEME LATTICES
    Heymann, Jahn
    Walter, Oliver
    Haeb-Umbach, Reinhold
    Raj, Bhiksha
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] A computational model for unsupervised word discovery
    ten Bosch, Louis
    Cranen, Bert
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2668 - 2671
  • [10] Nonparametric Bayesian Models for Unsupervised Activity Recognition and Tracking
    Dhir, Neil
    Perov, Yura
    Wood, Frank
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4040 - 4045