Speech period detection using neural network classification

Cited by: 0
Authors
Vrábel, A [1]
Rozinaj, G [1]
Affiliation
[1] Slovak Univ Technol Bratislava, FEEIT, Dept Telecommun, Bratislava 81219, Slovakia
Keywords
pitch detection; speech synthesis; neural network; EMU database system
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The implementation of a Slovak speech synthesizer based on the TD-PSOLA algorithm in the EMU database system [1] has shown a strong relation between accurate pitch period estimation and the quality of the synthesized speech. Although pitch detection is a long-studied problem, no method has yet been developed that works well in general. In this work, a neural network is used as a statistical classifier that generalizes the outputs of various pitch detection algorithms. The paper further describes the use of this classifier in the text-to-speech synthesizer and discusses the results achieved.
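The idea in the abstract, several pitch detectors feeding one neural classifier, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two detectors (autocorrelation and AMDF), the 60 to 250 Hz search range, the network size, and the function names are all hypothetical, and the weights are random placeholders where a real system would use weights trained on labelled pitch periods.

```python
import numpy as np

def autocorr_period(frame, min_lag, max_lag):
    """Candidate period (samples): lag with the largest autocorrelation."""
    x = frame - frame.mean()
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]  # lags 0..N-1
    return min_lag + int(np.argmax(ac[min_lag:max_lag + 1]))

def amdf_period(frame, min_lag, max_lag):
    """Candidate period (samples): lag with the smallest average magnitude difference."""
    lags = np.arange(min_lag, max_lag + 1)
    amdf = [np.mean(np.abs(frame[lag:] - frame[:-lag])) for lag in lags]
    return int(lags[int(np.argmin(amdf))])

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def score_candidates(candidates, max_lag, w1, b1, w2, b2):
    """One-hidden-layer network: candidate periods in, one probability per detector out."""
    features = np.asarray(candidates, dtype=float) / max_lag  # scale to roughly [0, 1]
    hidden = np.tanh(w1 @ features + b1)
    return softmax(w2 @ hidden + b2)

# Synthetic voiced frame: 100 Hz fundamental plus its second harmonic, 8 kHz sampling.
fs, f0 = 8000, 100
n = np.arange(400)
frame = np.sin(2 * np.pi * f0 * n / fs) + 0.5 * np.sin(4 * np.pi * f0 * n / fs)

# Assumed search range of 60 to 250 Hz, expressed as lags in samples.
min_lag, max_lag = fs // 250, fs // 60
candidates = [autocorr_period(frame, min_lag, max_lag),
              amdf_period(frame, min_lag, max_lag)]

# Placeholder (untrained) weights: 2 inputs, 4 hidden units, 2 outputs.
rng = np.random.default_rng(0)
w1, b1 = rng.normal(size=(4, 2)), np.zeros(4)
w2, b2 = rng.normal(size=(2, 4)), np.zeros(2)

probs = score_candidates(candidates, max_lag, w1, b1, w2, b2)
period = candidates[int(np.argmax(probs))]  # candidate the network rates highest
print(candidates, probs, period)
```

On this clean synthetic frame both detectors agree, so the choice is trivial. The classifier matters on real speech, where individual detectors disagree (for example through octave errors) and a trained network has to decide which candidate to trust.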
Pages: 145-148
Page count: 4
Related papers
50 in total
  • [1] Classification of Imagined Speech Using Siamese Neural Network
    Lee, Dong-Yeon
    Lee, Minji
    Lee, Seong-Whan
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2979 - 2984
  • [2] Endpoint detection of speech signal using neural network
    Hussain, A
    Samad, SA
    Fah, LB
    IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : 271 - 274
  • [3] Improved Speech Emotion Classification Using Deep Neural Network
    Mariwan Hama Saeed
    Circuits, Systems, and Signal Processing, 2023, 42 : 7357 - 7376
  • [5] Dari Speech Classification Using Deep Convolutional Neural Network
    Dawodi, Mursal
    Baktash, Jawid Ahamd
    Wada, Tomohisa
    Alam, Najwa
    Joya, Mohammad Zarif
    2020 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS 2020), 2020, : 110 - 113
  • [6] Moving object detection and classification using neural network
    Dewan, M. Ali Akber
    Hossain, M. Julius
    Chae, Oksam
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 152 - 161
  • [7] Detection and classification of systolic murmur using a neural network
    Nakamitsu, T
    Shino, H
    Kotani, T
    Yana, K
    Harada, K
    Sudoh, J
    Harasawa, E
    Itoh, H
    PROCEEDINGS OF THE 1996 FIFTEENTH SOUTHERN BIOMEDICAL ENGINEERING CONFERENCE, 1996, : 365 - 366
  • [8] Spoofing Speech Detection using Temporal Convolutional Neural Network
    Tian, Xiaohai
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [9] Speaker Identification Using Robust Speech Detection and Neural Network
    Ouzounov, Atanas
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 48 - 54
  • [10] Gender Classification in Speech Recognition using Fuzzy Logic and Neural Network
    Meena, Kunjithapatham
    Subramaniam, Kulumani
    Gomathy, Muthusamy
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (05) : 477 - 485