ENGLISH NOUN PHRASE ACCENT PREDICTION FOR TEXT-TO-SPEECH

被引:15
|
作者
SPROAT, R
机构
[1] AT and T Bell Lab., Murray Hill, NJ
来源
COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 02期
关键词
D O I
10.1006/csla.1994.1004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes np, a component of the AT&T Bell Laboratories English text-to-speech system that computes accentuation for constructions such as apple cake, apple pie and sump pump factor. Np uses both rule-based and statistical 'corpus-based' methods. These methods are discussed, and their benefits and shortcomings enumerated. The various components of np are evaluated, and it is shown that the overall performance of np significantly reduces the error rate in accent assignment over some simple-minded 'baseline' approaches. The paper concludes by outlining some future areas of research.
引用
收藏
页码:79 / 94
页数:16
相关论文
共 50 条
  • [1] Assigning phrase accent to Chinese text-to-speech system
    Qian, Y
    Chen, F
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 485 - 488
  • [2] Automatic Pitch Accent Prediction for Text-To-Speech Synthesis
    Read, Ian
    Cox, Stephen
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2085 - 2088
  • [3] Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input
    Yanagita, Tomoya
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. IEEE ACCESS, 2023, 11 : 22355 - 22363
  • [4] FOCUS AND ACCENT IN A DUTCH TEXT-TO-SPEECH SYSTEM
    BAART, JLG
    [J]. FOURTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 1989, : 111 - 115
  • [5] INTONATIONAL PHRASE BREAK PREDICTION FOR TEXT-TO-SPEECH SYNTHESIS USING DEPENDENCY RELATIONS
    Mishra, Taniya
    Kim, Yeon-jun
    Bangalore, Srinivas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4919 - 4923
  • [6] Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis
    Futamata, Kosuke
    Park, Byeongseon
    Yamamoto, Ryuichi
    Tachibana, Kentaro
    [J]. INTERSPEECH 2021, 2021, : 3126 - 3130
  • [7] Data-Driven Phrase Break Prediction for Bengali Text-to-Speech System
    Ghosh, Krishnendu
    Rao, K. Sreenivasa
    [J]. CONTEMPORARY COMPUTING, 2012, 306 : 118 - 129
  • [8] Indian accent text-to-speech system for web browsing
    Sen, A
    Samudravijaya, K
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1): : 113 - 126
  • [9] Indian accent text-to-speech system for web browsing
    Aniruddha Sen
    K. Samudravijaya
    [J]. Sadhana, 2002, 27 : 113 - 126
  • [10] REVIEW OF TEXT-TO-SPEECH CONVERSION FOR ENGLISH
    HERTZ, SR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (03): : 1097 - 1099