A WFST-based Log-linear Framework for Speaking-style Transformation

被引:0
|
作者
Neubig, Graham [1 ]
Mori, Shinsuke [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
关键词
speaking style transformation; disfluency detection; weighted finite state transducers; log-linear model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When attempting to make transcripts from automatic speech recognition results, disfluency deletion, transformation of colloquial expressions, and insertion of dropped words must be performed to ensure that the final product is clean transcript-style text. This paper introduces a system for the automatic transformation of the spoken word to transcript-style language that enables not only deletion of disfluencies, but also substitutions of colloquial expressions and insertion of dropped words. A number of potentially useful features are combined in a log-linear probabilistic framework, and the utility of each is examined. The system is implemented using weighted finite state transducers (WFSTs) to allow for easy combination of features and integration with other WFST-based systems. On evaluation, the best system achieved a 5.37% word error rate, a 5.49% absolute gain over a rule-based baseline and a 1.54% absolute gain over a simple noisy-channel model.
引用
收藏
页码:1503 / 1506
页数:4
相关论文
共 50 条
  • [31] THE BOX-COX TRANSFORMATION AND NON-ITERATIVE ESTIMATION METHODS FOR ORDINAL LOG-LINEAR MODELS
    Beh, Eric J.
    Farver, Thomas B.
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2012, 54 (04) : 475 - 484
  • [32] Efficient parameter estimation for parabolic SPDEs based on a log-linear model for realized volatilities
    Bibinger, Markus
    Bossert, Patrick
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2023, 6 (01) : 407 - 429
  • [33] Log-Linear Model Based Behavior Selection Method for Artificial Fish Swarm Algorithm
    Huang, Zhehuang
    Chen, Yidong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015
  • [34] Log-Linear Pool to Combine Prior Distributions: A Suggestion for a Calibration-Based Approach
    Rufo, M. J.
    Martin, J.
    Perez, C. J.
    BAYESIAN ANALYSIS, 2012, 7 (02): : 411 - 438
  • [35] Efficient parameter estimation for parabolic SPDEs based on a log-linear model for realized volatilities
    Markus Bibinger
    Patrick Bossert
    Japanese Journal of Statistics and Data Science, 2023, 6 : 407 - 429
  • [36] IMPROVED STATISTICAL MODELS FOR SMT-BASED SPEAKING STYLE TRANSFORMATION
    Neubig, Graham
    Akita, Yuya
    Mori, Shinsuke
    Kawahara, Tatsuya
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5206 - 5209
  • [37] A new non-parametric control chart for monitoring general linear profiles based on log-linear modelling
    Huwang, Longcheen
    Liao, Yen-Ming
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2023, 39 (03) : 1024 - 1042
  • [38] Log-linear model-based multifactor dimensionality reduction method to detect genegene interactions
    Lee, Seung Yeoun
    Chung, Yujin
    Elston, Robert C.
    Kim, Youngchul
    Park, Taesung
    BIOINFORMATICS, 2007, 23 (19) : 2589 - 2595
  • [39] Trihalomethane prediction model for water supply system based on machine learning and Log-linear regression
    Hui Li
    Yangyang Chu
    Yanping Zhu
    Xiaomeng Han
    Shihu Shu
    Environmental Geochemistry and Health, 2024, 46
  • [40] Fast newton method to solve KLR based on multilevel circulant matrix with log-linear complexity
    Junna Zhang
    Shuisheng Zhou
    Cui Fu
    Feng Ye
    Applied Intelligence, 2023, 53 : 21407 - 21421