LIGHTSPEECH: LIGHTWEIGHT AND FAST TEXT TO SPEECH WITH NEURAL ARCHITECTURE SEARCH

Cited by: 19
Authors
Luo, Renqian [1]
Tan, Xu [2]
Wang, Rui [2]
Qin, Tao [3]
Li, Jinzhu [3]
Zhao, Sheng [3]
Chen, Enhong [1]
Liu, Tie-Yan [2]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Microsoft Azure Speech, Beijing, Peoples R China
Keywords
Text to speech; lightweight; fast; neural architecture search
DOI
10.1109/ICASSP39728.2021.9414403
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Text to speech (TTS) has been broadly used to synthesize natural and intelligible speech in different scenarios. Deploying TTS on end devices such as mobile phones or embedded devices requires extremely small memory usage and low inference latency. While non-autoregressive TTS models such as FastSpeech achieve significantly faster inference than autoregressive models, their model size and inference latency are still too large for deployment on resource-constrained devices. In this paper, we propose LightSpeech, which leverages neural architecture search (NAS) to automatically design more lightweight and efficient models based on FastSpeech. We first profile the components of the current FastSpeech model and carefully design a novel search space containing various lightweight and potentially effective architectures. NAS is then used to automatically discover well-performing architectures within the search space. Experiments show that the model discovered by our method achieves a 15x model compression ratio and a 6.5x inference speedup on CPU with on-par voice quality. Audio demos are provided at https://speechresearch.github.io/lightspeech.
Pages: 5699-5703
Page count: 5