Automatic segmentation and labelling of multi-lingual speech data

被引:18
|
作者
Vorstermans, A [1 ]
Martens, JP [1 ]
VanCoile, B [1 ]
机构
[1] STATE UNIV GHENT, ELIS, B-9000 GHENT, BELGIUM
关键词
automatic segmentation and labelling; multi-lingual; neural networks;
D O I
10.1016/S0167-6393(96)00037-4
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new system for the automatic segmentation and labelling of speech is presented. The system is capable of labelling speech originating from different languages without requiring extensive linguistic knowledge or large (manually segmented and labeled) training databases of that language. The system comprises small neural networks for the segmentation and the broad phonetic classification of the speech. These networks were originally trained on one task (Flemish continuous speech), and are automatically adapted to a new task. Due to the limited size of the neural networks, the segmentation and labelling strategy requires but a limited amount of computations, and the adaptation to a new task can be accomplished very quickly. The system was first evaluated on five isolated word corpora designed for the development of Dutch, French, American English, Spanish and Korean text-to-speech systems. The results show that the accuracy of the obtained automatic segmentation and labelling is comparable to that of human experts. In order to provide segmentation and labelling results which can be compared to data reported in the literature, additional tests were run on TIMIT and on the English, Danish and Italian portions of the EUROMO continuous speech utterances. The performance of our system appears to compare favourably to that of other systems.
引用
收藏
页码:271 / 293
页数:23
相关论文
共 50 条
  • [41] Detecting Hate Speech in Cross-Lingual and Multi-lingual Settings Using Language Agnostic Representations
    Rodriguez, Sebastian E.
    Allende-Cid, Hector
    Allende, Hector
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 77 - 87
  • [42] Cross corpus multi-lingual speech emotion recognition using ensemble learning
    Zehra, Wisha
    Javed, Abdul Rehman
    Jalil, Zunera
    Khan, Habib Ullah
    Gadekallu, Thippa Reddy
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (04) : 1845 - 1854
  • [43] An optimized machine translation technique for multi-lingual speech to sign language notation
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24099 - 24117
  • [44] An optimized machine translation technique for multi-lingual speech to sign language notation
    Amandeep Singh Dhanjal
    Williamjeet Singh
    [J]. Multimedia Tools and Applications, 2022, 81 : 24099 - 24117
  • [45] Cross corpus multi-lingual speech emotion recognition using ensemble learning
    Wisha Zehra
    Abdul Rehman Javed
    Zunera Jalil
    Habib Ullah Khan
    Thippa Reddy Gadekallu
    [J]. Complex & Intelligent Systems, 2021, 7 : 1845 - 1854
  • [46] Multi-lingual and Cross-lingual timeline extraction
    Laparra, Egoitz
    Agerri, Rodrigo
    Aldabe, Itziar
    Rigau, German
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 77 - 89
  • [47] Depression-level assessment from multi-lingual conversational speech data using acoustic and text features
    Demiroglu, Cenk
    Besirli, Asli
    Ozkanca, Yasin
    Celik, Selime
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)
  • [48] Depression-level assessment from multi-lingual conversational speech data using acoustic and text features
    Cenk Demiroglu
    Aslı Beşirli
    Yasin Ozkanca
    Selime Çelik
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [49] Multi-lingual and multi-cultural conditions
    Arce, CH
    [J]. TRANSPORT SURVEY QUALITY AND INNOVATION, 2003, : 209 - 213
  • [50] Multi-Task Based Mispronunciation Detection of Children Speech Using Multi-Lingual Information
    Wei, Linxuan
    Dong, Wenwei
    Lin, Binghuai
    Zhang, Jinsong
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1791 - 1794