RECOGNITION OF HIGHLY IMBALANCED CODE-MIXED BILINGUAL SPEECH WITH FRAME-LEVEL LANGUAGE DETECTION BASED ON BLURRED POSTERIORGRAM

被引:0
|
作者
Yeh, Ching-Feng [1 ]
Heidel, Aaron
Lee, Hong-Yi [1 ]
Lee, Lin-Shan [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
关键词
code-mixing; multilingual; ASR;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we proposed a new framework for recognition of highly imbalanced code-mixed bilingual speech using an additional frame-level language detector in the conventional recognition system. Blurred posteriorgram features (BPFs) are also proposed to be used in the language detector. The approach was evaluated with real spontaneous lectures offered at National Taiwan University. The highly imbalanced language distribution in code-mixed speech makes the task difficult. Preliminary experimental results showed not only very good performance improvement, but the improvement is complementary to that brought by better acoustic models, whether due to better adaptation approach or increased training data. The code-mixed bilingual speech is frequently used in the daily lives of many people in the globalized world today.
引用
收藏
页码:4873 / 4876
页数:4
相关论文
共 10 条
  • [1] RECOGNITION OF HIGHLY IMBALANCED CODE-MIXED BILINGUAL SPEECH WITH FRAME-LEVEL LANGUAGE DETECTION BASED ON BLURRED POSTERIORGRAM
    Yeh, Ching-Feng
    Heidel, Aaron
    Lee, Hong-Yi
    Lee, Lin-Shan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4873 - 4876
  • [2] An Improved Framework for Recognizing Highly Imbalanced Bilingual Code-Switched Lectures with Cross-Language Acoustic Modeling and Frame-Level Language Identification
    Yeh, Ching-Feng
    Lee, Lin-Shan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (07) : 1144 - 1159
  • [3] An improved framework for recognizing highly imbalanced bilingual code-switched lectures with cross-language acoustic modeling and frame-level language identification
    Yeh, Ching-Feng
    Lee, Lin-Shan
    [J]. IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (07): : 1144 - 1159
  • [4] Deep Learning-based Hate Speech Detection in Code-mixed Tamil Text
    Anbukkarasi, S.
    Varadhaganapathy, S.
    [J]. IETE JOURNAL OF RESEARCH, 2023, 69 (11) : 7893 - 7898
  • [5] Finding Complex Features for Guest Language Fragment Recovery in Resource-Limited Code-Mixed Speech Recognition
    Heidel, Aaron
    Lu, Hsiang-Hung
    Lee, Lin-Shan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2148 - 2161
  • [6] Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection
    Yousefi, Midia
    Hansen, John H. L.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 28 - 40
  • [7] Exploring the Impact of Lexicon-based Knowledge Transfer for Hate Speech Detection in Indonesia Code-Mixed Languages
    Pamungkas, Endang Wahyu
    Purworini, Dian
    Priyawati, Diah
    Chasana, Rona Rizhky Bunga
    [J]. PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 85 - 90
  • [8] Transfer learning based code-mixed part-of-speech tagging using character level representations for Indian languages
    Madasamy, Anand Kumar
    Padannayil, Soman Kutti
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (6) : 7207 - 7218
  • [9] Transfer learning based code-mixed part-of-speech tagging using character level representations for Indian languages
    Anand Kumar Madasamy
    Soman Kutti Padannayil
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 7207 - 7218
  • [10] The Effect of Phrase Vector Embedding in Explainable Hierarchical Attention-Based Tamil Code-Mixed Hate Speech and Intent Detection
    Devi, V. Sharmila
    Kannimuthu, S.
    Madasamy, Anand Kumar
    [J]. IEEE ACCESS, 2024, 12 : 11316 - 11329