Development of Language Resources for Speech Application in Gujarati and Marathi

被引:0
|
作者
Madhavi, Maulik C. [1 ]
Sharma, Shubham [1 ]
Patil, Hemant A. [1 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol D, Gandhinagar, Gujarat, India
关键词
Phonetic transcription; syllabification; pitch marking; low resource language;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper discusses development of resources using linguistics and signal processing aspects for two low resource Indian languages, viz., Gujarati and Marathi. Speech resource development discusses the details of data collection, transcription at phone and syllable level and corresponding linguistic units such as phones and syllables. In order to analyze the performance at different fluency levels, three types of recording modes, viz., read, conversation and lecture are considered in this paper. Manual annotation of speech in terms of International Phonetic Alphabet (IPA) symbols is presented. In the later section, we discuss speech segmentation at syllable level and prosodic level marking (pitch marking). Short-term Energy contour is smoothened using group-delay-based algorithm in order to detect syllable units in the speech signal. Detection rate obtained for syllable marking within 20 % agreement duration is of the order of 60 % in case of read mode speech. Prosody pitch marks are analyzed via F-0 pattern of a speech signal. The key strength of this study is the analysis for different kinds of recording modes, viz., read, conversation and lecture mode. It is found that CV (where, Consonant is followed by Vowel) type of syllables have highest occurrence (more than 50 %) in both the languages. Read speech is observed to perform better than spontaneous speech in terms of automatic prosodic marking.
引用
收藏
页码:115 / 118
页数:4
相关论文
共 50 条
  • [1] Development of Speech Corpora in Gujarati and Marathi for Phonetic Transcription
    Malde, Kewal D.
    Vachhani, Bhavik B.
    Madhavi, Maulik C.
    Chhayani, Nirav H.
    Patil, Hemant A.
    [J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [2] Script to speech conversion for Marathi language
    Pasalkar, NB
    Joshi, CV
    Tasgaonkar, M
    [J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 1262 - 1266
  • [3] On the Development of Speech Resources for the Mixtec Language
    Caballero-Morales, Santiago-Omar
    [J]. SCIENTIFIC WORLD JOURNAL, 2013,
  • [4] Speech Recognition using HTK Toolkit for Marathi Language
    Chavan, Supriya S.
    Handore, S. M.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1591 - 1597
  • [5] DEVELOPMENT OF VOCAL TRACT LENGTH NORMALIZED PHONETIC ENGINE FOR GUJARATI AND MARATHI LANGUAGES
    Sharma, Shubham
    Madhavi, Maulik C.
    Patil, Hemant A.
    [J]. 2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA), 2014,
  • [6] Corpus Building for Hate Speech Detection of Gujarati Language
    Vadesara, Abhilasha
    Tanna, Purna
    [J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 382 - 395
  • [7] Design and Development of Marathi Speech Interface System
    Gaikwad, Santosh
    Gawali, Bharti
    Mehrotra, Suresh
    [J]. ADVANCED COMPUTING AND SYSTEMS FOR SECURITY, VOL 2, 2016, 396 : 3 - 20
  • [8] Phonetic Transcription of Fricatives and Plosives for Gujarati and Marathi Languages
    Patil, Hemant A.
    Madhavi, Maulik C.
    Malde, Kewal D.
    Vachhani, Bhavik B.
    [J]. 2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 177 - 180
  • [9] Expressive Speech Synthesis using Prosodic Modification for Marathi Language
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    [J]. 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 126 - 130
  • [10] Syllable-Based Concatenative Speech Synthesis for Marathi Language
    Ghate, Pravin M.
    Shirbahadurkar, Suresh D.
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR COMPETITIVE STRATEGIES, 2019, 40 : 615 - 624