PHONETIC AND PROSODICALLY RICH TRANSCRIBED SPEECH CORPUS IN INDIAN LANGUAGES : BENGALI AND ODIA

Cited by: 0
Authors
Kumar, Sunil S. B. [1]
Rao, K. Sreenivasa [1]
Pati, Debadatta [2]
Affiliations
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
[2] Balasore Coll Engn & Technol, Sergarh 756060, Balasore, India
Keywords
Phonetic; Prosody; Speech corpus; Bengali; Oriya; Syllable; International Phonetic Alphabet (IPA); Pitch
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we introduce a speech corpus in two Indian languages, Bengali and Odia, that provides phonetic and prosodic information. Phonetics and prosody are vital parameters in human speech perception; hence, studying them systematically will help in performing various speech processing tasks. Motivated by this, we have developed a Phonetic and Prosodically Rich Transcribed (PPRT) speech corpus in the Bengali and Odia languages. For this corpus, ten hours of read speech, five hours of conversation speech and five hours of extempore speech have been collected. The database has been transcribed using the International Phonetic Alphabet (IPA) to represent all possible phoneme variations. Along with the phonetic transcription, prosodic information such as duration patterns of syllables, intonation patterns of phrases, and break patterns within and across phrases is represented.
Pages: 5