Text Implicates Prosodic Ambiguity: A Corpus for Intention Identification of the Korean Spoken Language

被引:0
|
作者
Cho, Won Ik [1 ,2 ]
Kim, Nam Soo [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept ECE, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
关键词
Korean spoken language; speech act; intention identification; prosodic ambiguity; directiveness; rhetoricalness; intonation-dependency;
D O I
10.1145/3529648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phonetic features are indispensable in understanding the spoken language. Especially in Korean, which is wh-in-situ and head-final, the addressee of spoken language sometimes finds it hard to discern the speaker's original intention if not provided with the sentence prosody. However, acoustic information may not be guaranteed for all spoken language processing, due to the difficulty of managing and computing speech data. This article suggests a corpus that aims to distinguish utterances with ambiguous intention from clear-cut ones, utilizing the prosodic ambiguity of the text input. In detail, the resulting classification system decides whether the given text input is one of fragment, statement, question, command, rhetorical question/command, or indecisive, taking into account the intonation-dependency of the text. Based on an intuitive understanding of the Korean language engaged in the data annotation, we construct a corpuswith seven intention categories, train classification systems, and validate the utility of our dataset with quantitative and qualitative analyses.
引用
收藏
页数:20
相关论文
共 14 条
  • [1] PROSODIC ATTRIBUTE MODEL FOR SPOKEN LANGUAGE IDENTIFICATION
    Ng, Raymond W. M.
    Leung, Cheung-Chi
    Lee, Tan
    Ma, Bin
    Li, Haizhou
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5022 - 5025
  • [2] Integrating acoustic, prosodic and phonotactic features for spoken language identification
    Tong, Rong
    Ma, Bin
    Zhu, Donglai
    Li, Haizhou
    Chng, Eng Siong
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 205 - 208
  • [3] Research on Korean Spoken Language Identificaiotn Based on Special Syllables and Prosodic Feature
    Lu, Shi-Dan
    Cui, Rong-Yi
    2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2013), 2013, : 377 - 380
  • [4] A method for creation and validation of a natural spoken language corpus used for prosodic and speech perception
    Wendt, B
    Hufnagel, K
    Brechmann, A
    Gaschler-Markefski, B
    Tiedge, E
    Ackermann, H
    Scheich, H
    BRAIN AND LANGUAGE, 2003, 87 (01) : 187 - 187
  • [5] DESIGNING A KOREAN FRENCH-LEARNERS' SPEECH CORPUS (KFLSC) FOR SPOKEN LANGUAGE ASSESSMENT
    Park, Soeun
    Chun, Jihye
    Kim, Mi Hyun
    Lee, Hyunjoo
    Lee, Seong Heon
    Kim, Sunhee
    2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,
  • [6] Automatic Emotional Spoken Language Text Corpus Construction from Written Dialogs in Fictions
    Chen, Jinkun
    Liu, Cong
    Li, Ming
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 319 - 324
  • [7] Language Identification: A New Fast Algorithm to Identify the Language of a Text in a Multilingual Corpus
    Gadri, Said
    Moussaoui, Abdelouahab
    Belabdelouahab-Fernini, Linda
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 321 - 326
  • [8] A method for constructing Korean spontaneous spoken language corpus based on an imitation of abbreviated and transformed particles
    Hyok-Chol Ri
    Chol Kim
    Mok-Ran Jo
    International Journal of Speech Technology, 2022, 25 : 205 - 210
  • [9] A method for constructing Korean spontaneous spoken language corpus based on an imitation of abbreviated and transformed particles
    Ri, Hyok-Chol
    Kim, Chol
    Jo, Mok-Ran
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 205 - 210
  • [10] A Study on the Word Usage of L2 Korean Learners through Text Mining -Based on Analysis of Spoken Corpus-
    Kim, Eunjeong
    Choi, Soo Yeon
    Lee, Myungji
    JOURNAL OF THE INTERNATIONAL NETWORK FOR KOREAN LANGUAGE AND CULTURE, 2021, 18 (03): : 23 - 50