Automatic Identification of Discourse Relations in Indian Languages

Cited by: 0
Authors
Devi, Sobha Lalitha [1]
Gopalan, Sindhuja [1]
Lakshmi, S. [1]
Affiliation
[1] Anna Univ, AU KBC Res Ctr, Madras 600025, Tamil Nadu, India
Keywords
Discourse relation; CRFs; Connectives; Arguments
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification code
030303; 0501; 050102
Abstract
This paper describes the first effort at automatic identification of connectives and their arguments for three Indian languages: Hindi, Malayalam, and Tamil. We adopted the machine learning technique Conditional Random Fields (CRFs) for our work. We used a corpus of 3000 sentences from the health domain. Domain-independent features were extracted to improve the performance of the system. We concentrated mainly on the identification of explicit connectives and their arguments. Two sets of experiments were performed: the first for the identification of connectives and the second for the identification of argument boundaries. Using this approach, we obtained encouraging results for all three languages. Error analysis shows that the structural patterns of discourse relations differ among the three languages.
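The two-stage setup described in the abstract (connectives first, then argument boundaries) can be pictured as two sequence-labeling passes over the same tokens. The following is a minimal sketch only, not the authors' implementation: the English example sentence, the small connective lexicon, the BIO label scheme, and the feature names are all assumptions made for illustration, and no CRF is trained here. The feature dictionaries are in the shape that common CRF toolkits accept.

```python
from typing import Dict, List, Tuple

# Hypothetical connective lexicon, for illustration only (not from the paper).
CONNECTIVES = {"because", "but", "although"}


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Domain-independent features for token i: word form plus a +/-1 window."""
    word = tokens[i].lower()
    return {
        "word": word,
        "in_lexicon": word in CONNECTIVES,
        "is_first": i == 0,
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def bio_labels(n: int, spans: List[Tuple[int, int, str]]) -> List[str]:
    """Encode (start, end_exclusive, name) spans as BIO labels over n tokens."""
    labels = ["O"] * n
    for start, end, name in spans:
        labels[start] = f"B-{name}"
        for j in range(start + 1, end):
            labels[j] = f"I-{name}"
    return labels


# Assumed English example; the paper works on Hindi, Malayalam, and Tamil.
tokens = "He stayed home because he was ill".split()
features = [token_features(tokens, i) for i in range(len(tokens))]

# Stage 1 targets: the connective itself.
connective_labels = bio_labels(len(tokens), [(3, 4, "CONN")])
# Stage 2 targets: argument boundaries on either side of the connective.
argument_labels = bio_labels(len(tokens), [(0, 3, "ARG1"), (4, 7, "ARG2")])
```

In this sketch each stage would be trained as its own sequence labeler on `features` paired with the corresponding label list; whether the second stage also consumes the first stage's predictions is a design choice the abstract does not specify.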
Pages: 7
Related papers
50 in total
  • [21] A Survey of Automatic Text Summarization Techniques for Indian and Foreign Languages
    Shah, Prachi
    Desai, Nikita P.
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 4598 - 4601
  • [22] ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages
    Singh, Amitoj
    Kadyan, Virender
    Kumar, Munish
    Bassan, Nancy
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (05) : 3673 - 3704
  • [23] WTASR: Wavelet Transformer for Automatic Speech Recognition of Indian Languages
    Choudhary, Tripti
    Goyal, Vishal
    Bansal, Atul
    BIG DATA MINING AND ANALYTICS, 2023, 6 (01) : 85 - 91
  • [24] Quantitative Aspects of PDTB-Style Discourse Relations across Languages
    Sun, Kun
    Zhang, Lili
    JOURNAL OF QUANTITATIVE LINGUISTICS, 2018, 25 (04) : 342 - 371
  • [25] From discourse to pathology: Automatic identification of Parkinson's disease patients via morphological measures across three languages
    Eyigoz, Elif
    Courson, Melody
    Sedeno, Lucas
    Rogg, Katharina
    Orozco-Arroyave, Juan Rafael
    Noth, Elmar
    Skodda, Sabine
    Trujillo, Natalia
    Rodriguez, Mabel
    Rusz, Jan
    Munoz, Edinson
    Cardona, Juan F.
    Herrera, Eduar
    Hesse, Eugenia
    Ibanez, Agustin
    Cecchi, Guillermo
    Garcia, Adolfo M.
    CORTEX, 2020, 132 : 191 - 205
  • [26] Discourse markers and coherence relations: Comparison across markers, languages and modalities
    Taboada, Maite
    Gomez-Gonzalez, Maria de los Angeles
    LINGUISTICS AND THE HUMAN SCIENCES, 2010, 6 (1-3) : 17 - 41
  • [27] AUTOMATIC LEARNING OF FUZZY NAMING RELATIONS OVER FINITE LANGUAGES
    DEMORI, R
    SAITTA, L
    INFORMATION SCIENCES, 1980, 21 (02) : 93 - 139
  • [28] Signaling coherence relations by means of discourse markers makes identification of the relations easier? An investigation of the recognition of the relations by discourse addressees
    Antonio, Juliano Desiderato
    REVISTA DE ESTUDOS DA LINGUAGEM, 2016, 24 (01) : 293 - 325
  • [29] Cognate Identification to improve Phylogenetic trees for Indian Languages
    Kanojia, Diptesh
    Kulkarni, Malhar
    Bhattacharyya, Pushpak
    Haffari, Gholemreza
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 297 - 300
  • [30] Automatic Learning of Discourse Relations in Swedish Using Cue Phrases
    Karlsson, Stefan
    Nugues, Pierre
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 179 - 184