Automatic Identification of Discourse Relations in Indian Languages

被引:0
|
作者
Devi, Sobha Lalitha [1 ]
Gopalan, Sindhuja [1 ]
Lakshmi, S. [1 ]
机构
[1] Anna Univ, AU KBC Res Ctr, Madras 600025, Tamil Nadu, India
关键词
Discourse relation; CRFs; Connectives; arguments;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper describes the first effort on automatic identification of connectives and their arguments for three Indian languages Hindi, Malayalam and Tamil. We have adopted machine learning technique Conditional Random Fields (CRFs) for our work. We have used a corpus of 3000 sentences belonging to health domain. Domain independent features were extracted to improve the performance of the system. We mainly concentrated on the identification of explicit connectives and their arguments. Two sets of experiments were performed. First set of experiment was performed for the identification of connectives and next for the identification of argument boundaries. Using this approach we obtained encouraging results for all the three languages. Error analysis shows the presence of different structural patterns of discourse relations among three languages.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Discourse Tagging for Indian Languages
    Devi, Sobha Lalitha
    Lakshmi, S.
    Gopalan, Sindhuja
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PT I, 2014, 8403 : 469 - 480
  • [2] AUTOMATIC LANGUAGE IDENTIFICATION OF THREE INDIAN LANGUAGES USING VECTOR QUANTIZATION
    Roy, Pinki
    Das, Pradip K.
    [J]. FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 293 - +
  • [3] Identification of Relations from IndoWordNet for Indian Languages using Support Vector Machine
    Garg, Megha
    Sinha, Bhaskar
    Chandra, Somnath
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTING AND NETWORK COMMUNICATIONS (COCONET), 2015, : 547 - 552
  • [4] A GMM-BASED HIERARCHICAL AUTOMATIC LANGUAGE IDENTIFICATION SYSTEM FOR INDIAN LANGUAGES
    Jothilakshmi, S.
    Ramalingam, V.
    Palanivel, S.
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2012, 26 (06) : 554 - 570
  • [5] Automatic Language Identification for Seven Indian Languages using Higher Level Features
    Madhu, Chithra
    George, Anu
    Mary, Leena
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2017,
  • [6] AUTOMATIC TEXT SUMMARIZATION FOR INDIAN LANGUAGES
    Kumar, Jeetendra
    Shekhar, Shashi
    Gupta, Rashmi
    [J]. EVERYMANS SCIENCE, 2022, 57 (01):
  • [7] Automatic identification of European languages
    Zhdanova, AV
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2002, 2553 : 76 - 84
  • [8] Automatic Mapping of French Discourse Connectives to PDTB Discourse Relations
    Laali, Majid
    Kosseim, Leila
    [J]. 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 1 - 6
  • [9] Automatic identification of rhetorical relations among intra-sentence discourse segments in Arabic
    Lagrini S.
    Azizi N.
    Redjimi M.
    Dwairi M.A.
    [J]. International Journal of Intelligent Systems Technologies and Applications, 2019, 18 (03): : 281 - 302
  • [10] Identification of Indian Languages in romanized form
    Yadav, Pratibha
    Mishra, Girish
    Saxena, P. K.
    [J]. PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 112 - +