Design of a POS Tagger using Conditional Random Fields for Malayalam

被引:0
|
作者
Krishnapriya, V [1 ]
Sreesha, P. [1 ]
Harithalakshmi, T. R. [1 ]
Archana, T. C. [1 ]
Vettath, Jayasree N. [1 ]
机构
[1] Sreepathy Inst Management & Technol, Dept Comp Sci & Engn, Palakkad, Kerala, India
关键词
POS tagging; Hidden Markov Model; Stochastic process; CRF; Malayalam;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parts of Speech tagging, is a process of marking the words in a text as corresponding to a particular part of speech, based on its definition and context. POS tagger plays an important role in Natural language applications like speech recognition, natural language parsing, information retrieval and extraction. This paper discusses architecture for designing a Part-Of-Speech (POS tagger for Malayalam language using Conditional Random Field (CRF). The experiments presented in this paper use an annotated corpus of 1028 sentences (11,315 words) and tagset consists of 100 tags. A trigram based tagging scheme is involved in the experiments. The proposed system is based on an empirical approach that models the human POS tagging processing more realistically than the existing systems, without compromising the efficiency and accuracy.
引用
收藏
页码:370 / 373
页数:4
相关论文
共 50 条
  • [1] DEVELOPMENT OF A POS TAGGER FOR MALAYALAM-An Experience
    Manju, K.
    Soumya, S.
    Idicula, Sumam Mary
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 709 - 713
  • [2] Khmer POS Tagging Using Conditional Random Fields
    Sangvat, Sokunsatya
    Pluempitiwiriyawej, Charnyote
    [J]. COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 169 - 178
  • [3] POS Tagging and NER System for Kannada Using Conditional Random Fields
    Swamy, Arpitha
    Srinath, S.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (04) : 1 - 13
  • [4] POS Tagging in Amazighe Using Support Vector Machines and Conditional Random Fields
    Outahajala, Mohamed
    Benajiba, Yassine
    Rosso, Paolo
    Zenkouar, Lahbib
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2011, 6716 : 238 - 241
  • [5] Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging
    Na, Seung-Hoon
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2015, 14 (03)
  • [6] CCRFs: Cascaded Conditional Random Fields for Chinese POS Tagging
    Shen, Guoyang
    Qiu, Likun
    Hu, Changjian
    Zhao, Kai
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 437 - 444
  • [7] TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields
    vor der Brueck, Tim
    Mehler, Alexander
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1514 - 1519
  • [8] Design and implementation of an open source Greek POS Tagger and Entity Recognizer using spaCy
    Partalidou, Eleni
    Spyromitros-Xioufis, Eleftherios
    Doropoulos, Stavros
    Vologiannidis, Stavros
    Diamantaras, Konstantinos I.
    [J]. 2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 337 - 341
  • [9] Extracting Thai Compounds using Collocations and POS Bigram Probabilities without a POS Tagger
    Aroonmanakun, Wirote
    [J]. 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 118 - 122
  • [10] Curb Reconstruction using Conditional Random Fields
    Siegemund, Jan
    Pfeiffer, David
    Franke, Uwe
    Foerstner, Wolfgang
    [J]. 2010 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2010, : 203 - 210