Chunking Arabic Texts Using Conditional Random Fields

被引:0
|
作者
Khoufi, Nabil [1 ]
Aloulou, Chafik [1 ]
Hadrich Belguith, Lamia [1 ]
机构
[1] Univ Sfax, FSEGS, MIRACL Lab, ANLP Res Grp, Sfax, Tunisia
关键词
Chunking; Arabic language; CRF; supervised learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Chunking or shallow syntactic parsing is proving to be a task of interest to many natural language processing applications. The problem gets worse for the Arabic language because of its specific features that make it quite different and even more ambiguous than other natural languages when processed. In this paper, we present a method for chunking Arabic texts based on supervised learning. We use the Conditional Random Fields algorithm and the Penn Arabic Treebank to train the model. For the experimentation, we use over than 10,100 sentences as training data and 2,524 sentences for the test. The evaluation of the method consists of the calculation of the generated model accuracy and the results are very encouraging.
引用
收藏
页码:428 / 432
页数:5
相关论文
共 50 条
  • [1] Chunking using conditional random fields in Korean texts
    Lee, YH
    Kim, MY
    Lee, JH
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 155 - 164
  • [2] Chunking in Turkish with Conditional Random Fields
    Yildiz, Olcay Taner
    Solak, Ercan
    Ehsani, Razieh
    Gorgun, Onur
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 173 - 184
  • [3] Chinese chunking algorithm based on conditional random fields
    Sun, Guang-Lu
    Liu, Bing-Quan
    Wang, Xiao-Long
    Liu, Yuan-Chao
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2509 - 2513
  • [4] Chinese Chunking Algorithm Based on Cascaded Conditional Random Fields
    Sun, Guang-Lu
    Liu, Yuan-Chao
    Qiao, Pei-Li
    Lang, Fei
    [J]. PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [5] Vietnamese Noun Phrase Chunking based on Conditional Random Fields
    Nguyen Thi Huong Thao
    Nguyen Phuong Thai
    Nguyen Le Minh
    Ha Quang Thuy
    [J]. INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2009), 2009, : 172 - +
  • [6] Bengali Noun Phrase Chunking Based on Conditional Random Fields
    Sarkar, Kamal
    Gayen, Vivekananda
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON BUSINESS AND INFORMATION MANAGEMENT (ICBIM), 2014,
  • [7] Punctuation Prediction for Vietnamese Texts Using Conditional Random Fields
    Pham, Quang H.
    Nguyen, Binh T.
    Nguyen Viet Cuong
    [J]. SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 322 - 327
  • [8] Extracting Terms from Texts with Conditional Random Fields
    Li YiXuan
    Lu Xun
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY, 2016, 37 : 293 - 296
  • [9] Recognizing logical parts in Vietnamese Legal Texts using Conditional Random Fields
    Nguyen Truong Son
    Ho Bao Quoc
    Nguyen Thi Phuong Duyen
    Nguyen Le Minh
    [J]. 2015 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES - RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2015, : 1 - 6
  • [10] Aspect Terms Extraction of Arabic Dialects for Opinion Mining Using Conditional Random Fields
    Alawami, Alawya
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II, 2018, 9624 : 211 - 220