Ontology-supported text classification based on cross-lingual word sense disambiguation

被引:0
|
作者
Tufis, Dan [1 ]
Koeva, Svetla [2 ]
机构
[1] Res Inst Artificial Intelligence, Romanian Acad, 13,13 Septembrie, Bucharest 050711, Romania
[2] Bulgarian Acad Sci, Inst Bulgerian Lang, Sofia, Bulgaria
来源
关键词
cross-lingual document classification; multilingual lexical ontology; parallel corpora; word alignment; word sense disambiguation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper reports on recent experiments in cross-lingual document processing (with a case study for Bulgarian-English-Romanian language pairs) and brings evidence on the benefits of using linguistic ontologies for achieving, with a high level of accuracy, difficult tasks in NLP such as word alignment, word sense disambiguation, document classification, cross-language information retrieval, etc. We provide brief descriptions of the parallel corpus we used, the multilingual lexical ontology which supports our research, the word alignment and word sense disambiguation systems we developed and a preliminary report on all ongoing development of a system for cross-lingual text-classification which takes advantage of these multilingual technologies. Unlike the keyword-based methods in document processing, the concept-based methods are supposed to better exploit the semantic information contained in a particular document and thus to provide more accurate results.
引用
收藏
页码:447 / +
页数:3
相关论文
共 50 条
  • [41] Word sense disambiguation as the primary step of ontology integration
    Banek, Marko
    Vrdoljak, Boris
    Tjoa, A. Min
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 65 - +
  • [42] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [43] Cross-lingual sense determination:: Can it work?
    Ide, N
    [J]. COMPUTERS AND THE HUMANITIES, 2000, 34 (1-2): : 223 - 234
  • [44] Prompt-based learning framework for zero-shot cross-lingual text classification
    Feng, Kai
    Huang, Lan
    Wang, Kangping
    Wei, Wei
    Zhang, Rui
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [45] Word sense disambiguation based sentiment lexicons for sentiment classification
    Hung, Chihli
    Chen, Shiuan-Jeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 110 : 224 - 232
  • [46] Emotion Detection in Cross-Lingual Text Based on Bidirectional LSTM
    Ren, Han
    Wan, Jing
    Ren, Yafeng
    [J]. SECURITY WITH INTELLIGENT COMPUTING AND BIG-DATA SERVICES, 2020, 895 : 838 - 845
  • [47] CLUSE: Cross-Lingual Unsupervised Sense Embeddings
    Chi, Ta-Chung
    Chen, Yun-Nung
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 271 - 281
  • [48] Cross-Lingual Sense Determination: Can It Work?
    Nancy Ide
    [J]. Computers and the Humanities, 2000, 34 : 223 - 234
  • [49] A survey of cross-lingual word embedding models
    Ruder, Sebastian
    Vulić, Ivan
    Søgaard, Anders
    [J]. Journal of Artificial Intelligence Research, 2019, 65 : 569 - 631
  • [50] Refinement of Unsupervised Cross-Lingual Word Embeddings
    Biesialska, Magdalena
    Costa-jussa, Marta R.
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1978 - 1981