A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences

被引:32
|
作者
Lee, Ming Che [1 ]
Chang, Jia Wei [2 ]
Hsieh, Tung Cheng [3 ]
机构
[1] Ming Chuan Univ, Dept Comp & Commun Engn, Taoyuan 333, Taiwan
[2] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 701, Taiwan
[3] Hsuan Chuang Univ, Dept Visual Commun Design, Hsinchu 300, Taiwan
来源
关键词
INFORMATION; PRINCIPLES; EXTRACTION; RETRIEVAL; WORDNET;
D O I
10.1155/2014/437162
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to "artificial language", such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [1] Offline grammar-based recognition of handwritten sentences
    Zimmermann, M
    Chappelier, JC
    Bunke, H
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (05) : 818 - 821
  • [2] Grammar-based geodesics in semantic networks
    Rodriguez, Marko A.
    Watkins, Jennifer H.
    KNOWLEDGE-BASED SYSTEMS, 2010, 23 (08) : 844 - 855
  • [3] A Grammar-based model for the Semantic web
    Jung, Hyosook
    Park, Seongbin
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2011, 8 (01) : 73 - 100
  • [4] Addressing the Variability of Natural Language Expression in Sentence Similarity with Semantic Structure of the Sentences
    Achananuparp, Palakorn
    Hu, Xiaohua
    Yang, Christopher C.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 548 - 555
  • [5] Translating formal software specifications to natural language - A grammar-based approach
    Burke, DA
    Johannisson, K
    LOGICAL ASPECTS OF COMPUTATIONAL LINGUISTICS, PROCEEDINGS, 2005, 3492 : 51 - 66
  • [6] Grammar-based connectionist approaches to language
    Smolensky, P
    COGNITIVE SCIENCE, 1999, 23 (04) : 589 - 613
  • [7] Grammar-based random walkers in semantic networks
    Rodriguez, Marko A.
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (07) : 727 - 739
  • [8] A new ontology-based semantic similarity algorithm in the natural language processing
    Zhu, Xin-Hua
    Su, Fang-Fang
    Tang, Qi-Feng
    International Journal of Digital Content Technology and its Applications, 2012, 6 (02) : 188 - 195
  • [9] SGA: A grammar-based alignment algorithm
    Hu, Guangyue
    Shen, Shiyi
    Ruan, Jishou
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2007, 86 (01) : 17 - 20
  • [10] An Online Algorithm for Lightweight Grammar-Based Compression
    Maruyama, Shirou
    Sakamoto, Hiroshi
    Takeda, Masayuki
    ALGORITHMS, 2012, 5 (02) : 214 - 235