A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences

Cited by: 32
Authors
Lee, Ming Che [1]
Chang, Jia Wei [2]
Hsieh, Tung Cheng [3]
Affiliations
[1] Ming Chuan Univ, Dept Comp & Commun Engn, Taoyuan 333, Taiwan
[2] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 701, Taiwan
[3] Hsuan Chuang Univ, Dept Visual Commun Design, Hsinchu 300, Taiwan
Keywords
INFORMATION; PRINCIPLES; EXTRACTION; RETRIEVAL; WORDNET;
DOI
10.1155/2014/437162
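Chinese Library Classification (CLC)
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];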
Discipline classification codes
07; 0710; 09
Abstract
This paper presents a similarity algorithm for natural language sentences that is based on grammar and a semantic corpus. Natural language, in contrast to "artificial language" such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even ontology-based approaches that compare concept similarity rather than co-occurring terms or words, may fail to find the best match when two natural language sentences share no obvious relation or concept overlap. This paper proposes a sentence similarity algorithm that uses a corpus-based ontology and grammatical rules to overcome these problems. Experiments on two well-known benchmarks demonstrate that the proposed algorithm significantly improves performance on sentences and short texts with arbitrary syntax and structure.
Pages: 17