A Multi-Layer System for Semantic Textual Similarity

被引:3
|
作者
Ngoc Phuoc An Vo [1 ]
Popescu, Octavian [2 ]
机构
[1] Xerox Res Ctr Europe, Meylan, France
[2] IBM Corp, TJ Watson Res, Yorktown Hts, NY USA
来源
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1 | 2016年
关键词
Machine Learning; Natural Language Processing (NLP); Semantic Textual Similarity (STS);
D O I
10.5220/0006045800560067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building a system able to cope with various phenomena which falls under the umbrella of semantic similarity is far from trivial. It is almost always the case that the performances of a system do not vary consistently or predictably from corpora to corpora. We analyzed the source of this variance and found that it is related to the word-pair similarity distribution among the topics in the various corpora. Then we used this insight to construct a 4-module system that would take into consideration not only string and semantic word similarity, but also word alignment and sentence structure. The system consistently achieves an accuracy which is very close to the state of the art, or reaching a new state of the art. The system is based on a multi-layer architecture and is able to deal with heterogeneous corpora which may not have been generated by the same distribution.
引用
收藏
页码:56 / 67
页数:12
相关论文
共 50 条
  • [41] Semantic Textual Similarity Using Various Approaches
    Kazula, Maciej
    Kozlowski, Marek
    MACHINE INTELLIGENCE AND BIG DATA IN INDUSTRY, 2016, 19 : 49 - 62
  • [42] Linking Datasets Using Semantic Textual Similarity
    McCrae, John P.
    Buitelaar, Paul
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (01) : 109 - 123
  • [43] Czech news dataset for semantic textual similarity
    Sido, Jakub
    Sejak, Michal
    Prazak, Ondrej
    Konopik, Miloslav
    Moravec, Vaclav
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [44] Similarity of crystalline structure of BSCCO single crystal and multi-layer film
    Zhang, H
    Tang, XT
    Zhang, L
    Han, SH
    Zhao, Y
    PHYSICA C, 2001, 357 (SUPPL. 2): : 190 - 193
  • [45] Image similarity measure based on multi-layer matching of SIFT feature
    Li, C. (lcl_zju@aliyun.com), 1600, Binary Information Press (10):
  • [46] Question Similarity Detection in Turkish Using Semantic Textual Similarity Methods
    Yildiz, Eray
    Findik, Yasin
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [47] Deployment of Multi-layer UAV Relay System
    Han, Sang Ik
    Baek, Jaeuk
    Han, Youngnam
    2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2018,
  • [48] Multi-layer Location Verification System in MANETs
    Dong, Jingyi
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL II: SIGNAL PROCESSING, 2020, 516 : 1428 - 1434
  • [49] Job scheduling in a multi-layer vision system
    Ercan, MF
    Oguz, C
    Fung, YF
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 317 - 321
  • [50] Gated Multi-Layer Fusion for Real-Time Semantic Segmentation
    Zhang C.
    Cheng Q.
    Li Z.
    Wang Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (09): : 1442 - 1449