Human Behavior Recognition: Semantics-based Text Copy Detection Method

被引:1
|
作者
Yang, Liu [1 ]
Xi, Jie [2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Binjiang Coll, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
关键词
Semantics-based copy detection; Semantics similarity; Plain text; keyword extraction; SIMILARITY; SEARCH;
D O I
10.1109/CCITSA.2015.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text document is the most widely used medium on the Internet. However, there are some emerging problems that cannot be neglected, such as plagiarism, reproduction of information content, illicit redistribution, and copyright disputes etc. Now plagiarists have become more and more "clever", they could rewrite the contents by using synonym substitution, syntactic variation and other methods. The traditional copy detection methods that use precise matching or similar string matching algorithms cannot apply to the circumstance of semantics-based copy method. To meet the challenge of supporting semantics-based copy detection, for the first time this paper proposes a semantics-based copy detection method supporting similarity ranking. Similarity scores between the suspicious text and each text from corpus are calculated using our proposed similarity calculation method. At last, top-k texts from corpus, which have high similarity scores with the suspicious text, are ranked and listed in descending order of the score. Experiments on the real-world dataset further show that our proposed solution is very efficient and effective in supporting semantics-based copy detection.
引用
收藏
页码:158 / 162
页数:5
相关论文
共 50 条
  • [21] Semantics-Based Intelligent Human-Computer Interaction
    Gatteschi, Valentina
    Lamberti, Fabrizio
    Montuschi, Paolo
    Sanna, Andrea
    [J]. IEEE INTELLIGENT SYSTEMS, 2016, 31 (04) : 11 - 21
  • [22] Extracting Chemical Reactions from Thai Text for Semantics-Based Information Retrieval
    Intarapaiboon, Peerasak
    Nantajeewarawat, Ekawit
    Theeramunkong, Thanaruk
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (03): : 479 - 486
  • [23] Extracting semantic link network of words from text for semantics-based applications
    Li, Jiazheng
    Zhou, Jian
    Zhuge, Hai
    [J]. Expert Systems with Applications, 2025, 263
  • [24] Extracting Chemical Reactions from Thai Text for Semantics-Based Information Retrieval
    Intarapaiboon, Peerasak
    Nantajeewarawat, Ekawit
    Theeramunkong, Thanaruk
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, PROCEEDINGS, 2010, 5990 : 271 - 281
  • [25] A semantics-based method for clustering of Chinese web search results
    Zhang, Hui
    Wang, Deqing
    Wang, Li
    Bi, Zhuming
    Chen, Yong
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2014, 8 (01) : 147 - 165
  • [26] Schema-less, semantics-based change detection for XML documents
    Zhang, SH
    Dyreson, C
    Snodgrass, RT
    [J]. WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 279 - 290
  • [27] Apposcopy: Semantics-Based Detection of Android Malware through Static Analysis
    Feng, Yu
    Anand, Saswat
    Dillig, Isil
    Aiken, Alex
    [J]. 22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, : 576 - 587
  • [28] Syntax, and semantics-based signature database for hybrid intrusion detection systems
    Barry, Bazara I. A.
    Chan, Anthony
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2009, 2 (06) : 457 - 475
  • [29] TextDroid: Semantics-based Detection of Mobile Malware Using Network Flows
    Wang, Shanshan
    Yan, Qiben
    Chen, Zhenxiang
    Yang, Bo
    Zhao, Chuan
    Conti, Mauro
    [J]. 2017 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2017, : 18 - 23
  • [30] CONSTDET: Control Semantics-Based Detection for GPS Spoofing Attacks on UAVs
    Wei, Xiaomin
    Sun, Cong
    Lyu, Minjie
    Song, Qipeng
    Li, Yue
    [J]. REMOTE SENSING, 2022, 14 (21)