Human Behavior Recognition: Semantics-based Text Copy Detection Method

Cited by: 1
Authors
Yang, Liu [1]
Xi, Jie [2]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Binjiang Coll, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
Keywords
Semantics-based copy detection; Semantics similarity; Plain text; Keyword extraction; Similarity; Search
DOI
10.1109/CCITSA.2015.28
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text documents are the most widely used medium on the Internet. However, several emerging problems cannot be neglected, such as plagiarism, reproduction of information content, illicit redistribution, and copyright disputes. Plagiarists have become increasingly "clever": they can rewrite content using synonym substitution, syntactic variation, and other methods. Traditional copy detection methods, which rely on exact or approximate string matching, do not handle this kind of semantics-based copying. To meet this challenge, this paper proposes, for the first time, a semantics-based copy detection method that supports similarity ranking. Similarity scores between the suspicious text and each text in the corpus are calculated using the proposed similarity calculation method. Finally, the top-k texts in the corpus with the highest similarity scores are listed in descending order of score. Experiments on a real-world dataset show that the proposed solution is efficient and effective in supporting semantics-based copy detection.
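The ranking step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's similarity calculation, so this example swaps in a plainly different, much simpler measure: Jaccard overlap between token sets after mapping synonyms to a canonical word. The synonym table and the names normalize, similarity, and top_k are hypothetical and exist only for illustration.

```python
import heapq
import re

# Hypothetical synonym table (an assumption for this sketch, not from the paper):
# each word maps to a canonical form so that synonym substitution is neutralized.
SYNONYMS = {
    "automobile": "car",
    "vehicle": "car",
    "purchased": "buy",
    "bought": "buy",
    "purchase": "buy",
}


def normalize(text):
    """Lowercase, tokenize on letters, and map each token to its canonical form."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return {SYNONYMS.get(token, token) for token in tokens}


def similarity(text_a, text_b):
    """Jaccard similarity of the normalized token sets (a stand-in measure)."""
    set_a, set_b = normalize(text_a), normalize(text_b)
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def top_k(suspicious, corpus, k):
    """Return (score, corpus_index) pairs for the k most similar corpus texts,
    ordered by descending score."""
    scored = [(similarity(suspicious, doc), index) for index, doc in enumerate(corpus)]
    return heapq.nlargest(k, scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    corpus = [
        "She purchased an automobile yesterday",
        "The weather is sunny today",
        "He bought a car",
    ]
    for score, index in top_k("She bought a car yesterday", corpus, k=2):
        print(f"{score:.3f}  {corpus[index]}")
```

Scoring every corpus text and then selecting with heapq.nlargest keeps the sketch short; a real system over a large corpus would likely need an index rather than a full scan, which the abstract does not detail.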
Pages: 158-162
Page count: 5
Related papers
50 in total
  • [1] Spam Filtering by Semantics-based Text Classification
    Hu, Wei
    Du, Jinglong
    Xing, Yongkang
    [J]. 2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2016, : 89 - 94
  • [2] A semantics-based approach to malware detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2008, 30 (05):
  • [3] A semantics-based approach to Malware detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. ACM SIGPLAN NOTICES, 2007, 42 (01) : 377 - 388
  • [4] A Semantics-Based Approach to Malware Detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. CONFERENCE RECORD OF POPL 2007: THE 34TH ACM SIGPLAN SIGACT SYMPOSIUM ON PRINCIPLES OF PROGRAMMING LANGUAGES, 2007, : 377 - 388
  • [5] Human-centric and semantics-based explainable event detection: a survey
    Taiwo Kolajo
    Olawande Daramola
    [J]. Artificial Intelligence Review, 2023, 56 : 119 - 158
  • [6] Human-centric and semantics-based explainable event detection: a survey
    Kolajo, Taiwo
    Daramola, Olawande
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 1) : 119 - 158
  • [7] A New Semantics-Based Android Malware Detection
    Zhang, Xiaohan
    Jin, Zhengping
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1412 - 1416
  • [8] Semantic-Summarizer: Semantics-based text summarizer for English language text
    Mohd, Mudasir
    Nowsheena
    Wani, Mohsin Altaf
    Khanday, Hilal Ahmad
    Mir, Umar Bashir
    Nasrullah, Sheikh
    Maqbool, Zahid
    Wani, Abid Hussain
    [J]. SOFTWARE IMPACTS, 2023, 18
  • [9] Class Semantics-based Attention for Action Detection
    Sridhar, Deepak
    Quader, Niamul
    Muralidharan, Srikanth
    Li, Yaoxin
    Dai, Peng
    Lu, Juwei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13719 - 13728
  • [10] A Semantics-Based Trajectory Segmentation Simplification Method
    Minshi Liu
    Guifang He
    Yi Long
    [J]. Journal of Geovisualization and Spatial Analysis, 2021, 5