A Short Text Similarity Algorithm for Finding Similar Police 110 Incidents

被引:0
|
作者
Duan, Lei [1 ]
Xu, Tongge [2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
来源
2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD) | 2016年
关键词
short text similarity; word embedding; police intelligence; CRIME; FRAMEWORK; COPLINK;
D O I
10.1109/CCBD.2016.22
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Finding similar police 110 incidents from the incident dataset plays an important role in recognising related cases from which the investigators could find more clues and make a better decision on police deployment. We aim at finding 110 incidents with similar case features and semantic compared against a given incident. A short text similarity algorithm is presented. Our algorithm is developed from a novel semantic similarity algorithm Word Mover'd Distance(WMD). In order to emphasize the significance of case features in incident text, the method introduces the traditional term frequency-inverted document frequency(TF-IDF) as term weights to the WMD. Then the algorithm is verified on the practical dataset of public security department to find similar incidents, and experiments show that the algorithm is effective and can improve the accuracy in finding similar police incidents.
引用
收藏
页码:260 / 264
页数:5
相关论文
共 50 条
  • [41] Finding similar consensus between trees: an algorithm and a distance hierarchy
    Wang, JTL
    Zhang, KZ
    PATTERN RECOGNITION, 2001, 34 (01) : 127 - 137
  • [43] Similar Group Finding Algorithm Based on Temporal Subgraph Matching
    Cai, Yizhu
    Li, Mo
    Xin, Junchang
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 221 - 235
  • [44] Similarity measures for Chinese short text based on representation learning
    University of Science and Technology Beijing, Beijing, China
    不详
    J. Inf. Comput. Sci., 6 (2253-2263):
  • [45] A comparative study of two short text semantic similarity measures
    O'Shea, James
    Bandar, Zuhair
    Crockett, Keeley
    McLean, David
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 172 - 181
  • [46] A Framework for Measuring Similarity between Terms in Short Text Categorization
    Nandini, V
    Chitra, Janani R.
    Maheswari, P. Uma
    PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
  • [47] A survey on the techniques, applications, and performance of short text semantic similarity
    Han, Mengting
    Zhang, Xuan
    Yuan, Xin
    Jiang, Jiahao
    Yun, Wei
    Gao, Chen
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [48] MEASURING SHORT TEXT SEMANTIC SIMILARITY USING MULTIPLE MEASUREMENTS
    Zhu, Tian-Tian
    Lan, Man
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 808 - 813
  • [49] Short Text Similarity Calculation Based on Jaccard and Semantic Mixture
    Wu, Shushu
    Liu, Fang
    Zhang, Kai
    Communications in Computer and Information Science, 2021, 1363 CCIS : 37 - 45
  • [50] A SHORT TEXT SIMILARITY CALCULATION METHOD BASED ON DEEP LEARNING
    Xu, Yong
    Peng, Yunke
    Wang, Hengna
    Wang, Xue’Er
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2024, 86 (01): : 91 - 104