Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers

被引:26
|
作者
Srinivasa, K. [1 ]
Thilagam, P. Santhi [1 ]
机构
[1] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India
关键词
Ontology; Natural Language Processing; Integration; Information Extraction; Knowledge Representation; SEMANTIC SIMILARITY; DISAMBIGUATION; RECOGNITION; EXTRACTION; ONTOLOGY;
D O I
10.1016/j.ipm.2019.102059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era of internet, information related to crime is scattered across many sources namely news media, social networks, blogs, and video repositories, etc. Crime reports published in online newspapers are often considered as reliable compared to crowdsourced data like social media and contain crime information not only in the form of unstructured text but also in the form of images. Given the volume and availability of crime-related information present in online newspapers, gathering and integrating crime entities from multiple modalities and representing them as a knowledge base in machine-readable form will be useful for any law enforcement agencies to analyze and prevent criminal activities. Extant research works to generate the crime knowledge base, does not address extraction of all non-redundant entities from text and image data present in multiple newspapers. Hence, this work proposes Crime Base, an entity relationship based system to extract and integrate crime related text and image data from online newspapers with a focus towards reducing duplicity and loss of information in the knowledge base. The proposed system uses a rule-based approach to extract the entities from text and image captions. The entities extracted from text data are correlated using contextual as-well-as semantic similarity measures and image entities are correlated using low-level and high-level image features. The proposed system also presents an integrated view of these entities and their relations in the form of a knowledge base using OWL. The system is tested for a collection of crime related articles from popular Indian online newspapers.
引用
收藏
页数:19
相关论文
共 38 条
  • [1] Storybase: Towards Building a Knowledge Base for News Events
    Wu, Zhaohui
    Liang, Chen
    Giles, C. Lee
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2015): SYSTEM DEMONSTRATIONS, 2015, : 133 - 138
  • [2] Towards Building a Knowledge Base of Monetary Transactions from a News Collection
    Benetka, Jan R.
    Balog, Krisztian
    Norvag, Kjetil
    [J]. 2017 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2017), 2017, : 209 - 218
  • [3] Bootstrapping an Online News Knowledge Base
    Hoxha, Klesti
    Baxhaku, Artur
    Ninka, Ilia
    [J]. WEB ENGINEERING (ICWE 2016), 2016, 9671 : 501 - 506
  • [4] Developing a knowledge base for crime prevention: lessons learned from the British experience
    Tilley, Nick
    Laycock, Gloria
    [J]. CRIME PREVENTION & COMMUNITY SAFETY, 2018, 20 (04) : 228 - 242
  • [5] Towards Building a Knowledge Base for Research on Andean Weaving
    Arnold, Denise Y.
    Helmer, Sven
    Arando, Rodolfo Velasquez
    [J]. DATASPACE: THE FINAL FRONTIER, PROCEEDINGS, 2009, 5588 : 180 - +
  • [6] Tab2Know: Building a Knowledge Base from Tables in Scientific Papers
    Kruit, Benno
    He, Hongyu
    Urbani, Jacopo
    [J]. SEMANTIC WEB - ISWC 2020, PT I, 2020, 12506 : 349 - 365
  • [7] Towards systematic knowledge building: An anti-crime research and development continuum
    Adele V. Harrell
    [J]. Journal of Experimental Criminology, 2006, 2 (3) : 339 - 344
  • [8] Automatic construction of knowledge base from biological papers
    Ohta, Y
    Yamamoto, Y
    Okazaki, T
    Uchiyama, I
    Takagi, T
    [J]. ISMB-97 - FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY, PROCEEDINGS, 1997, : 218 - 225
  • [9] From Field Notes Towards a Knowledge Base
    Lendvai, Piroska
    Hunt, Steve
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 644 - 649
  • [10] Richness, Retrievability and Reliability–Issues in a Working Knowledge Base for Good Practice in Crime Prevention
    Karen Bullock
    Paul Ekblom
    [J]. European Journal on Criminal Policy and Research, 2010, 16 : 29 - 47