Stalker, a Multilingual Text Mining Search Engine for Open Source Intelligence

Authors
Neri, F. [1]
Pettoni, M. [2]
Affiliations
[1] Lex Syst Dept, Via Malasoma 24, I-56121 Ospedaletto Pisa, Italy
[2] Stato Maggiore Difesa, Informat & Secur Dept (RIS) II, CIFI GE, Rome, Italy
Keywords
open source intelligence; focused crawling; natural language processing; morphological analysis; syntactic analysis; functional analysis; supervised clustering; unsupervised clustering
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Open Source Intelligence (OSINT) is an intelligence gathering discipline that involves collecting information from open sources and analyzing it to produce usable intelligence. In recent years, the international intelligence communities have seen open sources become increasingly easy and cheap to acquire. However, up to 80% of electronic data is textual, and the most valuable information is often hidden and encoded in pages that are neither structured nor classified. The process of accessing these raw data, heterogeneous in source and language, and transforming them into information is therefore strongly linked to automatic textual analysis and synthesis, which are in turn closely related to the ability to master the problems of multilinguality. This paper describes a content enabling system that provides deep semantic search and information access to large quantities of distributed multimedia data for both experts and the general public. STALKER provides language-independent search and dynamic classification features for a broad range of data collected from several sources in a number of culturally diverse languages.
Pages: 35 / +
Page count: 3