Transformation algorithms for data streams

被引:0
|
作者
Eick, Stephen G. [1 ]
Lockwood, John W. [1 ]
Loui, Ron [1 ]
Moscola, James [1 ]
Weishar, Doyle J. [1 ]
机构
[1] SSS Res Inc, Lisle, IL 60532 USA
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Next generation data processing systems must deal with very high data ingest rates and massive volumes of data. Such conditions are typically encountered in the Intelligence Community (IC) where analysts must search through huge volumes of data in order to gather evidence to support or refute their hypotheses. Their effort is made all the more difficult given that the data appears as unstructured text that is written in multiple languages using characters that have different encodings. Human Analysts have not been able to keep pace with reading the data and a large amount of data is discarded even though it might contain key information. The goal of our project is to assess the feasibility of incrementally replacing humans with automation in key areas of information processing. These areas include document ingest, content categorization, language translation, and context-and-temporally-based information retrieval.
引用
收藏
页码:3092 / 3101
页数:10
相关论文
共 50 条
  • [1] A Comparison of Clustering Algorithms for Data Streams
    Pereira, Cassio M. M.
    de Mello, Rodrigo F.
    INTEGRATED COMPUTING TECHNOLOGY, 2011, 165 : 59 - 74
  • [2] TADS: Transformation of Anomalies in Data Streams
    Green, Wyatt
    Johnsten, Tom
    Benton, Ryan G.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4284 - 4292
  • [3] Algorithms and estimators for summarization of unaggregated data streams
    Cohen, Edith
    Duffield, Nick
    Kaplan, Haim
    Lund, Carstent
    Thorup, Mikkel
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (07) : 1214 - 1244
  • [4] A Survey of Outlier Detection Algorithms for Data Streams
    Tamboli, Jinita
    Shukla, Madhu
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3535 - 3540
  • [5] Review of Anomaly Detection Algorithms for Data Streams
    Lu, Tianyuan
    Wang, Lei
    Zhao, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [6] Better Algorithms for Counting Triangles in Data Streams
    McGregor, Andrew
    Vorotnikova, Sofya
    Vu, Hoa T.
    PODS'16: PROCEEDINGS OF THE 35TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2016, : 401 - 411
  • [7] TCAM-conscious algorithms for data streams
    Bandi, Nagender
    Metwally, Ahmed
    Agrawal, Divyakant
    El Abbadi, Amr
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1317 - +
  • [8] Survey on Feature Transformation Techniques for Data Streams
    Bahri, Maroua
    Bifet, Albert
    Maniu, Silviu
    Gomes, Heitor Murilo
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4796 - 4802
  • [9] DataX: A System for Data eXchange and Transformation of Streams
    Coviello, Giuseppe
    Rao, Kunal
    Sankaradas, Murugan
    Chakradhar, Srimat
    INTELLIGENT DISTRIBUTED COMPUTING XIV, 2022, 1026 : 319 - 329
  • [10] Harnessing the Strengths of Anytime Algorithms for Constant Data Streams
    Kranen, Philipp
    Seidl, Thomas
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 31 - 31