Automatic search from streaming data

被引:0
|
作者
Anni R. Coden
Eric W. Brown
机构
[1] IBM,T.J. Watson Research Center
来源
Information Retrieval | 2006年 / 9卷
关键词
Speech retrieval; Text mining; Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Streaming data poses a variety of new and interesting challenges for information retrieval and text analysis. Unlike static document collections, which are typically analyzed and indexed off-line to support ad-hoc queries, streaming data often must be analyzed on the fly and acted on as the data passes through the analysis system. Speech is one example of streaming data that is a challenge to exploit, yet has significant potential to provide value in a knowledge management system. We are specifically interested in techniques that analyze streaming data and automatically find collateral information, or information that clarifies, expands, and generally enhances the value of the streaming data. We present a system that analyzes a data stream and automatically finds documents related to the current topic of discussion in the data stream. Experimental results show that the system generates result lists with an average precision at 10 hits of better than 60%. We also present a hit-list re-ranking technique based on named entity analysis and automatic text categorization that can improve the search results by 6%–12%.
引用
收藏
页码:95 / 109
页数:14
相关论文
共 50 条
  • [1] Automatic search from streaming data
    Coden, AR
    Brown, EW
    INFORMATION RETRIEVAL, 2006, 9 (01): : 95 - 109
  • [2] Automatic Labeling Streaming Data for Event Detection from Heterogeneous Sensors
    Dao, Minh-Son
    Zettsu, Koji
    2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 365 - 370
  • [3] Mining streaming emerging patterns from streaming data
    Alhammady, Hamad
    2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 432 - 436
  • [4] IMPROVING STREAMING AUTOMATIC SPEECH RECOGNITION WITH NON-STREAMING MODEL DISTILLATION ON UNSUPERVISED DATA
    Doutre, Thibault
    Han, Wei
    Ma, Min
    Lu, Zhiyun
    Chiu, Chung-Cheng
    Pang, Ruoming
    Narayanan, Arun
    Misra, Ananya
    Zhang, Yu
    Cao, Liangliang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6558 - 6562
  • [5] Continuous Group Nearest Group Search over Streaming Data
    Zhu, Rui
    Li, Chunhong
    Zhang, Anzhen
    Zong, Chuanyu
    Xia, Xiufeng
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 80 - 95
  • [6] Efficient k-NN search on streaming data series
    Liu, XY
    Ferhatosmanoglu, H
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2003, 2750 : 83 - 101
  • [7] Diversity and Novelty on the Web: Search, Recommendation, and Data Streaming Aspects
    Santos, Rodrygo L. T.
    Castells, Pablo
    Altingovde, Ismail Sengor
    Can, Fazli
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 1529 - 1530
  • [8] AUTOMATIC SYSTEM FOR THE STORAGE AND SEARCH OF ANNIHILATION DATA
    ABDURASULEV, Z
    ZHURAVLEVA, G
    MALYAN, V
    CRYSTAL RESEARCH AND TECHNOLOGY, 1988, 23 (03) : 455 - 458
  • [9] AUTOMATIC RULE EXTRACTION FROM STATISTICAL DATA AND FUZZY TREE SEARCH.
    Morishima, Shigeo
    Harashima, Hiroshi
    Systems and Computers in Japan, 1988, 19 (05) : 26 - 37
  • [10] Temporal Geo-Social Personalized Search Over Streaming Data
    Almaslukh, Abdulaziz
    Magdy, Amr
    27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 189 - 198