Automatic search from streaming data

被引:0
|
作者
Anni R. Coden
Eric W. Brown
机构
[1] IBM,T.J. Watson Research Center
来源
Information Retrieval | 2006年 / 9卷
关键词
Speech retrieval; Text mining; Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Streaming data poses a variety of new and interesting challenges for information retrieval and text analysis. Unlike static document collections, which are typically analyzed and indexed off-line to support ad-hoc queries, streaming data often must be analyzed on the fly and acted on as the data passes through the analysis system. Speech is one example of streaming data that is a challenge to exploit, yet has significant potential to provide value in a knowledge management system. We are specifically interested in techniques that analyze streaming data and automatically find collateral information, or information that clarifies, expands, and generally enhances the value of the streaming data. We present a system that analyzes a data stream and automatically finds documents related to the current topic of discussion in the data stream. Experimental results show that the system generates result lists with an average precision at 10 hits of better than 60%. We also present a hit-list re-ranking technique based on named entity analysis and automatic text categorization that can improve the search results by 6%–12%.
引用
收藏
页码:95 / 109
页数:14
相关论文
共 50 条
  • [41] Generating Graph Snapshots from Streaming Edge Data
    Soundarajan, Sucheta
    Tamersoy, Acar
    Khalil, Elias B.
    Eliassi-Rad, Tina
    Chau, Duen Horng
    Gallagher, Brian
    Roundy, Kevin
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 109 - 110
  • [42] IoT streaming data integration from multiple sources
    Tu, Doan Quang
    Kayes, A. S. M.
    Rahayu, Wenny
    Nguyen, Kinh
    COMPUTING, 2020, 102 (10) : 2299 - 2329
  • [43] Learning to Forecast Dynamical Systems from Streaming Data
    Giannakis, Dimitrios
    Henriksen, Amelia
    Tropp, Joel A.
    Ward, Rachel
    SIAM JOURNAL ON APPLIED DYNAMICAL SYSTEMS, 2023, 22 (02): : 527 - 558
  • [44] A Rough Set System for Mining from Streaming Data
    Wei, Yidong
    Leung, Carson K.
    Li, Cheng
    2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2022,
  • [45] Estimating Quantiles from the Union of Historical and Streaming Data
    Singh, Sneha Aman
    Srivastava, Divesh
    Tirthapura, Srikanta
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 10 (04): : 433 - 444
  • [46] StreamTX: Extracting Tuples from Streaming XML Data
    Han, Wook-Shin
    Jiang, Haifeng
    Ho, Howard
    Li, Quanzhong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 289 - 300
  • [47] Real time data streaming from smart phones
    Rowlands, David
    James, Daniel
    5TH ASIA-PACIFIC CONGRESS ON SPORTS TECHNOLOGY (APCST), 2011, 13 : 464 - 469
  • [48] Streaming principal component analysis from incomplete data
    Eftekhari, Armin
    Ongie, Gregory
    Balzano, Laura
    Wakin, Michael B.
    Journal of Machine Learning Research, 2019, 20
  • [49] Parallel Detection of Temporal Events from Streaming Data
    Wang, Hao
    Feng, Ling
    Xue, Wenwei
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 639 - +
  • [50] Mining Streaming and Temporal Data: from Representation to Knowledge
    Zhang, Xiangliang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5744 - 5748