Probabilistic data fusion on a large document collection

被引:0
|
作者
David Lillis
Fergus Toolan
Rem Collier
John Dunnion
机构
[1] University College Dublin,School of Computer Science and Informatics
[2] Griffith College Dublin,Faculty of Computing Science
来源
关键词
Data fusion; Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Data fusion is the process of combining the output of a number of Information Retrieval (IR) algorithms into a single result set, to achieve greater retrieval performance. ProbFuse is a data fusion algorithm that uses the history of the underlying IR algorithms to estimate the probability that subsequent result sets include relevant documents in particular positions. It has been shown to out-perform CombMNZ, the standard data fusion algorithm against which to compare performance, in a number of previous experiments. This paper builds upon this previous work and applies probFuse to the much larger Web Track document collection from the 2004 Text REtreival Conference. The performance of probFuse is compared against that of CombMNZ using a number of evaluation measures and is shown to achieve substantial performance improvements.
引用
收藏
页码:23 / 34
页数:11
相关论文
共 50 条
  • [1] Probabilistic data fusion on a large document collection
    Lillis, David
    Toolan, Fergus
    Collier, Rem
    Dunnion, John
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2006, 26 (1-2) : 23 - 34
  • [2] Probabilistic methods for data fusion
    Mohammad-Djafari, A
    [J]. MAXIMUM ENTROPY AND BAYESIAN METHODS, 1998, 98 : 57 - 69
  • [3] On Probabilistic Data Collection in the NOTICE Architecture
    El-Tawab, Samy
    Wang, Xianping
    Alhafdhi, Ahmed
    Olariu, Stephan
    [J]. 2014 IEEE 11TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2014, : 642 - 650
  • [4] Probabilistic data association applications to data fusion
    Quaranta, Carlo
    Balzarotti, Giorgio
    [J]. OPTICAL ENGINEERING, 2008, 47 (02)
  • [5] Real-Time Probabilistic Data Fusion for Large-Scale IoT Applications
    Akbar, Adnan
    Kousiouris, George
    Pervaiz, Haris
    Sancho, Juan
    Ta-Shma, Paula
    Carrez, Francois
    Moessner, Klaus
    [J]. IEEE ACCESS, 2018, 6 : 10015 - 10027
  • [6] Data fusion with probabilistic conditional logic
    Fisseler, Jens
    Feher, Imre
    [J]. LOGIC JOURNAL OF THE IGPL, 2010, 18 (04) : 488 - 507
  • [7] Snapshot/Continuous Data Collection Capacity for Large-Scale Probabilistic Wireless Sensor Networks
    Ji, Shouling
    Beyah, Raheem
    Cai, Zhipeng
    [J]. 2012 PROCEEDINGS IEEE INFOCOM, 2012, : 1035 - 1043
  • [8] Toward Probabilistic Data Collection in the NOTICE Architecture
    Wang, Xianping
    El-Tawab, Samy
    Alhafdhi, Ahmed
    Almalag, Mohammad
    Olariu, Stephan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (12) : 3354 - 3363
  • [9] DATA COMPRESSION OF LARGE DOCUMENT DATA BASES
    HEAPS, HS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1975, 15 (01): : 32 - 39
  • [10] Toward a Robust Data Fusion for Document Retrieval
    He, Daqing
    Dan Wu
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 338 - +