A Probabilistic Retrieval Model for Semistructured Data

被引:0
|
作者
Kim, Jinyoung [1 ]
Xue, Xiaobing [1 ]
Croft, W. Bruce [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
关键词
LANGUAGE MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Retrieving semistructed (XML) data typically requires either a structured query such as XPath, or a keyword query that does not take structure into account. IN this paper, we infer structural information automatically from keyword queries and incorporate this into a retrieval model. More specifically, we propose the concept of a mapping probability, which maps each query word into a related field (or XML element). This mapping probability is used as a weight to combine the language models estimated from each field. Experiments on two test collections show that our retrieval model based on mapping probabilities outperforms baseline techniques significantly.
引用
收藏
页码:228 / +
页数:2
相关论文
共 50 条
  • [1] PXML: A probabilistic semistructured data model and algebra
    Hung, E
    Getoor, L
    Subrahmanian, VS
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 467 - 478
  • [2] A framework for management of semistructured probabilistic data
    Zhao, WZ
    Dekhtyar, A
    Goldsmith, J
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2005, 25 (03) : 293 - 332
  • [3] Model-checking based data retrieval - An application to semistructured and temporal data
    Quintarelli, E
    [J]. MODEL-CHECKING BASED DATA RETRIEVAL: AN APPLICATION TO SEMISTRUCTURED AND TEMPORAL DATA, 2004, 2917 : 1 - 134
  • [4] A Framework for Management of Semistructured Probabilistic Data
    Wenzhong Zhao
    Alex Dekhtyar
    Judy Goldsmith
    [J]. Journal of Intelligent Information Systems, 2005, 25 : 293 - 332
  • [5] Probabilistic and prioritized data retrieval in the Linda coordination model
    Bravetti, M
    Gorrieri, R
    Lucchi, R
    Zavattaro, G
    [J]. COORDINATION MODELS AND LANGUAGES, PROCEEDINGS, 2004, 2949 : 55 - 70
  • [6] Semistructured probabilistic databases
    Dekhtyar, A
    Goldsmith, J
    Hawkes, SR
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2001, : 36 - 45
  • [7] Modeling semistructured data by the adjacency model
    Töyli, J
    Linna, M
    Wanne, M
    [J]. KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2002, 80 : 282 - 290
  • [8] Cooperative queries in semistructured data model
    Menon, K
    Madria, S
    Badia, A
    [J]. E-COMMERCE AND WEB TECHNOLOGIES, PROCEEDINGS, 2003, 2738 : 237 - 247
  • [9] A geographic, multimedia, and temporal data model for semistructured data
    Belussi, A
    Combi, C
    Migliorini, S
    Oliboni, B
    [J]. Sixteenth International Workshop on Database and Expert Systems Applications, Proceedings, 2005, : 463 - 467
  • [10] A Content-Oriented Data Model for Semistructured Data
    Novotny, Tomas
    [J]. DATESO 2007 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 7TH ANNUAL INTERNATIONAL WORKSHOP, 2007, 235 : 55 - 66