A machine learning model for information retrieval with structured documents

被引:0
|
作者
Piwowarski, B [1 ]
Gallinari, P [1 ]
机构
[1] Univ Paris 06, LIP6, F-75015 Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most recent document standards rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. Only a few models have been proposed for handling structured documents, and the design of such systems is still an open problem. We present here a new model for structured document retrieval which allows to compute and to combine the scores of document parts. It is based on bayesian networks and allows for learning the model parameters in the presence of incomplete data. We present an application of this model for ad-hoc retrieval and evaluate its performances on a small structured collection. The model can also be extended to cope with other tasks such as interactive navigation in structured documents or corpus.
引用
收藏
页码:425 / 438
页数:14
相关论文
共 50 条
  • [1] Machine learning ranking for structured information retrieval
    Vittaut, Jean-Noel
    Gallinari, Patrick
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 338 - 349
  • [2] Typed structured documents for information retrieval
    Dharap, C
    Bowman, CM
    [J]. PRINCIPLES OF DOCUMENT PROCESSING, 1997, 1293 : 135 - 151
  • [3] Construction of Model of Structured Documents Based on Machine Learning
    Golubev, Sergey
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, 2011, 6744 : 424 - 431
  • [4] Information theoretic retrieval with structured queries and documents
    Carpineto, Claudio
    Romano, Giovanni
    Caracciolo, Caterina
    [J]. COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 178 - 184
  • [5] STRUCTURED DOCUMENTS RECOGNITION BASED ON MACHINE LEARNING
    Golubev, S. V.
    [J]. BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2011, 16 (02): : 48 - 55
  • [6] Machine Learning for Information Retrieval
    Si, Luo
    Jin, Rong
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1293 - 1293
  • [7] Design and implementation of a structured information retrieval system for SGML documents
    Han, SG
    Son, JH
    Chang, JW
    Zhoo, ZC
    [J]. 6TH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 1999, : 81 - 88
  • [8] Information Retrieval in Educational Structured Documents Adapted to Learners Needs
    Iddir, Ounnaci
    Rachid, Ahmed-Ouamer
    [J]. 2014 4TH INTERNATIONAL SYMPOSIUM ISKO-MAGHREB: CONCEPTS AND TOOLS FOR KNOWLEDGE MANAGEMENT (ISKO-MAGHREB), 2014,
  • [9] Abductive retrieval of structured documents
    Muller, AA
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 1997, 26 (01) : 15 - 28
  • [10] Comparing Machine Learning and Information Retrieval-Based Approaches for Filtering Documents in a Parliamentary Setting
    de Campos, Luis M.
    Fernandez-Luna, Juan M.
    Huete, Juan F.
    Redondo-Exposito, Luis
    [J]. SCALABLE UNCERTAINTY MANAGEMENT (SUM 2017), 2017, 10564 : 64 - 77