An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

被引:0
|
作者
Ben Ayed, Alaidine [1 ]
Biskri, Ismail [2 ]
Meunier, Jean-Guy [3 ]
机构
[1] Univ Quebec Montreal, Cognit Comp Sci, Montreal, PQ, Canada
[2] Univ Quebec Trois Rivieres, Comp Sci Dept, Computat Linguist & Artificial Intelligence, Trois Rivieres, PQ, Canada
[3] Univ Quebec Montreal, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data and Knowledge Representation; Document Retrieval; Internet and Web Applications; Mono/Multi-Document Summarization; RELEVANCE;
D O I
10.4018/IJIRR.289950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of big data and the Industrial Revolution 4.0 era, enhancing document/information retrieval framework efficiency to handle the ever-growing volume of text data in an ever more digital world is a must. This article describes a double-stage system of document/information retrieval. First, a Lucene-based document retrieval tool is implemented, and a couple of query expansion techniques using a comparable corpus (Wikipedia) and word embeddings are proposed and tested. Second, a retention-fidelity summarization protocol is performed on top of the retrieved documents to create a short, accurate, and fluent extract of a longer retrieved single document (or a set of top retrieved documents). Obtained results show that using word embeddings is an excellent way to achieve higher precision rates and retrieve more accurate documents. Also, obtained summaries satisfy the retention and fidelity criteria of relevant summaries.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Multimodal End-to-End Visual Document Parsing
    Lu, Yujiang
    Qiu, Weifeng
    Hong, Yinghua
    Wang, Jiayi
    HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 154 - 163
  • [22] Attention-based end-to-end CNN framework for content-based X-ray image retrieval
    Ozturk, Saban
    Alhudhaif, Adi
    Polat, Kemal
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 : 2680 - 2693
  • [23] End-to-End Algorithm for Absolute Phase Retrieval
    Lu, Jin
    Li, Yuan
    Xu, Jian
    Wang, Fu P.
    Sun, Xiao G.
    TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION OPTICS AND PHOTONICS (CIOP 2021), 2021, 12057
  • [24] End-to-end Learning for Encrypted Image Retrieval
    Feng, Qihua
    Li, Peiya
    Lu, ZhiXun
    Liu, Guan
    Huang, Feiran
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1839 - 1845
  • [25] An end-to-end joint model for evidence information extraction from court record document
    Ji, Donghong
    Tao, Peng
    Fei, Hao
    Ren, Yafeng
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [26] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Jiang, Feng
    Tao, Wen
    Liu, Shaohui
    Ren, Jie
    Guo, Xun
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 3007 - 3018
  • [27] An End-to-End Lane Detection Framework Based on Geometry Transform
    Kou, Genghua
    Wang, Weida
    Yang, Chao
    Xiang, Changle
    Li, Ying
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2456 - 2466
  • [28] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Tao, Wen
    Jiang, Feng
    Zhang, Shengping
    Ren, Jie
    Shi, Wuzhen
    Zuo, Wangmeng
    Guo, Xun
    Zhao, Debin
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 463 - 463
  • [29] End-to-End Computer Vision Framework
    Orhei, Ciprian
    Mocofan, Muguras
    Vert, Silviu
    Vasiu, Radu
    2020 14TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2020, : 63 - 66
  • [30] Retargeting Video With an End-to-End Framework
    Le, Thi-Ngoc-Hanh
    Huang, HuiGuang
    Chen, Yi-Ru
    Lee, Tong-Yee
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6164 - 6176