Named Entities as a Metadata Resource for Indexing and Searching Information

被引:1
|
作者
Izo, Flavio [1 ,2 ]
Oliveira, Elias [1 ]
Badue, Claudine [1 ]
机构
[1] Univ Fed Espirito Santo, Programa Posgrad Informat, Vitoria, ES, Brazil
[2] Inst Fed Espirito Santo, Cachoeiro De Itapemirim, Brazil
关键词
Artificial intelligence; Indexing; Named Entity Recognition; Natural Language Processing; Search engine;
D O I
10.1007/978-3-030-96308-8_78
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) is an essential task in Natural Language Processing (NLP). By using NER, it is possible to create associations in a text to recognize real-world entities. The data indexing process is also considered a vital resource, as it makes it easier to find texts in a set of documents. When we analyze a search engine, we aim at the ease of the user's search process. Indexing recognized entities could help the search engine find data with a high semantic index, therefore, more accurate. This study aims to investigate the automatic transformation of annotated entities as indexes for a search engine. The recognition of entities used the hybrid model CRF+LG. Search engines usually work with keyword localization (tokens). However, this research aimed to use a semantic search, as it improves the quality of the results by understanding the user's intention using enricher meta factors besides the keyword. We performed ten experiments using P@{5, 10, and 20} and the search engine with a high semantic index achieved accuracy of 100%, correctly returning all results. The search engine without NER was confused when producing results for person and organization categories, mainly.
引用
收藏
页码:838 / 848
页数:11
相关论文
共 50 条
  • [1] From subtitles to substantial metadata: examining characteristics of named entities and their role in indexing
    Husevag, Anne-Stine Ruud
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2019, 20 (03) : 241 - 251
  • [2] From subtitles to substantial metadata: examining characteristics of named entities and their role in indexing
    Anne-Stine Ruud Husevåg
    [J]. International Journal on Digital Libraries, 2019, 20 : 241 - 251
  • [3] Indexing concepts and/or named entities
    Buizza, Pino
    [J]. JLIS.IT, 2011, 2 (02):
  • [4] Exploring the Role of Named Entities in Automatic Indexing
    Husevag, Anne-Stine Ruud
    [J]. CHIIR'17: PROCEEDINGS OF THE 2017 CONFERENCE HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2017, : 393 - 394
  • [5] Large-scale controlled vocabulary indexing for named entities
    Wasson, M
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 276 - 281
  • [6] Suggesting named entities for information access
    Amigó, E
    Peñas, A
    Gonzalo, J
    Verdejo, F
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 557 - 561
  • [7] Identifying Medical Named Entities with Word Information
    Ben, Yanyan
    Pang, Xueqin
    [J]. Data Analysis and Knowledge Discovery, 2023, 7 (05) : 123 - 132
  • [8] UNED at ImageCLEF 2005: Automatically structured queries with named entities over metadata
    Peinado, Victor
    Lopez-Ostenero, Fernando
    Gonzalo, Julio
    Verdejo, Felisa
    [J]. ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 578 - 581
  • [9] Measuring semantic similarity between named entities by searching the web directory
    Liu, Iiahui
    BimbauM, Larry
    [J]. PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 461 - +
  • [10] Named Entities as Privileged Information for Hierarchical Text Clustering
    Sinoara, Roberta A.
    Sundermann, Camila V.
    Marcacini, Ricardo M.
    Domingues, Marcos A.
    Rezende, Solange O.
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, : 57 - 66