A methodology for noun phrase-based automatic indexing

被引:0
|
作者
Souza, Renato Rocha
Raghavan, K. S.
机构
[1] Univ Fed Minas Gerais, Sch Informat Sci, Dept Org & Tratamento Informac, BR-31161970 Belo Horizonte, MG, Brazil
[2] Indian Stat Inst, Documentat Res & Training Ctr, Bangalore 560059, Karnataka, India
来源
KNOWLEDGE ORGANIZATION | 2006年 / 33卷 / 01期
关键词
D O I
暂无
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The scholarly community is increasingly employing the Web both for publication of scholarly output and for locating and accessing relevant scholarly literature. Organization of this vast body of digital information assumes significance in this context. The sheer volume of digital information to be handled makes traditional indexing and knowledge representation strategies ineffective and impractical. It is, therefore, worth exploring new approaches. An approach being discussed considers the intrinsic semantics of texts of documents. Based on the hypothesis that noun phrases in a text are semantically rich in terms of their ability to represent the subject content of the document, this approach seeks to identify and extract noun phrases instead of single keywords, and use them as descriptors. This paper presents a methodology that has been developed for extracting noun phrases from Portuguese texts. The results of an experiment carried out to test the adequacy of the methodology are also presented.
引用
收藏
页码:45 / 56
页数:12
相关论文
共 50 条
  • [1] Efficient phrase-based document indexing for web document clustering
    Hammouda, KM
    Kamel, MS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (10) : 1279 - 1296
  • [2] Augmenting phrase-based text representation with conceptual indexing for effective retrieval
    Sharma, R
    Raj, PCR
    Raman, S
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 27 - 31
  • [3] Statistical phrase-based translation
    Koehn, P
    Och, FJ
    Marcu, D
    [J]. HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 127 - 133
  • [4] Hierarchical phrase-based translation
    Chiang, David
    [J]. COMPUTATIONAL LINGUISTICS, 2007, 33 (02) : 201 - 228
  • [5] A Comparative Study on Applying Hierarchical Phrase-based and Phrase-based on Thai-Chinese Translation
    Luekhong, Prasert
    Sukhauta, Rattasit
    Porkaew, Peerachet
    Ruangrajitpakorn, Taneth
    Supnithi, Thepchai
    [J]. 2012 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS (KICSS 2012), 2012, : 126 - 133
  • [6] On the Cost of Phrase-Based Ranking
    Petri, Matthias
    Moffat, Alistair
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 931 - 934
  • [7] Phrase-based Image Captioning
    Lebret, Remi
    Pinheiro, Pedro O.
    Collobert, Ronan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2085 - 2094
  • [8] A PHRASE-BASED MATCHING FUNCTION
    GALBIATI, G
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1991, 42 (01): : 36 - 48
  • [9] Integrating Phrase Inseparability in Phrase-Based Model
    Shi, Lixin
    Nie, Jian-Yun
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 708 - 709
  • [10] Noun phase selection in automatic indexing
    do Nascimento, Gustavo Diniz
    Correa, Renato Fernandes
    [J]. ENCONTROS BIBLI-REVISTA ELETRONICA DE BIBLIOTECONOMIA E CIENCIA DA INFORMACAO, 2019, 24 (55):