Reformulation of Telugu Web Query using Word Semantic Relationships

Cited by: 0
Authors
Kolikipogu, Ramakrishna [1]
Rani, Padmaja B. [2]
Kakulapati, Vijayalakshmi [3]
Affiliations
[1] CMR Coll Engn & Tech, Dept CSE, Hyderabad, Andhra Pradesh, India
[2] JNTU Coll Engn, Dept CSE, Hyderabad, Andhra Pradesh, India
[3] RRS Coll Engn & Tech, Dept CSE, Muthangi, Medak, India
Keywords
Information Retrieval; Web Query; Semantic Network; Query Reformulation; Synset; WordNet; Tokenization; Lemmatization; POS Tagging; Boolean Model; Vector Space Model; Indic Scripts
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Internet use is becoming more popular in India as a way to meet information needs. The major areas of information browsing include education, medicine, agriculture, geography, business and other social domains. The availability of electronic documents in Indian languages is growing day by day. People across India speak different languages, and the Government of India has given official status to 22 of them as "languages of the 8th Schedule". Compared to European languages and other Indian languages, Telugu electronic documents are more difficult to process because the text exists in multiple encoding formats: Indian languages are encoded using Unicode and ISCII. To speed up retrieval, Unicode or ISCII text needs to be converted into a simple, standard encoding, which makes Information Retrieval an easier task. Once an information processing system is built for a single language, it serves as the base for multilingual and cross-lingual information processing. In Information Retrieval, users expect exact results for a given query, and this depends on the end user's vocabulary expertise in building the root query. Word mismatch is a common problem in Information Retrieval for all languages, and Query Expansion offers a solution to it. In Query Expansion, the top-ranked documents are used to expand the query terms. Sometimes the user needs to judge the relevance of the expanded query in order to iterate the search. That relevance judgment depends on the user's knowledge (i.e., the language knowledge needed to describe the context of the query). If the concept hierarchy is properly defined, user involvement is unnecessary in this scenario. This can easily be tested on English, but applying a Query Reformulation technique directly to Indian languages does not work well, because Indian languages are not as simple in nature as English.
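The abstract states that Query Expansion uses the top-ranked documents to expand the query terms. A minimal sketch of that idea is given here for illustration only; it is not the authors' implementation. The function name `expand_query`, the parameters `top_k` and `num_new_terms`, the whitespace tokenization, and the frequency-based term selection are all assumptions.

```python
from collections import Counter


def expand_query(query_terms, ranked_docs, top_k=2, num_new_terms=2):
    """Append the most frequent unseen terms from the top_k ranked documents.

    Hypothetical helper: ranked_docs is assumed to be a list of plain-text
    documents already sorted by relevance to the initial query.
    """
    seen = set(query_terms)
    counts = Counter()
    for doc in ranked_docs[:top_k]:
        # Whitespace tokenization is a simplifying assumption.
        for token in doc.split():
            if token not in seen:
                counts[token] += 1
    # Sort by descending frequency, then alphabetically so ties are deterministic.
    candidates = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query_terms) + [term for term, _ in candidates[:num_new_terms]]


if __name__ == "__main__":
    docs = ["rice crop yield", "rice crop water", "cricket score"]
    print(expand_query(["rice"], docs))
```

Only the two top-ranked documents contribute candidate terms in the example, so vocabulary from the lower-ranked third document never enters the expanded query.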
This paper aims to reduce the mismatch between the user query and the retrieved documents by using semantic relationships between query terms and document terms. To test the proposed model, Telugu, one of the Indian languages, is taken as a case study. True translation from English to Telugu and vice versa is not possible because of the high degree of word conflation in Indian languages. The paper attempts to adopt a Semantic Network, with semantic relationships between the terms of a query, to reformulate the query and iterate the search. Relevance Feedback improves recall without compromising precision, but it works well only on a limited corpus. Reformulating the query by embedding WordNet and ConceptNet relationships gave better results, but a large drop in precision was observed. The result analysis compares the search results of the initial query with those of the reformulated query.
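Reformulating a query through synset relationships, as the abstract describes, can be sketched as follows. This is an illustrative assumption rather than the paper's method: the `SYNSETS` table is a hypothetical toy stand-in for a Telugu WordNet, and the names `reformulate` and `to_boolean` are invented for the example. Each query term is widened to its synonym group, and the groups are joined into a Boolean-model query.

```python
# Hypothetical toy synsets; a real system would load these from a WordNet-style resource.
SYNSETS = [
    {"నీరు", "జలం"},          # both mean "water"
    {"పుస్తకం", "గ్రంథం"},    # both mean "book"
]


def reformulate(query):
    """Return one sorted synonym group per whitespace-separated query term."""
    expanded = []
    for term in query.split():
        group = {term}
        for synset in SYNSETS:
            if term in synset:
                group |= synset
        expanded.append(sorted(group))
    return expanded


def to_boolean(expanded):
    """Render synonym groups as a Boolean query: OR within a group, AND across groups."""
    return " AND ".join("(" + " OR ".join(group) + ")" for group in expanded)


if __name__ == "__main__":
    groups = reformulate("నీరు ధర")  # "water price"
    print(to_boolean(groups))
```

Widening every term with OR tends to raise recall at the cost of precision, which matches the trade-off the abstract reports for WordNet-based reformulation.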
Pages: 774-780
Page count: 7
Related papers
50 in total
  • [1] Towards distributed information retrieval in the Semantic Web: Query reformulation using the oMAP framework
    Straccia, Umberto
    Troncy, Raphael
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2006, 4011 : 378 - 392
  • [2] Web Query Reformulation Using Differential Evolution
    Mahanti, Prabhat K.
    Al-Fayoumi, Mohammad
    Banerjee, Soumya
    Al-Obeidat, Feras
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 484 - +
  • [3] A methodology for query reformulation in CIS using semantic knowledge
    Florescu, D
    Raschid, L
    Valduriez, P
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1996, 5 (04) : 431 - 467
  • [4] SEMANTIC-BASED COMPOSITION OF MODULAR ONTOLOGIES APPLIED TO WEB QUERY REFORMULATION
    Elloumi-Chaabene, Manel
    Ben Mustapha, Nesrine
    Baazaoui-Zghal, Hajer
    Moreno, Antonio
    Sanchez, David
    [J]. ICSOFT 2011: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATABASE TECHNOLOGIES, VOL 1, 2011, : 305 - 308
  • [5] Query Reformulation Using Ontology and Keyword for Durian Web Search
    Azizan, Azilawati
    Abu Bakar, Zainab
    Noah, Shahrul Azman
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, : 94 - 100
  • [6] Web query reformulation by knowledgeable agents
    Sen, S
    Saha, S
    Dutta, PS
    [J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 659 - 662
  • [7] Generalized Syntactic and Semantic Models of Query Reformulation
    Herdagdelen, Amac
    Ciaramita, Massimiliano
    Mahler, Daniel
    Holmqvist, Maria
    Hall, Keith
    Riezler, Stefan
    Alfonseca, Enrique
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 283 - 290
  • [8] Efficient Algorithm for Web Search Query Reformulation Using Genetic Algorithm
    Singh, Vikram
    Garg, Siddhant
    Kaur, Pradeep
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 459 - 470
  • [9] Patterns of Query Reformulation During Web Searching
    Jansen, Bernard J.
    Booth, Danielle L.
    Spink, Amanda
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (07) : 1358 - 1371
  • [10] Influences on Query Reformulation in Collaborative Web Search
    Yue, Zhen
    Han, Shuguang
    He, Daqing
    Jiang, Jiepu
    [J]. COMPUTER, 2014, 47 (03) : 46 - 53