Reformulation of Telugu Web Query using Word Semantic Relationships

被引：0

作者：

Kolikipogu, Ramakrishna ^{[1
]}

Rani, Padmaja B. ^{[2
]}

Kakulapati, Vijayalakshmi ^{[3
]}

机构：

[1] CMR Coll Engn & Tech, Dept CSE, Hyderabad, Andhra Pradesh, India

[2] JNTU Coll Engn, Dept CSE, Hyderabad, Andhra Pradesh, India

[3] RRS Coll Engn & Tech, Dept CSE, Muthangi, Medak, India

来源：

PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12) | 2012年

关键词：

Information Retrievals; Web Query; Semantic Network; Query Reformulation; Synset; WordNet; Tokenization; Lemmatization; POS Tagging; Boolean Model; Vector Space Model; Indic Scripts;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Use of Internet becomes more popular in India to avail the information needs. A major area of Information browsing includes Education, Medical, Agriculture, Geographical, Business and other social domains. Availability of electronic documents for Indian Language is growing day by day. The people living throughout India speak different languages. The government of India has given "languages of the 8th Schedule" official status for 22 languages. Compare to European languages and other Indian languages, processing of Telugu language electronic documents is more difficult in nature. This is due to multi - encoding formats of the text. Indian languages are encoded using Unicode, ISCII. To fasten the retrieval process the Unicode or ISCII is need to be converted into simple and standard encoding which makes Information Retrieval as easy task. Once the information processing system is build for a mono-lingual, it is the base to go for Multi-lingual and Cross - lingual information processing. In Information Retrieval process users expects exact results for the given query. It depends on the vocabulary expertization of the end user in building the root query. Word mismatch is common problem of all languages in Information Retrieval process. Query Expansion gives a solution to the word mismatch problem. In Query Expansion the top ranked documents are used to expand the query terms. Sometimes user need to judge the relevance of the expanded query to iterate search. The relevance judgment of the user depends on the knowledge (i.e Language knowledge to describe the context of the query) of the user. If the concept hierarchy is properly defined, then user involvement is void in this scenario. This can be easily test on English language, but applying Query Reformulation technique directly on Indian languages is not stands good, because the nature of Indian languages is not simple like English. The Paper is aimed to reduce the mismatch between user query and retrieved documents by using semantic relationships between query terms and document terms. To test the proposed model, Telugu language, one of the Indian languages is taken as a case study. True translation from English to Telugu and vice versa is not possible due to high word conflation in Indian languages. This paper is an attempt to adopt Semantic Network with semantic relationships between terms of a query to reformulate and iterate the search. Method of Relevance Feedback improves recall without compromising precision, but it works well on limited corpus. Reformulation of query by embedding WordNet, ConceptNet relationships gave better results, but great fall of precision is observed. Comparison between initial query test results and reformulated query search results are made in result analysis.

引用

页码：774 / 780

页数：7

共 50 条

[1] Towards distributed information retrieval in the Semantic Web: Query reformulation using the oMAP framework
Straccia, Umberto
Troncy, Raphael
[J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2006, 4011 : 378 - 392
[2] Web Query Reformulation Using Differential Evolution
Mahanti, Prabhat K.
Al-Fayoumi, Mohammad
Banerjee, Soumya
Al-Obeidat, Feras
[J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 484 - +
[3] A methodology for query reformulation in CIS using semantic knowledge
Florescu, D
Raschid, L
Valduriez, P
[J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1996, 5 (04): : 431 - 467
[4] SEMANTIC-BASED COMPOSITION OF MODULAR ONTOLOGIES APPLIED TO WEB QUERY REFORMULATION
Elloumi-Chaabene, Manel
Ben Mustapha, Nesrine
Baazaoui-Zghal, Hajer
Moreno, Antonio
Sanchez, David
[J]. ICSOFT 2011: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATABASE TECHNOLOGIES, VOL 1, 2011, : 305 - 308
[5] Query Reformulation Using Ontology and Keyword for Durian Web Search
Azizan, Azilawati
Abu Bakar, Zainab
Noah, Shahrul Azman
[J]. 2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, : 94 - 100
[6] Web query reformulation by knowledgeable agents
Sen, S
Saha, S
Dutta, PS
[J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 659 - 662
[7] Generalized Syntactic and Semantic Models of Query Reformulation
Herdagdelen, Amac
Ciaramita, Massimiliano
Mahler, Daniel
Holmqvist, Maria
Hall, Keith
Riezler, Stefan
Alfonseca, Enrique
[J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 283 - 290
[8] Efficient Algorithm for Web Search Query Reformulation Using Genetic Algorithm
Singh, Vikram
Garg, Siddhant
Kaur, Pradeep
[J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 459 - 470
[9] Patterns of Query Reformulation During Web Searching
Jansen, Bernard J.
Booth, Danielle L.
Spink, Amanda
[J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (07): : 1358 - 1371
[10] Influences on Query Reformulation in Collaborative Web Search
Yue, Zhen
Han, Shuguang
He, Daqing
Jiang, Jiepu
[J]. COMPUTER, 2014, 47 (03) : 46 - 53

← 1 2 3 4 5 →