Efficient Key Hash Indexing Scheme with Page Rank for Category Based Search Engine Big Data

Cited by: 0
Author
Ragavan, N. [1]
Affiliation
[1] Srimax Software Technol, Sivakasi, India
Keywords
Big Data; Search Engine; Key Hash Indexing; Search Term; Page Ranking
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Big Data, true to its name, deals with large volumes of data and is characterized by volume, variety and velocity. Big Data has enabled the development of today's highly capable online search engines. Search engine systems differ in how indexing and page ranking are performed. Without indexing, a search engine would require considerable time and computing power to answer a query. For example, while an index of 10,000 documents can be queried within milliseconds, a sequential scan of every word in 10,000 large documents could take hours. Common indexing methods are not suitable for search engine Big Data because they greatly increase the size of the data and reduce scalability. This paper proposes a new method of indexing search engine Big Data, called the Key Hash Indexing scheme, followed by an implementation of Page Rank. A comprehensive presentation of the important technologies and factors for achieving efficient Big Data storage, indexing and ranking in a web search engine is also given. The efficiency of the method is demonstrated with an extensive set of experiments on real data. Experimental results on real-time data sets show that the proposed solution is both effective and efficient in index generation and ranking.
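The contrast the abstract draws between an index lookup and a sequential scan, followed by ranking, can be illustrated with a small sketch. This is not the paper's Key Hash Indexing scheme, whose details are not given in this record; it assumes a plain hash-map inverted index and the textbook power-iteration form of PageRank. The function names `build_index`, `page_rank` and `search`, the damping factor 0.85 and the iteration count are all illustrative assumptions.

```python
from collections import defaultdict


def build_index(docs):
    """Build a hash-map inverted index: term -> set of document ids.

    Assumption: terms are whitespace-separated, lowercased words.
    """
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def page_rank(links, damping=0.85, iterations=50):
    """Textbook power-iteration PageRank.

    `links` maps each page to the list of pages it links to.
    Pages with no outgoing links spread their rank evenly over all pages.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


def search(index, rank, term):
    """Look the term up in the index (no scan of the documents),
    then order the hits by descending rank, breaking ties by id."""
    hits = index.get(term.lower(), set())
    return sorted(hits, key=lambda doc_id: (-rank.get(doc_id, 0.0), doc_id))
```

Calling `search(build_index(docs), page_rank(links), "search")` answers the query with one hash lookup plus a sort of the matching ids, whereas a sequential scan would have to read every word of every document for each query.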
Pages: 6
Related papers
50 in total
  • [1] IntelliSearch: A Search Engine based on Big Data Analytics integrated with Crowdsourcing and category-based search
    Lakhani, Ajeet
    Gupta, Ashish
    Chandrasekaran, K.
    [J]. 2015 INTERNATIONAL CONFERENCE ON CIRCUITS, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2015), 2015
  • [2] Personal Search Engine Based on User Interests and Modified Page Rank
    Harb, Hany M.
    Khalifa, Ahmed R.
    Ishkewy, Hossam M.
    [J]. 2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES 2009), 2009, : 411 - 417
  • [3] A Linear Hash Based Indexing Scheme for Location Dependent Data Broadcast
    Sriprajna, K. J.
    Thilagam, Santhi P.
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 219 - +
  • [4] Big Data retrieval techniques based on Hash Indexing and MapReduce approach with NoSQL Database
    Gayathiri, N. R.
    Jaspher, David D.
    Natarajan, A. M.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019
  • [5] Design and Implementation of Search Engine Based on Big Data
    Zhang Zhifeng
    Han Susu
    [J]. AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (01): 1355 - 1359
  • [6] SSEIM: An Efficient Search Scheme over Encrypted Data with Indexing on Mobile Cloud
    Jeevitha, B. K.
    Sindhija, S.
    Thriveni, J.
    Venugopal, K. R.
    [J]. 2019 FIFTEENTH INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICINPRO): INTERNET OF THINGS, 2019, : 112 - 116
  • [7] Toward a Trend-Based Web Page Rank by Using Big Data on Smartphones
    Choi, Dae-Young
    [J]. ADVANCED SCIENCE LETTERS, 2014, 20 (10-12) : 2134 - 2137
  • [8] A Novel Hash-Based Streaming Scheme for Energy Efficient Full-Text Search in Wireless Data Broadcast
    Yang, Kai
    Shi, Yan
    Wu, Weili
    Gao, Xiaofeng
    Zhong, Jiaofei
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 372 - +
  • [9] Dynamic Partition Forest: An Efficient and Distributed Indexing Scheme for Similarity Search based on Hashing
    Lu, Yangdi
    Bo, Yang
    He, Wenbo
    Nabatchian, Amir
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1059 - 1064
  • [10] Towards the design of efficient hash-based indexing scheme for growing databases on non-volatile memory
    Ma, Zhulin
    Sha, Edwin H-M
    Zhuge, Qingfeng
    Jiang, Weiwen
    Zhang, Runyu
    Gu, Shouzhen
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 : 1 - 12