Performance issues in distributed shared-nothing information-retrieval systems

Cited by: 5
Authors
Tomasic, A [1]
Garcia-Molina, H [1]
Affiliations
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305
DOI
10.1016/S0306-4573(96)00019-2
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many information-retrieval systems provide access to abstracts. For example, Stanford University, through its FOLIO system, provides access to the INSPEC database of abstracts of the literature on physics, computer science, electrical engineering, etc. In this article, this database is studied by using a trace-driven simulation. The study focuses on a physical-index design that accommodates truncations, inverted-index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. The first assumes an "optimal" configuration for a single host and then linearly scales the database by duplicating the host architecture as needed. The second determines the optimal number of hosts given a fixed database size. Copyright (C) 1996 Elsevier Science Ltd
Pages: 647-665
Number of pages: 19