Spatial-text tree clustering algorithm based on improved NSGA-Ⅲ

被引:0
|
作者
Ma W. [1 ]
Wang R. [1 ]
Wu Y. [1 ]
Deng S. [1 ]
机构
[1] Science and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha
关键词
Genetic algorithm; Inverted spatial-text index; Multi-objective optimization; Non-dominating sorting; Priori initial population strategy;
D O I
10.13245/j.hust.200516
中图分类号
学科分类号
摘要
A multi-objective model for inverted-file R-tree (IR-tree) was constructed, and by improving the solving process of the non-dominating sorting genetic algorithm-Ⅲ (NSGA-Ⅲ), a priori initial population strategy non-dominating sorting genetic algorithm-Ⅲ (PIPS-NSGA-Ⅲ) was proposed to optimize the clustering splitting of text space nodes in IR-Tree. The optimal solution of the minimum overlap and coverage area, the average distance and the semantic similarity between the minimum bounding rectangles (MBRs) of nodes were considered by PIPS-NSGA-Ⅲ. By comparing the evolutionary multi-objective algorithms such as PIPS-NSGA-Ⅲ, NSGA-Ⅱ, NSGA-Ⅲ, SPEA-Ⅱ, the advantages and disadvantages of the algorithms were evaluated from the node classification time, efficiency, query time and accuracy. Experimental results show that the spatial-text index based on PIPS-NSGA-Ⅲ is more efficient, and the average searching time is reduced by 24.8% with the average accuracy improved by 3.75%, compared with the traditional simple traditional R-tree (STR-tree) and R-Tree spatial index structure. © 2020, Editorial Board of Journal of Huazhong University of Science and Technology. All right reserved.
引用
下载
收藏
页码:86 / 92
页数:6
相关论文
共 13 条
  • [1] 3
  • [2] LI Z, LEE K C, ZHENG B, Et al., IR-Tree: an efficient index for geographic document search, IEEE Transactions on Knowledge and Data Engineering, 23, 4, pp. 585-599, (2011)
  • [3] LEUTENEGGER S T, LOPEZ M A, EDGINGTON J., STR: a simple and efficient algorithm for R-tree packing, Proc of 13th International Conference on Data Engineering, pp. 497-506, (1997)
  • [4] (2014)
  • [5] GIAO B C, ANH D T., Improving sort-tile-recusive algorithm for R-tree packing in indexing time series, Proc of International Conference on the Computing and Communication Technologies-Research, Innovation, and Vision for the Future (RIVF), pp. 117-122, (2015)
  • [6] YANG Y, BAI P, GE N, Et al., LAZY R-tree: the R-tree with lazy splitting algorithm, Journal of Information Science, 46, 2, pp. 65-73, (2020)
  • [7] SEEGER B, KRIEGEL H P., The buddy effect: an efficient and robust access method for spatial data base systems, Proc of Very Large Database, pp. 590-601, (1990)
  • [8] CHA G H., An efficient indexing method for load networks, Transactions on Machine Learning and Artificial Intelligence, 6, 6, pp. 69-78, (2019)
  • [9] BERCHTOLD S, KEIM D, KRIEGEL H., An index structure for high-dimensional data, Readings in Multimedia Computing and Networking, 18, 8, pp. 451-485, (2001)
  • [10] BECKMANN N, KRIEGEL H-P, SCHNEIDER R, Et al., The R*-tree: an efficient and robust access method for points and rectangles, Proc of ACM Sigmod Record, pp. 322-331, (1990)