Distributed Computation of Pure High-Order Word Dependence

被引:0
|
作者
Ruan, Xingmao [1 ]
Sun, Yueheng [1 ]
Wang, Hailin [1 ]
Hou, Yuexian [1 ]
Zhao, Xiaozhao [1 ]
Zhang, Peng [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
关键词
Pure high order; Word associations; Information geometry; Distributed computation;
D O I
10.1007/978-3-642-54924-3_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many data mining methods have been proposed to obtain useful word associations in text documents. However, it is of great challenge to efficiently discover "pure'' high-order (n > 3) word association patterns, especially in the rapidly expanding data collection. Here, by "pure,'' we mean that those words form an inseparable semantic entity, i.e., the high-order dependence that cannot be reduced to the random coincidence of low-order dependence. Our aim is to find pure high-order word associations in large data sets. This paper proposes a distributed pure dependence mining (DPDM) algorithm based on information geometry, which can efficiently mine the pure dependence between words. We also construct a distributed pure dependence mining framework (DPDMF) to mine pure high-order word associations from data sets. Extensive experiments show that DPDM algorithm can significantly improve the efficiency when mining the pure high-order patterns from large data sets. Finally, we apply the extracted high-order word association patterns in the text classification tasks, which will also achieve significant improvement.
引用
收藏
页码:211 / 220
页数:10
相关论文
共 50 条
  • [1] Pure High-Order Word Dependence Mining via Information Geometry
    Hou, Yuexian
    He, Liang
    Zhao, Xiaozhao
    Song, Dawei
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2011, 6931 : 64 - +
  • [2] Mining Pure High-Order Word Associations via Information Geometry for Information Retrieval
    Hou, Yuexian
    Zhao, Xiaozhao
    Song, Dawei
    Li, Wenjie
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2013, 31 (03)
  • [3] Distributed Optimization of High-Order Nonlinear Systems: Saving Computation and Communication via Prefiltering
    Kang, Jian
    Guo, Ge
    Yang, Guanghong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1144 - 1148
  • [4] HIGH-ORDER FIELD DERIVATIVES FOR COMPUTATION OF ABERRATIONS
    Radlicka, T.
    RECENT TRENDS IN CHARGED PARTICLE OPTICS AND SURFACE PHYSICS INSTRUMENTATION, PROCEEDINGS, 2008, : 95 - 96
  • [5] Scalable Computation of High-Order Optimization Queries
    Brucato, Matteo
    Abouzied, Azza
    Meliou, Alexandra
    COMMUNICATIONS OF THE ACM, 2019, 62 (02) : 108 - 116
  • [6] Neuronal computation using high-order statistics
    Feng, Jianfeng
    NETWORKS: FROM BIOLOGY TO THEORY, 2007, : 65 - +
  • [7] High-order squeezing of pure superposition states
    MA Aiqun(Harbin College
    Chinese Journal of Lasers, 1994, (02) : 105 - 115
  • [8] A DISTRIBUTED SIMULATOR FOR HIGH-ORDER PETRI NETS
    BUTLER, B
    ESSER, R
    MATTMANN, R
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 483 : 47 - 63
  • [9] Scalable Distributed High-Order Stencil Computations
    Jacquelin, Mathias
    Araya-Polo, Mauricio
    Meng, Jie
    SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [10] COMPUTATION OF HIGH-ORDER MAPS TO MULTIPLE MACHINE PRECISION
    Wittig, Alexander
    Berz, Martin
    INTERNATIONAL JOURNAL OF MODERN PHYSICS A, 2009, 24 (05): : 1019 - 1039