Mining Similar Traces of Entities on Web

Cited by: 2
Authors
Huang, Xinyan [1 ,3 ]
Wang, Xinjun [1 ,2 ]
Li, Hui [1 ,2 ]
Affiliations
[1] Shandong Univ, Num 1500,SunHua Rd High Tech Ind Dev Zone, Jinan 250100, Peoples R China
[2] Dareway Software Co Ltd, Jinan, Peoples R China
[3] Shandong Univ Finance & Econ, Jinan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Significant event; similar trace; candidate topic;
DOI
10.1515/cait-2015-0081
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Events about entities have been widely collected on the Web, allowing us to analyze how peer entities interact and to learn the relationships that exist among them. In this paper we investigate similar traces, which have not been adequately studied so far. Intuitively, peer entities tend to have similar traces. The challenges in mining similar traces are: (1) the time lags between traces are usually unknown and varying; (2) the events of entities are large in scale, and a model representing all of the events is complex. We propose a simple but practical method that addresses these challenges. First, sliding windows are adopted to extract the significant events and then find the candidate topic sequences. Second, dynamic programming is employed to mine similar candidate topic sequences of entities. Finally, an efficient method is proposed to mine all the similar traces of entities. The method is able to mine similar traces of peer entities with high accuracy. We conduct comprehensive experiments on synthetic datasets to demonstrate the efficiency of the proposed method.
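The abstract names two building blocks, sliding windows and dynamic programming, without giving details. The Python sketch below is one possible reading of them and is not the authors' algorithm. Every name and parameter in it (`candidate_topics`, `lcs_length`, `trace_similarity`, `window`, `step`, `min_count`) is hypothetical, as are the choice of "dominant topic per window" as the significance test and the use of a longest common subsequence as the similarity measure.

```python
from collections import Counter
from typing import List, Sequence, Tuple

Event = Tuple[int, str]  # (timestamp, topic); an assumed event representation


def candidate_topics(events: Sequence[Event], window: int, step: int,
                     min_count: int) -> List[str]:
    """Slide a fixed-width window over time and keep the dominant topic of
    each window, provided it occurs at least min_count times (assumed test
    for a 'significant event'). Consecutive repeats are collapsed."""
    if not events:
        return []
    ordered = sorted(events)
    t, end = ordered[0][0], ordered[-1][0]
    sequence: List[str] = []
    while t <= end:
        counts = Counter(topic for ts, topic in ordered if t <= ts < t + window)
        if counts:
            topic, n = counts.most_common(1)[0]
            if n >= min_count and (not sequence or sequence[-1] != topic):
                sequence.append(topic)
        t += step
    return sequence


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence, by dynamic programming."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            if x == y:
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]


def trace_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """LCS length normalised by the longer sequence (an assumed score)."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


# Two made-up entities whose events follow the same order with a time lag.
entity_a = [(0, "launch"), (1, "launch"), (2, "launch"),
            (10, "recall"), (11, "recall"), (20, "merger"), (21, "merger")]
entity_b = [(3, "launch"), (4, "launch"), (8, "rumor"),
            (13, "recall"), (14, "recall"), (23, "merger"), (24, "merger")]

seq_a = candidate_topics(entity_a, window=5, step=5, min_count=2)
seq_b = candidate_topics(entity_b, window=5, step=5, min_count=2)
score = trace_similarity(seq_a, seq_b)
```

Because the comparison uses only the order of topics and not their timestamps, this sketch tolerates an unknown lag between the two entities, which is the first challenge the abstract lists. It makes no attempt at the scalability issue or at the final step of mining all similar traces.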
Pages: 219-229
Page count: 11
Related papers
50 in total
  • [1] Mining Similar Traces of Entities on Web (vol 15, pg 219, 2015)
    Huang, Xinyan
    Wang, Xinjun
    Li, Hui
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2016, 16 (01) : 188 - 191
  • [2] Mining Periodic Traces of an Entity on Web
    Huang, X.
    Wang, X.
    Zhang, Y.
    Zhao, J.
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (05) : 654 - 666
  • [3] Mining and Modeling Web Trajectories from Passive Traces
    Vassio, Luca
    Mellia, Marco
    Figueiredo, Flavio
    Couto da Silva, Ana Paula
    Almeida, Jussara M.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017 : 4016 - 4021
  • [4] Do Similar Entities Have Similar Embeddings?
    Hubert, Nicolas
    Paulheim, Heiko
    Brun, Armelle
    Monticolo, Davy
    [J]. SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 3 - 21
  • [5] Mining web sites using wrapper induction, named entities, and post-processing
    Sigletos, G
    Paliouras, G
    Spyropoulos, CD
    Hatzopoulos, M
    [J]. WEB MINING: FROM WEB TO SEMANTIC WEB, 2004, 3209 : 97 - 112
  • [6] Events and objects are similar cognitive entities
    Papafragou, Anna
    Ji, Yue
    [J]. COGNITIVE PSYCHOLOGY, 2023, 143
  • [7] Mining temporal explicit and implicit semantic relations between entities using web search engines
    Xu, Zheng
    Luo, Xiangfeng
    Zhang, Shunxiang
    Wei, Xiao
    Mei, Lin
    Hu, Chuanping
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 468 - 477
  • [8] Leaving traces in the web
    Schüller, P
    [J]. TRACE ELEMENTS AND ELECTROLYTES, 2000, 17 (04) : 190 - 192
  • [9] Web + Data Mining = Web Mining
    Kilian Stoffel
    [J]. HMD Praxis der Wirtschaftsinformatik, 2009, 46 (4) : 6 - 20
  • [10] Identity of resources and entities on the web
    Presutti, Valentina
    Gangemi, Aldo
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2008, 4 (02) : 49 - 72