Cross-lingual analysis of English and Chinese web search

被引:0
|
作者
Lin, Peiguang [1 ]
Zhang, Tong [2 ]
Xia, Menglong [3 ]
Zhou, Jin [4 ]
Nie, Peiyao [1 ]
机构
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250001, Shandong, Peoples R China
[2] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510000, Guangdong, Peoples R China
[3] Macau Univ Sci & Technol, Fac Hospitality & Tourism Management, Ave Wai Long, Taipa 999078, Macau, Peoples R China
[4] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250001, Shandong, Peoples R China
关键词
cross-lingual analysis; web search analysis; search query; POS distribution; search session; session entropy; query reformulation; click graph analysis; query features; web search burstiness; ENGINE; ALGORITHM; BEHAVIOR;
D O I
10.1504/IJWGS.2018.095663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a growing number of the non-English Web in recent years. So the language-dependent and user-based search paradigms are becoming increasingly important for search engines. Unfortunately, most of the works are available on web search analysis are still English-based. In order to understand the behavioural commonality and distinction of non-English users, we propose a framework for analysing the web search behaviours of users in a cross-lingual context. This framework is composed of 10 factors, which can be applied at the query level, session level and corpus level respectively. The integral employment of these factors could help us with characterising the user behaviour of web search, even in different languages, with regard to both statistical and semantic perspectives. This framework shows a better efficiency not only in revealing the commonality and distinction of web search, but also in informing the design of search paradigms in a cross-lingual scenario.
引用
收藏
页码:376 / 399
页数:24
相关论文
共 50 条
  • [1] CLTC: A Chinese-English Cross-lingual Topic Corpus
    Xia, Yunqing
    Tang, Guoyu
    Jin, Peng
    Yang, Xia
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 532 - 537
  • [2] Cross-Lingual Web Spam Classification
    Garzo, Andras
    Daroczy, Balint
    Kiss, Tamas
    Siklosi, David
    Benczur, Andras A.
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 1149 - 1156
  • [3] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [4] English and Malay Cross-lingual Sentiment Lexicon Acquisition and Analysis
    Nasharuddin, Nurul Amelina
    Abdullah, Muhamad Taufik
    Azman, Azreen
    Kadir, Rabiah Abdul
    INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 467 - 475
  • [5] The application of the comparable corpora in Chinese-English Cross-Lingual Information Retrieval
    Du, L
    Zhang, YB
    Sun, L
    Sun, YF
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (04) : 351 - 358
  • [6] The application of the comparable corpora in Chinese-English Cross-Lingual Information Retrieval
    Lin Du
    Yibo Zhang
    Le Sun
    Yufang Sun
    Journal of Computer Science and Technology, 2001, 16 : 351 - 358
  • [7] The Application of the Comparable Corpora in Chinese-English Cross-Lingual Information Retrieval
    杜林
    张毅波
    孙乐
    孙玉芳
    Journal of Computer Science and Technology, 2001, (04) : 351 - 358
  • [8] A Cross-Lingual Dictionary for English Wikipedia Concepts
    Spitkovsky, Valentin I.
    Chang, Angel X.
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3168 - 3175
  • [9] English-Welsh Cross-Lingual Embeddings
    Espinosa-Anke, Luis
    Palmer, Geraint
    Corcoran, Padraig
    Filimonov, Maxim
    Spasic, Irena
    Knight, Dawn
    APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [10] Cross-Lingual Entity Linking for Web Tables
    Luo, Xusheng
    Luo, Kangqi
    Chen, Xianyang
    Zhu, Kenny Q.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 362 - 369