Cross-lingual analysis of English and Chinese web search

Authors
Lin, Peiguang [1]
Zhang, Tong [2]
Xia, Menglong [3]
Zhou, Jin [4]
Nie, Peiyao [1]
Affiliations
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250001, Shandong, Peoples R China
[2] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510000, Guangdong, Peoples R China
[3] Macau Univ Sci & Technol, Fac Hospitality & Tourism Management, Ave Wai Long, Taipa 999078, Macau, Peoples R China
[4] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250001, Shandong, Peoples R China
Keywords
cross-lingual analysis; web search analysis; search query; POS distribution; search session; session entropy; query reformulation; click graph analysis; query features; web search burstiness; ENGINE; ALGORITHM; BEHAVIOR
DOI
10.1504/IJWGS.2018.095663
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The non-English Web has grown considerably in recent years, so language-dependent and user-based search paradigms are becoming increasingly important for search engines. Unfortunately, most available work on web search analysis is still English-based. To understand the behavioural commonalities and distinctions of non-English users, we propose a framework for analysing the web search behaviour of users in a cross-lingual context. The framework is composed of 10 factors, which can be applied at the query level, session level and corpus level respectively. Used together, these factors help characterise the web search behaviour of users, even across different languages, from both statistical and semantic perspectives. The framework proves efficient not only in revealing the commonalities and distinctions of web search, but also in informing the design of search paradigms in a cross-lingual scenario.
Pages: 376-399
Page count: 24