Multi-task Ranking with User Behaviors for Text-video Search

被引:3
|
作者
Liu, Peidong [1 ,2 ]
Liao, Dongliang [2 ]
Wang, Jinpeng [1 ,2 ]
Wu, Yangxin [2 ]
Li, Gongfu [2 ]
Xia, Shu-Tao [1 ,3 ]
Xu, Jin [4 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Beijing, Peoples R China
[2] Tencent Inc, Wechat Grp, Shenzhen, Peoples R China
[3] Peng Cheng Lab, Res Ctr Artificial Intelligence, Shenzhen, Peoples R China
[4] South China Univ Technol, Sch Future Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Text-video Search; Ranking Model; Multi-task Learning; User Behaviors; Multi-modal Fusion;
D O I
10.1145/3487553.3524207
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-video search has become an important demand in many industrial video sharing platforms, e.g., YouTube, TikTok, and WeChat Channels, thereby attracting increasing research attention. Traditional relevance-based ranking methods for text-video search concentrate on exploiting the semantic relevance between video and query. However, relevance is no longer the principal issue in the ranking stage, because the candidate items retrieved from the matching stage naturally guarantee adequate relevance. Instead, we argue that boosting user satisfaction should be an ultimate goal for ranking and it is promising to excavate cheap and rich user behaviors for model training. To achieve this goal, we propose an effective Multi-Task Ranking pipeline with User Behaviors (MTRUB) for text-video search. Specifically, to exploit the multi-modal data effectively, we put forward a Heterogeneous Multi-modal Fusion Module (HMFM) to fuse the query and video features of different modalities in adaptive ways. Besides that, we design an Independent Multi-modal Input Scheme (IMIS) to alleviate competing task correlation problems in multi-task learning. Experiments on the offline dataset gathered from WeChat Search demonstrate that MTRUB outperforms the baseline by 12.0% in mean gAUC and 13.3% in mean nDCG@10. We also conduct live experiments on a large-scale mobile search engine, i.e., WeChat Search, and MTRUB obtains substantial improvement compared with the traditional relevance-based ranking model.
引用
收藏
页码:126 / 130
页数:5
相关论文
共 50 条
  • [1] Text-video retrieval method based on enhanced self-attention and multi-task learning
    Xiaoyu Wu
    Jiayao Qian
    Tiantian Wang
    Multimedia Tools and Applications, 2023, 82 : 24387 - 24406
  • [2] Text-video retrieval method based on enhanced self-attention and multi-task learning
    Wu, Xiaoyu
    Qian, Jiayao
    Wang, Tiantian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (16) : 24387 - 24406
  • [3] Cross-Modal Learning Based on Semantic Correlation and Multi-Task Learning for Text-Video Retrieval
    Wu, Xiaoyu
    Wang, Tiantian
    Wang, Shengjin
    ELECTRONICS, 2020, 9 (12) : 1 - 17
  • [4] Instance-based Video Search via Multi-task Retrieval and Re-ranking
    Zhao, Zhicheng
    Chen, Guanyu
    Chen, Chong
    Li, Xinyu
    Xiang, Xuanlu
    Zhao, Yanyun
    Su, Fei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1873 - 1878
  • [5] Multi-Task Learning for Entity Recommendation and Document Ranking in Web Search
    Huang, Jizhou
    Wang, Haifeng
    Zhang, Wei
    Liu, Ting
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (05)
  • [6] Multi-Task Learning for Email Search Ranking with Auxiliary Query Clustering
    Shen, Jiaming
    Karimzadehgan, Maryam
    Bendersky, Michael
    Qin, Zhen
    Metzler, Donald
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2127 - 2135
  • [7] Stock Ranking with Multi-Task Learning
    Ma, Tao
    Tan, Ying
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
  • [8] Ranking Performance Measures in Multi-Task Agencies
    Christensen, Peter O.
    Sabac, Florin
    Tian, Jie
    ACCOUNTING REVIEW, 2010, 85 (05): : 1545 - 1575
  • [9] Multi-task ranking SVM for image cosegmentation
    Liang, Xianpeng
    Zhu, Lin
    Huang, De-Shuang
    NEUROCOMPUTING, 2017, 247 : 126 - 136
  • [10] Text Augmentation in a Multi-Task View
    Wei, Jason
    Huang, Chengyu
    Xu, Shiqi
    Vosoughi, Soroush
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2888 - 2894