Multi-task Ranking with User Behaviors for Text-video Search

被引:3
|
作者
Liu, Peidong [1 ,2 ]
Liao, Dongliang [2 ]
Wang, Jinpeng [1 ,2 ]
Wu, Yangxin [2 ]
Li, Gongfu [2 ]
Xia, Shu-Tao [1 ,3 ]
Xu, Jin [4 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Beijing, Peoples R China
[2] Tencent Inc, Wechat Grp, Shenzhen, Peoples R China
[3] Peng Cheng Lab, Res Ctr Artificial Intelligence, Shenzhen, Peoples R China
[4] South China Univ Technol, Sch Future Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Text-video Search; Ranking Model; Multi-task Learning; User Behaviors; Multi-modal Fusion;
D O I
10.1145/3487553.3524207
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-video search has become an important demand in many industrial video sharing platforms, e.g., YouTube, TikTok, and WeChat Channels, thereby attracting increasing research attention. Traditional relevance-based ranking methods for text-video search concentrate on exploiting the semantic relevance between video and query. However, relevance is no longer the principal issue in the ranking stage, because the candidate items retrieved from the matching stage naturally guarantee adequate relevance. Instead, we argue that boosting user satisfaction should be an ultimate goal for ranking and it is promising to excavate cheap and rich user behaviors for model training. To achieve this goal, we propose an effective Multi-Task Ranking pipeline with User Behaviors (MTRUB) for text-video search. Specifically, to exploit the multi-modal data effectively, we put forward a Heterogeneous Multi-modal Fusion Module (HMFM) to fuse the query and video features of different modalities in adaptive ways. Besides that, we design an Independent Multi-modal Input Scheme (IMIS) to alleviate competing task correlation problems in multi-task learning. Experiments on the offline dataset gathered from WeChat Search demonstrate that MTRUB outperforms the baseline by 12.0% in mean gAUC and 13.3% in mean nDCG@10. We also conduct live experiments on a large-scale mobile search engine, i.e., WeChat Search, and MTRUB obtains substantial improvement compared with the traditional relevance-based ranking model.
引用
收藏
页码:126 / 130
页数:5
相关论文
共 50 条
  • [31] Adaptive Dynamic Search for Multi-Task Learning
    Kim, Eunwoo
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [32] Evolutionary Planning for Multi-User Multi-Task Missions
    Kala, Rahul
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2689 - 2696
  • [33] A user behavior-aware multi-task learning model for enhanced short video recommendation
    Wu, Yuewei
    Fu, Ruiling
    Xing, Tongtong
    Yu, Zhenyu
    Yin, Fulian
    NEUROCOMPUTING, 2025, 617
  • [34] Multi-task and multi-view learning of user state
    Kandemir, Melih
    Vetek, Akos
    Goenen, Mehmet
    Klami, Arto
    Kaski, Samuel
    NEUROCOMPUTING, 2014, 139 : 97 - 106
  • [35] A Multi-Task Learning Approach for Improving Product Title Compression with User Search Log Data
    Wang, Jingang
    Tian, Junfeng
    Qiu, Long
    Li, Sheng
    Lang, Jun
    Si, Luo
    Lan, Man
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 451 - 458
  • [36] Multi-Task Decision-Making for Multi-User 360° Video Processing over Wireless Networks
    Badnava, Babak
    Chakareski, Jacob
    Hashemi, Morteza
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 294 - 300
  • [37] CommuSpotter: Scene Text Spotting with Multi-Task Communication
    Zhao, Liang
    Wilsbacher, Greg
    Wang, Song
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [38] MCapsNet: Capsule Network for Text with Multi-Task Learning
    Xiao, Liqiang
    Zhang, Honglun
    Chen, Wenqing
    Wang, Yongkun
    Jin, Yaohui
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4565 - 4574
  • [39] A Multi-Task Learning Framework for Abstractive Text Summarization
    Lu, Yao
    Liu, Linqing
    Jiang, Zhile
    Yang, Min
    Goebel, Randy
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9987 - 9988
  • [40] Multi-task Fuzzy Clustering-Based Multi-task TSK Fuzzy System for Text Sentiment Classification
    Gu, Xiaoqing
    Xia, Kaijian
    Jiang, Yizhang
    Jolfaei, Alireza
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)