Catboost-based Framework with Additional User Information for Social Media Popularity Prediction

Cited by: 31
Authors
Kang, Peipei [1]
Lin, Zehang [2]
Teng, Shaohua [1]
Zhang, Guipeng [1]
Guo, Lingni [1]
Zhang, Wei [1]
Affiliations
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Social Media Prediction; Categorical Features; Catboost;
DOI
10.1145/3343031.3356060
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In this paper, a Catboost-based framework is proposed to predict social media popularity. The framework consists of two components: feature representation and Catboost training. In the feature representation component, numerical features are used directly, while categorical features are converted into numerical features by the ordered target statistics method in Catboost. In addition, extra user information is tracked to enrich the feature space. In the training component, Catboost is adopted as the regression model, which is trained on post-related, user-related, and additional user information. Moreover, to make full use of the dataset for model training, a dataset augmentation strategy based on pseudo labels is proposed. This strategy involves two-stage training. In the first stage, a first-stage model is trained and used to assign pseudo labels to the test set. In the second stage, a final model is trained on a new training set that includes the original validation set and the pseudo-labeled test set. The proposed method achieved 2nd place on the leaderboard of the Grand Challenge of Social Media Prediction.
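The ordered target statistics idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code and not CatBoost's internal implementation: the function name `ordered_target_statistics`, the smoothing parameter `weight`, the use of the global target mean as the prior, and the toy data are all assumptions made for illustration. The sketch also processes rows in the order given, whereas CatBoost works over random permutations of the data.

```python
from collections import defaultdict


def ordered_target_statistics(categories, targets, prior, weight=1.0):
    """Encode each categorical value using only the targets of earlier rows.

    Hypothetical helper for illustration. For row i with category c:
        encoded_i = (sum of targets of earlier rows with category c + weight * prior)
                    / (number of earlier rows with category c + weight)
    Because the row's own target is never used, the encoding does not leak it.
    """
    sums = defaultdict(float)   # running target sum per category
    counts = defaultdict(int)   # running row count per category
    encoded = []
    for cat, y in zip(categories, targets):
        # Compute the statistic from the history seen so far.
        encoded.append((sums[cat] + weight * prior) / (counts[cat] + weight))
        # Only afterwards add the current row to the history.
        sums[cat] += y
        counts[cat] += 1
    return encoded


# Toy data (assumed): a user-id feature and a popularity score per post.
cats = ["u1", "u2", "u1", "u1", "u2"]
ys = [2.0, 4.0, 6.0, 1.0, 8.0]
prior = sum(ys) / len(ys)  # global mean used as the prior (an assumption)

enc = ordered_target_statistics(cats, ys, prior)
for cat, value in zip(cats, enc):
    print(cat, round(value, 4))
```

The first occurrence of each category falls back to the prior, and later occurrences move toward the running mean of that category's earlier targets. When using the CatBoost library itself, this conversion is handled internally for columns declared as categorical, so a helper like this is only useful for understanding the mechanism.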
Pages: 2677-2681
Number of pages: 5