User demographics prediction based on mobile data

被引:32
|
作者
Zhong, Erheng [1 ]
Tan, Ben [1 ]
Mo, Kaixiang [1 ]
Yang, Qiang [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[2] Huawei Noahs Ark Res Lab, Units 525 530, Shatin, Hong Kong, Peoples R China
关键词
Demographics; Mobile; Feature construction; Multi-task learning; Cost-sensitive classification; Ensemble; TUTORIAL;
D O I
10.1016/j.pmcj.2013.07.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Demographics prediction is an important component of user profile modeling. The accurate prediction of users' demographics can help promote many applications, ranging from web search, personalization to behavior targeting. In this paper, we focus on how to predict users' demographics, including "gender", "job type", "marital status", "age" and "number of family members", based on mobile data, such as users' usage logs, physical activities and environmental contexts. The core idea is to build a supervised learning framework, where each user is represented as a feature vector and users' demographics are considered as prediction targets. The most important component is to construct features from raw data and then supervised learning models can be applied. We propose a feature construction framework, CFC (contextual feature construction), where each feature is defined as the conditional probability of one user activity under the given contexts. Consequently, besides employing standard supervised learning models, we propose a regularized multi-task learning framework to model different kinds of demographics predictions collectively. We also propose a cost-sensitive classification framework for regression tasks, in order to benefit from the existing dimension reduction methods. Finally, due to the limited training instances, we employ ensemble to avoid overfitting. The experimental results show that the framework achieves classification accuracies on "gender", "job" and "marital status" as high as 96%, 83% and 86%, respectively, and achieves Root Mean Square Error (RMSE) on "age" and "number of family members" as low as 0.69 and 0.66 respectively, under the leave-one-out evaluation. Crown Copyright (C) 2013 Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:823 / 837
页数:15
相关论文
共 50 条
  • [1] User Demographic Prediction Based on the Fusion of Mobile and Survey Data
    Chen, Xingyu
    Guo, Ye
    Xu, Honglei
    Yan, Hongyan
    Lin, Lin
    IEEE ACCESS, 2022, 10 : 111507 - 111527
  • [2] Chinese Mobile Internet User Demographics 2012
    Samantha Wang
    China's Foreign Trade, 2012, (07) : 12 - 13
  • [3] User Preference Prediction in Visual Data on Mobile Devices
    Savchenko, Andrey, V
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Mobile Phone Changing Prediction based on Large-scale User Behavioral Data
    Ma, Qingli
    Cui, Tiesheng
    Zheng, Jiewen
    Zhang, Sihai
    Zhou, Wuyang
    2016 19TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2016,
  • [5] Clustering and prediction of mobile user routes from cellular data
    Laasonen, K
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 569 - 576
  • [6] Mobile User Trajectory Prediction Based on Machine Learning
    Liu, Ya
    Yang, Hongwen
    Huang, Rui
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [7] Location Prediction Based on User Mobile Behavior Similarity
    Qiao, Jianzhong
    Li, Shengzhi
    Lin, Shukuan
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 783 - 786
  • [8] Prediction of User Demographics from Music Listening Habits
    Krismayer, Thomas
    Schedl, Markus
    Knees, Peter
    Rabiser, Rick
    PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [9] Data-driven Modeling and Prediction of User Acceptance for Mobile Apps
    Lu X.
    Chen Z.-P.
    Liu X.-Z.
    Mei H.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3364 - 3379
  • [10] Telco User Activity Level Prediction with Massive Mobile Broadband Data
    Luo, Chen
    Zeng, Jia
    Yuan, Mingxuan
    Dai, Wenyuan
    Yang, Qiang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 7 (04)