Region Embedding With Intra and Inter-View Contrastive Learning

Cited by: 3
Authors
Zhang, Liang [1]
Long, Cheng [1]
Cong, Gao [1]
Affiliations
[1] Nanyang Technol Univ, Dept Comp Sci, Singapore 639798, Singapore
Funding
National Research Foundation, Singapore
Keywords
Contrastive learning; region representation; multi-view representation; urban computing
DOI
10.1109/TKDE.2022.3220874
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Unsupervised region representation learning aims to extract dense and effective features from unlabeled urban data. While some efforts have been made to solve this problem using multiple views, existing methods still fall short in extracting representations within a view and/or in incorporating representations from different views. Motivated by the success of contrastive learning for representation learning, we propose to leverage it for multi-view region representation learning and design a model called ReMVC (Region Embedding with Multi-View Contrastive Learning) by following two guidelines: (i) comparing a region with others within each view for effective representation extraction and (ii) comparing a region with itself across different views for cross-view information sharing. We design an intra-view contrastive learning module, which helps to learn discriminative region embeddings, and an inter-view contrastive learning module, which serves as a soft co-regularizer to constrain the embedding parameters and transfer knowledge across views. We evaluate the learned region embeddings on two downstream tasks: land usage clustering and region popularity prediction. Extensive experiments demonstrate that our model achieves substantial improvements over seven state-of-the-art baseline methods, with margins of over 30% in the land usage clustering task.
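The two guidelines in the abstract can be illustrated with a generic InfoNCE-style contrastive loss. The sketch below is not the ReMVC objective, which the abstract does not specify. The function `info_nce`, the arrays `view_a`, `view_a_perturbed`, and `view_b`, the temperature value, and the use of a noisy copy as the intra-view positive are all assumptions made for illustration.

```python
import numpy as np

def info_nce(anchors, positives, temperature=0.5):
    """Generic InfoNCE loss (an illustrative stand-in, not the ReMVC loss).

    Row i of `positives` is the positive for row i of `anchors`;
    all other rows of `positives` act as negatives for that anchor.
    """
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature                      # cosine similarities
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))

rng = np.random.default_rng(0)
n_regions, dim = 8, 16

# Hypothetical embeddings of the same 8 regions in two views.
view_a = rng.normal(size=(n_regions, dim))
view_b = view_a + 0.05 * rng.normal(size=(n_regions, dim))

# Guideline (i), intra-view: contrast each region with the other regions
# in the same view. A noisy copy of the region is the assumed positive.
view_a_perturbed = view_a + 0.05 * rng.normal(size=(n_regions, dim))
intra_loss = info_nce(view_a, view_a_perturbed)

# Guideline (ii), inter-view: the same region in the other view is the
# positive, so minimizing this term pulls the two views together.
inter_loss = info_nce(view_a, view_b)

# Sanity check: pairing each region with a different region costs more.
mismatched_loss = info_nce(view_a, view_b[::-1])

total_loss = intra_loss + inter_loss
```

In this sketch the two terms are simply summed; how the terms are actually weighted or combined is a design choice the abstract leaves open.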
Pages: 9031-9036
Page count: 6