Photo Semantic Understanding and Retargeting by a Noise-Robust Regularized Topic Model

Cited by: 0
Authors
Wang, Guifeng [1 ]
Zhang, Luming [1 ]
Li, Yongbin [1 ]
Sheng, Yichuan [1 ]
Affiliations
[1] Jinhua Polytech, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321007, Peoples R China
Keywords
Aerial photo; deep feature; matrix factorization; probabilistic model; retargeting; COMMUNITIES; ALGORITHM
DOI
10.1109/JSTARS.2023.3247745
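Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809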
Abstract
Retargeting aims at displaying a photo with an arbitrary aspect ratio, wherein the visually/semantically prominent objects are appropriately preserved and visual distortions are well alleviated. Conventional retargeting models are built upon the visual perception of photos from a family of prespecified communities (e.g., "portrait"), wherein the underlying community-specific features are not learned explicitly. Thus, they cannot appropriately retarget aerial photos, which contain a rich variety of objects with different scales. In this article, a novel aerial photo retargeting framework is designed by encoding the deep features from automatically detected Google Maps (https://www.google.com/maps) communities into a regularized probabilistic model. Specifically, we first propose an enhanced matrix factorization (MF) algorithm to calculate communities based on million-scale Google Maps pictures, for each of which a deep feature is learned simultaneously. The enhanced MF incorporates label denoising, between-communities correlation, and deep feature encoding collaboratively. Subsequently, a probabilistic model called the latent topic model (LTM) is designed to quantify the spatial layouts of multiple Google Maps communities in the underlying hidden space. To alleviate the overfitting caused by Google Maps communities with imbalanced numbers of aerial photos, a regularizer is added to the LTM. Finally, by leveraging the regularized LTM, we shrink the test photo horizontally/vertically to maximize the posterior probability of the retargeted photo. Comprehensive subjective evaluations and visualizations have demonstrated the advantages of our method. In addition, our calculated Google Maps communities are competitively consistent with the ground truth, according to quantitative comparisons on the 2M Google Maps photos.
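The final step of the abstract, shrinking a photo horizontally to maximize a score of the retargeted result, can be illustrated with a minimal sketch. This is not the authors' method: the score `retained_importance` is a hypothetical stand-in for the posterior probability under the regularized LTM, the per-pixel importance grid is invented toy data, and the greedy one-column-at-a-time search is an assumed simplification. All function names are illustrative.

```python
from typing import Callable, List

Grid = List[List[float]]  # rows of per-pixel importance values (toy data)


def retained_importance(photo: Grid) -> float:
    """Hypothetical stand-in for the model posterior: total importance kept."""
    return sum(sum(row) for row in photo)


def drop_column(photo: Grid, j: int) -> Grid:
    """Return a copy of the photo with column j removed."""
    return [row[:j] + row[j + 1:] for row in photo]


def shrink_horizontally(
    photo: Grid,
    target_width: int,
    score: Callable[[Grid], float] = retained_importance,
) -> Grid:
    """Greedily remove one column at a time, keeping the highest-scoring result."""
    if not photo or not 1 <= target_width <= len(photo[0]):
        raise ValueError("target_width must be between 1 and the photo width")
    current = photo
    while len(current[0]) > target_width:
        # Try every single-column removal and keep the best-scoring candidate.
        candidates = [drop_column(current, j) for j in range(len(current[0]))]
        current = max(candidates, key=score)
    return current


# Toy 2 x 5 importance map: columns 1 and 2 hold the prominent object.
importance = [
    [0.1, 0.9, 0.8, 0.1, 0.2],
    [0.0, 1.0, 0.7, 0.1, 0.3],
]
retargeted = shrink_horizontally(importance, target_width=3)
print(retargeted)
```

Vertical shrinking would follow the same pattern applied to rows. Replacing `score` with any other callable, for example a learned model's log-posterior, leaves the search loop unchanged, which is the only design point this sketch is meant to show.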
Pages: 3495-3505
Page count: 11