Data modeling strategies for imbalanced learning in visual search

被引:0
|
作者
Tesic, Jelena [1 ]
Natsev, Apostol [1 ]
Xie, Lexing [1 ]
Smith, John R. [1 ]
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we examine a novel approach to the difficult problem of querying video databases using visual topics with few examples. Typically with visual topics, the examples are not sufficiently diverse to create a robust model of the user's need. As a result, direct modeling using the provided topic examples as training data is inadequate. Otherwise, systems resort to multiple content-based searches using each example in turn, which typically provides poor results. We propose a new technique of leveraging unlabeled data to expand the diversity of the topic examples as well as provide a robust set of negative examples that allow direct modeling. The approach intelligently models a pseudo-negative space using unbiased and biased methods for data sampling and data selection. We apply the proposed method in a fusion framework to improve discriminative support vector machine modeling, and improve the overall system performance. The result is an enhanced performance over any of the baseline models, as well as improved robustness with respect to training examples, visual features, and visual support of video topics in TRECVID. The proposed method outperforms a baseline retrieval approach by more than 18% on the TRECVID 2006 video collection and query topics.
引用
收藏
页码:1990 / 1993
页数:4
相关论文
共 50 条
  • [1] Few-shot learning on batch process modeling with imbalanced data
    Gu, Shaowu
    Chen, Junghui
    Xie, Lei
    [J]. CHEMICAL ENGINEERING SCIENCE, 2024, 285
  • [2] Learning from Imbalanced Data
    He, Haibo
    Garcia, Edwardo A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
  • [3] Learning in Imbalanced Relational Data
    Ghanem, Amal S.
    Venkatesh, Svetha
    West, Geoff
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 436 - 439
  • [4] Spatial constraints on learning in visual search: Modeling contextual cuing
    Brady, Timothy F.
    Chun, Marvin M.
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2007, 33 (04) : 798 - 815
  • [5] Learning Imbalanced Data with Vision Transformers
    Xu, Zhengzhuo
    Liu, Ruikang
    Yang, Shuo
    Chai, Zenghao
    Yuan, Chun
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15793 - 15803
  • [6] Metric Learning from Imbalanced Data
    Gautheron, Leo
    Habrard, Amaury
    Morvant, Emilie
    Sebban, Marc
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 923 - 930
  • [7] A LEARNING METHOD FOR IMBALANCED DATA SETS
    de la Calleja, Jorge
    Fuentes, Olac
    Gonzalez, Jesus
    Aceves-Perez, Rita M.
    [J]. KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 307 - +
  • [8] Box Drawings for Learning with Imbalanced Data
    Goh, Siong Thye
    Rudin, Cynthia
    [J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 333 - 342
  • [9] Machine learning for mining imbalanced data
    Arafat, Md. Yasir
    Hoque, Sabera
    Xu, Shuxiang
    Farid, Dewan Md
    [J]. IAENG International Journal of Computer Science, 2019, 46 (02) : 332 - 348
  • [10] Pairwise Learning for Imbalanced Data Classification
    Liu, Shu
    Wu, Qiang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 186 - 189