Visual trigger templates for knowledge-based indexing

被引:0
|
作者
Jaimes, A [1 ]
Wang, QH
Kato, N
Ikeda, H
Miyazaki, J
机构
[1] Fuji Xerox Co Ltd, FXPal Japan, Kanagawa, Japan
[2] Fuji Xerox Co Ltd, Corp Res Lab, Kanagawa, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an application to create binary Visual Trigger Templates (VTT) for automatic video indexing. Our approach is based on the observation that videos captured with fixed cameras have specific structures that depend on world constraints. Our system allows a user to graphically represent such constraints to automatically recognize simple actions or events. VTTs are constructed by manually drawing rectangles to define trigger spaces: when elements (e.g., a hand, a face) move inside the trigger spaces defined by the user, actions are recognized. For example, a user can define a raise hand action by drawing two rectangles: one for the face and one for the hand. Our approach uses motion, skin, and face detection algorithms. We present experiments on the PETS-ICVS dataset and on our own dataset to demonstrate that our system constitutes a simple but powerful mechanism for meeting video indexing.
引用
收藏
页码:154 / 161
页数:8
相关论文
共 50 条
  • [31] Knowledge is power: Open-world knowledge representation learning for knowledge-based visual reasoning
    Zheng, Wenbo
    Yan, Lan
    Wang, Fei-Yue
    [J]. Artificial Intelligence, 2024, 333
  • [32] Research on the Competitiveness of Knowledge-Based Workers in Knowledge-Based Organization
    Sun Xinqing
    Wang Pengju
    Ma Xiaohua
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INNOVATION AND MANAGEMENT, VOLS I AND II, 2010, : 1409 - 1413
  • [33] Reconstructing the recent visual past: Hierarchical knowledge-based effects in visual working memory
    Poirier, Marie
    Heussen, Daniel
    Aldrovandi, Silvio
    Daniel, Lauren
    Tasnim, Saiyara
    Hampton, James A.
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (06) : 1889 - 1899
  • [34] Explainable Knowledge reasoning via thought chains for knowledge-based visual question answering
    Qiu, Chen
    Xie, Zhiqiang
    Liu, Maofu
    Hu, Huijun
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (04)
  • [35] Reconstructing the recent visual past: Hierarchical knowledge-based effects in visual working memory
    Marie Poirier
    Daniel Heussen
    Silvio Aldrovandi
    Lauren Daniel
    Saiyara Tasnim
    James A. Hampton
    [J]. Psychonomic Bulletin & Review, 2017, 24 : 1889 - 1899
  • [36] MKEAH: Multimodal knowledge extraction and accumulation based on hyperplane embedding for knowledge-based visual question answering
    Zhang, Heng
    Wei, Zhihua
    Liu, Guanming
    Wang, Rui
    Mu, Ruibin
    Liu, Chuanbao
    Yuan, Aiquan
    Cao, Guodong
    Hu, Ning
    [J]. Virtual Reality and Intelligent Hardware, 6 (04): : 280 - 291
  • [37] MKEAH: Multimodal knowledge extraction and accumulation based on hyperplane embedding for knowledge-based visual question answering
    Heng ZHANG
    Zhihua WEI
    Guanming LIU
    Rui WANG
    Ruibin MU
    Chuanbao LIU
    Aiquan YUAN
    Guodong CAO
    Ning HU
    [J]. 虚拟现实与智能硬件(中英文), 2024, 6 (04) : 280 - 291
  • [38] Multimodal Inverse Cloze Task for Knowledge-Based Visual Question Answering
    Lerner, Paul
    Ferret, Olivier
    Guinaudeau, Camille
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT I, 2023, 13980 : 569 - 587
  • [39] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering
    Lerner, Paul
    Ferret, Olivier
    Guinaudeau, Camille
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
  • [40] A knowledge-based image retrieval system integrating semantic and visual features
    Allani, Olfa
    Zghal, Hajer Baazaoui
    Mellouli, Nedra
    Akdag, Herman
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 1428 - 1436