Visual trigger templates for knowledge-based indexing

被引：0

作者：

Jaimes, A ^{[1
]}

Wang, QH

Kato, N

Ikeda, H

Miyazaki, J

机构：

[1] Fuji Xerox Co Ltd, FXPal Japan, Kanagawa, Japan

[2] Fuji Xerox Co Ltd, Corp Res Lab, Kanagawa, Japan

来源：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS | 2004年 / 3332卷

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present an application to create binary Visual Trigger Templates (VTT) for automatic video indexing. Our approach is based on the observation that videos captured with fixed cameras have specific structures that depend on world constraints. Our system allows a user to graphically represent such constraints to automatically recognize simple actions or events. VTTs are constructed by manually drawing rectangles to define trigger spaces: when elements (e.g., a hand, a face) move inside the trigger spaces defined by the user, actions are recognized. For example, a user can define a raise hand action by drawing two rectangles: one for the face and one for the hand. Our approach uses motion, skin, and face detection algorithms. We present experiments on the PETS-ICVS dataset and on our own dataset to demonstrate that our system constitutes a simple but powerful mechanism for meeting video indexing.

引用

页码：154 / 161

页数：8

共 50 条

[31] Knowledge is power: Open-world knowledge representation learning for knowledge-based visual reasoning
Zheng, Wenbo
Yan, Lan
Wang, Fei-Yue
[J]. Artificial Intelligence, 2024, 333
[32] Research on the Competitiveness of Knowledge-Based Workers in Knowledge-Based Organization
Sun Xinqing
Wang Pengju
Ma Xiaohua
[J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INNOVATION AND MANAGEMENT, VOLS I AND II, 2010, : 1409 - 1413
[33] Reconstructing the recent visual past: Hierarchical knowledge-based effects in visual working memory
Poirier, Marie
Heussen, Daniel
Aldrovandi, Silvio
Daniel, Lauren
Tasnim, Saiyara
Hampton, James A.
[J]. PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (06) : 1889 - 1899
[34] Explainable Knowledge reasoning via thought chains for knowledge-based visual question answering
Qiu, Chen
Xie, Zhiqiang
Liu, Maofu
Hu, Huijun
[J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (04)
[35] Reconstructing the recent visual past: Hierarchical knowledge-based effects in visual working memory
Marie Poirier
Daniel Heussen
Silvio Aldrovandi
Lauren Daniel
Saiyara Tasnim
James A. Hampton
[J]. Psychonomic Bulletin & Review, 2017, 24 : 1889 - 1899
[36] MKEAH： Multimodal knowledge extraction and accumulation based on hyperplane embedding for knowledge-based visual question answering
Zhang, Heng
Wei, Zhihua
Liu, Guanming
Wang, Rui
Mu, Ruibin
Liu, Chuanbao
Yuan, Aiquan
Cao, Guodong
Hu, Ning
[J]. Virtual Reality and Intelligent Hardware, 6 (04): : 280 - 291
[37] MKEAH: Multimodal knowledge extraction and accumulation based on hyperplane embedding for knowledge-based visual question answering
Heng ZHANG
Zhihua WEI
Guanming LIU
Rui WANG
Ruibin MU
Chuanbao LIU
Aiquan YUAN
Guodong CAO
Ning HU
[J]. 虚拟现实与智能硬件(中英文), 2024, 6 (04) : 280 - 291
[38] Multimodal Inverse Cloze Task for Knowledge-Based Visual Question Answering
Lerner, Paul
Ferret, Olivier
Guinaudeau, Camille
[J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT I, 2023, 13980 : 569 - 587
[39] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering
Lerner, Paul
Ferret, Olivier
Guinaudeau, Camille
[J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
[40] A knowledge-based image retrieval system integrating semantic and visual features
Allani, Olfa
Zghal, Hajer Baazaoui
Mellouli, Nedra
Akdag, Herman
[J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 1428 - 1436

← 1 2 3 4 5 →