We address scene layout modeling for recognizing agent-in-place actions, which are actions associated with agents who perform them and the places where they occur, in the context of outdoor home surveillance. We introduce a novel representation to model the geometry and topology of scene layouts so that a network can generalize from the layouts observed in the training scenes to unseen scenes in the test set. This Layout-Induced Video Representation (LIVR) abstracts away low-level appearance variance and encodes geometric and topological relationships of places to explicitly model scene layout. LIVR partitions the semantic features of a scene into different places to force the network to learn generic place-based feature descriptions which are independent of specific scene layouts; then, LIVR dynamically aggregates features based on connectivities of places in each specific scene to model its layout. We introduce a new Agent-in-Place Action (APA) dataset to show that our method allows neural network models to generalize significantly better to unseen scenes.
机构:
NHK Japan Broadcasting Corp, Tokyo, Japan
NHK Japan Broadcasting Corp, Integrated Broadcast Broadband Syst Res Div, Sci & Technol Res Labs NHK, Tokyo, JapanNHK Japan Broadcasting Corp, Tokyo, Japan
Takahashi, Masaki
Naemura, Masahide
论文数: 0引用数: 0
h-index: 0
机构:
NHK Japan Broadcasting Corp, Tokyo, Japan
NHK Japan Broadcasting Corp, Integrated Broadcast Broadband Syst Res Div, Sci & Technol Res Labs NHK, Tokyo, JapanNHK Japan Broadcasting Corp, Tokyo, Japan
Naemura, Masahide
Fujii, Mahito
论文数: 0引用数: 0
h-index: 0
机构:
NHK Japan Broadcasting Corp, Tokyo, Japan
NHK Japan Broadcasting Corp, Human & Informat Sci Res Div, Sci & Technol Res Labs NHK, Tokyo, JapanNHK Japan Broadcasting Corp, Tokyo, Japan
Fujii, Mahito
Little, James J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ British Columbia, Dept Comp Sci, Comp Sci, Vancouver, BC, CanadaNHK Japan Broadcasting Corp, Tokyo, Japan
Little, James J.
INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT,
2014,
5
(03):
: 28
-
46