Spatially Coherent Interpretations of Videos Using Pattern Theory

Cited by: 0

Authors:

Fillipe D. M. de Souza

Sudeep Sarkar

Anuj Srivastava

Jingyong Su

Affiliations:

[1] University of South Florida, Department of Computer Science & Engineering

[2] Florida State University, Department of Statistics

[3] Texas Tech University, Department of Mathematics & Statistics

Source:

International Journal of Computer Vision | 2017 / Vol. 121

Keywords:

Activity detection; Pattern theory; Graphical methods; Compositional approach

DOI:

Not available

Abstract:

Activity interpretation in videos results not only in recognition or labeling of dominant activities, but also in semantic descriptions of scenes. Towards this broader goal, we present a combinatorial approach that assumes availability of algorithms for detecting and labeling objects and basic actions in videos, albeit with some errors. Given these uncertain labels and detected objects, we link them into interpretable structures using domain knowledge, under the framework of Grenander’s general pattern theory. Here a semantic description is built using basic units, termed generators, that represent either objects or actions. These generators have multiple out-bonds, each associated with different types of domain semantics, spatial constraints, and image evidence. The generators combine, according to a set of pre-defined combination rules that capture domain semantics, to form larger configurations that represent video interpretations. This framework derives its representational power from flexibility in the size and structure of configurations. We impose a probability distribution on the configuration space, with inferences generated using a Markov chain Monte Carlo-based simulated annealing process. The primary advantage of the approach is that it handles known challenges (appearance variabilities, errors in object labels, object clutter, simultaneous events, etc.) without the need for exponentially large (labeled) training data. Experimental results demonstrate its ability to successfully provide interpretations under clutter and the simultaneity of events. They show: (1) a performance increase of more than 30% over other state-of-the-art approaches using more than 5000 video units from the Breakfast Actions dataset, and (2) an overall recall and precision improvement of more than 50% and 100%, respectively, on the YouCook dataset.
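
The inference step described in the abstract (a probability distribution over configurations, explored by Markov chain Monte Carlo-based simulated annealing) can be illustrated with a toy sketch. This is not the authors' implementation: the detector scores in `CANDIDATES`, the compatibility table `COMPAT`, the energy function, and the cooling schedule are all hypothetical values chosen for illustration, and the configuration here has a fixed structure (one object generator bonded to one action generator), whereas the paper's configurations vary in size and structure.

```python
import math
import random

# Hypothetical detector confidences for each generator site (assumed values).
CANDIDATES = {
    "obj1": {"bowl": 0.6, "cup": 0.4},
    "act1": {"pour": 0.55, "stir": 0.45},
}

# Hypothetical bond compatibilities (action label, object label), standing in
# for domain knowledge. Higher means the pair is more plausible together.
COMPAT = {
    ("pour", "cup"): 1.0,
    ("stir", "bowl"): 1.0,
    ("pour", "bowl"): 0.3,
    ("stir", "cup"): 0.2,
}


def energy(config):
    """Lower is better: detector evidence plus one action-object bond term."""
    evidence = -sum(math.log(CANDIDATES[site][label])
                    for site, label in config.items())
    bond = -COMPAT.get((config["act1"], config["obj1"]), 0.0)
    return evidence + bond


def anneal(steps=2000, t0=2.0, cooling=0.995, seed=0):
    """Metropolis sampling with a geometric cooling schedule."""
    rng = random.Random(seed)
    sites = sorted(CANDIDATES)
    config = {s: rng.choice(sorted(CANDIDATES[s])) for s in sites}
    best, best_e = dict(config), energy(config)
    t = t0
    for _ in range(steps):
        # Propose relabeling a single randomly chosen generator.
        site = rng.choice(sites)
        proposal = dict(config)
        proposal[site] = rng.choice(sorted(CANDIDATES[site]))
        delta = energy(proposal) - energy(config)
        # Accept downhill moves always, uphill moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            config = proposal
            e = energy(config)
            if e < best_e:
                best, best_e = dict(config), e
        t = max(t * cooling, 1e-3)
    return best, best_e


if __name__ == "__main__":
    print(anneal())
```

In this toy setup the action detector slightly prefers "pour", but the bond term makes "stir" with "bowl" the lowest-energy configuration, which mirrors how contextual constraints can override an erroneous individual label.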

Pages: 5-25

Page count: 20
