Learning Relational Event Models from Video

被引:29
|
作者
Dubba, Krishna S. R. [1 ]
Cohn, Anthony G. [1 ]
Hogg, David C. [1 ]
Bhatt, Mehut [2 ]
Dylla, Frank [2 ]
机构
[1] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Bremen, Cognit Syst, SFB TR Spatial Cognit 8, D-28334 Bremen, Germany
关键词
REPRESENTATION; DEFINITIONS; RECOGNITION;
D O I
10.1613/jair.4395
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event models obtained automatically from video can be used in applications ranging from abnormal event detection to content based video retrieval. When multiple agents are involved in the events, characterizing events naturally suggests encoding interactions as relations. Learning event models from this kind of relational spatio-temporal data using relational learning techniques such as Inductive Logic Programming (ILP) hold promise, but have not been successfully applied to very large datasets which result from video data. In this paper, we present a novel framework REMIND (Relational Event Model INDuction) for supervised relational learning of event models from large video datasets using ILP. Efficiency is achieved through the learning from interpretations setting and using a typing system that exploits the type hierarchy of objects in a domain. The use of types also helps prevent over generalization. Furthermore, we also present a type-refining operator and prove that it is optimal. The learned models can be used for recognizing events from previously unseen videos. We also present an extension to the framework by integrating an abduction step that improves the learning performance when there is noise in the input data. The experimental results on several hours of video data from two challenging real world domains (an airport domain and a physical action verbs domain) suggest that the techniques are suitable to real world scenarios.
引用
收藏
页码:41 / 90
页数:50
相关论文
共 50 条
  • [41] Efficient Learning of Relational Object Class Models
    Aharon Bar-Hillel
    Daphna Weinshall
    International Journal of Computer Vision, 2008, 77 : 175 - 198
  • [42] Learning the Structure of Dynamic Hybrid Relational Models
    Nitti, Davide
    Ravkic, Irma
    Davis, Jesse
    De Raedt, Luc
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1283 - 1290
  • [43] Learning directed relational models with recursive dependencies
    Oliver Schulte
    Hassan Khosravi
    Tong Man
    Machine Learning, 2012, 89 : 299 - 316
  • [44] Statistical Relational Learning With Unconventional String Models
    Vu, Mai H.
    Zehfroosh, Ashkan
    Strother-Garcia, Kristina
    Sebok, Michael
    Heinz, Jeffrey
    Tanner, Herbert G.
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [45] Efficient learning of relational object class models
    Bar-Hillel, Aharon
    Weinshall, Daphna
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 77 (1-3) : 175 - 198
  • [46] Online Learning of Weighted Relational Rules for Complex Event Recognition
    Katzouris, Nikos
    Michelioudakis, Evangelos
    Artikis, Alexander
    Paliouras, Georgios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT II, 2019, 11052 : 396 - 413
  • [47] Discovering relational and numerical expressions from plan traces for learning action models
    José Á. Segura-Muros
    Raúl Pérez
    Juan Fernández-Olivares
    Applied Intelligence, 2021, 51 : 7973 - 7989
  • [48] Discovering relational and numerical expressions from plan traces for learning action models
    Segura-Muros, Jose A.
    Perez, Raul
    Fernandez-Olivares, Juan
    APPLIED INTELLIGENCE, 2021, 51 (11) : 7973 - 7989
  • [49] From video replay to the relational circuit to Threeing
    Ryan, Paul
    LEONARDO, 2006, 39 (03) : 199 - 203
  • [50] MEVG: Multi-event Video Generation with Text-to-Video Models
    Oh, Gyeongrok
    Jeong, Jaehwan
    Kim, Sieun
    Byeon, Wonmin
    Kim, Jinkyu
    Kim, Sungwoong
    Kim, Sangpil
    COMPUTER VISION-ECCV 2024, PT XLIII, 2025, 15101 : 401 - 418