Human action recognition has become a hot research topic, and many algorithms have been proposed. Most researchers evaluate their performance on the KTH dataset, but there is no unified standard for how algorithms should be evaluated on this dataset. Different researchers have employed different test setups, so comparisons are neither accurate, fair, nor complete. In order to quantify how much difference arises when different experimental setups are used, we take our own spatiotemporal MoSIFT feature as an example and assess its performance on the KTH dataset under different test scenarios and different partitionings of the data. In all experiments, a support vector machine (SVM) with a chi-square kernel is adopted. First, we evaluate the performance changes resulting from different vocabulary sizes of the codebook and then choose a suitable vocabulary size. Then, we train models using different training dataset partitions and test their performance on the corresponding held-out test sets. Experiments show that the best performance of MoSIFT can reach 96.33% on the KTH dataset. When different n-fold cross-validation methods are used, there can be up to a 10.67% difference in the results, and when different dataset segmentations are used (such as KTH1 and KTH2), the difference in results can be up to 5.8% absolute. In addition, performance changes dramatically when different scenarios are used in the training and test datasets: when training on KTH1 S1+S2+S3+S4 and testing on the KTH1 S1 and S3 scenarios, the performance reaches 97.33% and 89.33%, respectively. This paper shows how different test configurations can skew results, even on a standard dataset. Our recommendation is to use a simple leave-one-out protocol as the most easily replicable, clear-cut partitioning.
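
To make the evaluation protocol concrete, the following is a minimal sketch of the bag-of-words pipeline described above: a codebook is clustered from local descriptors, each clip is represented as a word histogram, and a chi-square-kernel SVM is scored under a leave-one-out style split. It assumes MoSIFT descriptors have already been extracted per clip; the function names, the illustrative vocabulary size, and the grouping by subject are assumptions for illustration, not details fixed by the paper.

```python
# Hedged sketch, not the authors' released code. Assumes per-clip MoSIFT
# descriptors are available as arrays of shape (n_points, descriptor_dim).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC
from sklearn.metrics.pairwise import chi2_kernel
from sklearn.model_selection import LeaveOneGroupOut

def build_codebook(all_descriptors, vocab_size=600):
    """Cluster pooled descriptors into a visual vocabulary (vocab_size is illustrative)."""
    return KMeans(n_clusters=vocab_size, n_init=10).fit(np.vstack(all_descriptors))

def bow_histogram(codebook, clip_descriptors):
    """Quantize one clip's descriptors and return a normalized word histogram."""
    words = codebook.predict(clip_descriptors)
    hist = np.bincount(words, minlength=codebook.n_clusters).astype(float)
    return hist / max(hist.sum(), 1.0)

def leave_one_group_out_accuracy(histograms, labels, groups):
    """Score a chi-square-kernel SVM with leave-one-group-out splits (e.g. one subject held out)."""
    X, y, g = np.asarray(histograms), np.asarray(labels), np.asarray(groups)
    accuracies = []
    for train_idx, test_idx in LeaveOneGroupOut().split(X, y, g):
        K_train = chi2_kernel(X[train_idx])               # precomputed train kernel
        K_test = chi2_kernel(X[test_idx], X[train_idx])   # test-vs-train kernel
        clf = SVC(kernel="precomputed").fit(K_train, y[train_idx])
        accuracies.append(clf.score(K_test, y[test_idx]))
    return float(np.mean(accuracies))
```

Varying `vocab_size` and the split definition (n-fold cross-validation, KTH1 versus KTH2 segmentation, or the leave-one-out protocol recommended above) in this skeleton is exactly the kind of configuration change whose effect on reported accuracy the paper measures.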