Subject independent human action recognition using spatio-depth information and meta-cognitive RBF network

Babu, Venkatesh R and Savitha, R and Suresh, S and Agarwal, Bhuvnesh (2013) Subject independent human action recognition using spatio-depth information and meta-cognitive RBF network. In: ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 26 (9). pp. 2010-2021.

Official URL: http://dx.doi.org/10.1016/j.engappai.2013.07.008

Abstract

In this paper, we present a machine learning approach for subject-independent human action recognition using a depth camera, emphasizing the importance of depth in recognizing actions. The proposed approach uses flow information along all three dimensions to classify an action. We first compute the 2-D optical flow and then combine it with the depth image to obtain the depth flow (Z motion vectors). The resulting 3-D flow captures the dynamics of the actions in space-time. Feature vectors are obtained by averaging the 3-D motion over a grid laid over the silhouette in a hierarchical fashion. These hierarchical fine-to-coarse windows capture the motion dynamics of the object at various scales. The extracted features are used to train a Meta-cognitive Radial Basis Function Network (McRBFN) that uses a Projection Based Learning (PBL) algorithm, henceforth referred to as PBL-McRBFN. PBL-McRBFN begins with zero hidden neurons and builds the network based on the best human learning strategy, namely, self-regulated learning in a meta-cognitive environment. When a sample is used for learning, PBL-McRBFN uses the sample overlapping conditions and a projection based learning algorithm to estimate the parameters of the network. The performance of PBL-McRBFN is compared with that of Support Vector Machine (SVM) and Extreme Learning Machine (ELM) classifiers, with every person and action represented in both the training and testing datasets. The performance study shows that PBL-McRBFN outperforms these classifiers in recognizing actions in 3-D. Further, a subject-independent study is conducted using a leave-one-subject-out strategy to test generalization performance. This study shows that McRBFN is capable of generalizing actions accurately. The performance of the proposed approach is benchmarked on the Video Analytics Lab (VAL) dataset and the Berkeley Multimodal Human Action Database (MHAD). (C) 2013 Elsevier Ltd. All rights reserved.
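
The feature extraction outlined in the abstract (depth flow derived from 2-D optical flow, then hierarchical grid averaging over the silhouette) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `depth_flow` and `hierarchical_features`, the nearest-neighbour depth lookup, the bounding-box grid, and the grid levels (1, 2, 4) are all assumptions made for illustration, since the abstract does not specify them. The 2-D flow is assumed to come from any dense optical flow estimator.

```python
import numpy as np


def depth_flow(depth_prev, depth_next, flow_xy):
    """Hypothetical Z motion: depth at the flow-displaced pixel in the next
    frame minus depth at the original pixel in the previous frame."""
    h, w = depth_prev.shape
    ys, xs = np.mgrid[0:h, 0:w]
    # Nearest-neighbour lookup of the displaced position, clipped to the image.
    xn = np.clip(np.rint(xs + flow_xy[..., 0]).astype(int), 0, w - 1)
    yn = np.clip(np.rint(ys + flow_xy[..., 1]).astype(int), 0, h - 1)
    return depth_next[yn, xn].astype(float) - depth_prev.astype(float)


def hierarchical_features(flow_xyz, mask, levels=(1, 2, 4)):
    """Average the 3-D flow over n x n grids laid over the silhouette's
    bounding box, for each n in `levels` (assumed values)."""
    ys, xs = np.nonzero(mask)
    y0, y1 = ys.min(), ys.max() + 1
    x0, x1 = xs.min(), xs.max() + 1
    feats = []
    for n in levels:
        y_edges = np.linspace(y0, y1, n + 1).astype(int)
        x_edges = np.linspace(x0, x1, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                rows = slice(y_edges[i], y_edges[i + 1])
                cols = slice(x_edges[j], x_edges[j + 1])
                cell_mask = mask[rows, cols]
                if cell_mask.any():
                    # Mean (dx, dy, dz) over silhouette pixels in this cell.
                    feats.append(flow_xyz[rows, cols][cell_mask].mean(axis=0))
                else:
                    # No silhouette pixels in this cell: zero motion.
                    feats.append(np.zeros(3))
    return np.concatenate(feats)
```

With levels (1, 2, 4) this gives 1 + 4 + 16 = 21 cells and a 63-dimensional feature vector per frame pair. How the paper aggregates such vectors over time and feeds them to PBL-McRBFN is not stated in the abstract and is left out here.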

Item Type:	Journal Article
Publication:	ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
Publisher:	PERGAMON-ELSEVIER SCIENCE LTD
Additional Information:	Copyright for this article belongs to Elsevier.
Keywords:	Action recognition; 3-D optical flow; Kinect depth sensor; Projection based learning; Meta-cognition and self-regulated learning
Department/Centre:	Division of Interdisciplinary Sciences > Supercomputer Education & Research Centre
Date Deposited:	04 Nov 2013 06:31
Last Modified:	04 Nov 2013 06:31
URI:	http://eprints.iisc.ac.in/id/eprint/47674
