We propose a Tensor Decomposition based algorithm that recognizes the observed action performed by an
unknown person and unknown viewpoint not included in the database. Our previous research aimed motion recognition
from one single viewpoint. In this paper, we extend our approach for human motion recognition from an arbitrary
viewpoint. To achieve this issue, we set tensor database which are multi-dimensional vectors with dimensions
corresponding to human models, viewpoint angles, and action classes. The value of a tensor for a given combination of
human silhouette model, viewpoint angle, and action class is the series of mesh feature vectors calculated each frame
sequence. To recognize human motion, the actions of one of the persons in the tensor are replaced by the synthesized
actions. Then, the core tensor for the replaced tensor is computed. This process is repeated for each combination of
action, person, and viewpoint. For each iteration, the difference between the replaced and original core tensors is
computed. The assumption that gives the minimal difference is the action recognition result. The recognition results
show the validity of our proposed method, the method is experimentally compared with Nearest Neighbor rule. Our
proposed method is very stable as each action was recognized with over 75% accuracy.
This paper proposes a method that identifies and tracks a walking human across discontinuous fields of views of
multiple cameras for the purpose of video surveillance. A typical video surveillance system has multiple cameras, but
there are several spaces within the surveillance area that are not within any of the camera's field of view. Also, there are
discontinuities between the fields of views of adjacent cameras. In such a system, humans need to be tracked across
discontinuous fields of views of multiple cameras. Our proposed model addresses this issue using the concepts of gait
pattern, gait model, and motion signature. Each human's gait pattern is constructed and stored in a database. This gait
pattern spans a tensor space that consists of three dimensions: person, image feature, and spatio-temporal data. A
human's gait model can be constructed from the gait pattern using the "tensor decomposition based approach" described
in this paper. When human(s) appears in one of the camera's field of a view (which is often discontinuous from the other
camera's field of views), the human's motion signature is calculated and compared to each person in the database's gait
model. The person with the gait model that is most similar to the motion signature is identified as same person. After the
person is identified, the person is tracked within the field of view of the camera using the mean-shift algorithm based on
color parameters. We conducted two experiments; the first experiment was identifying and tracking humans in a single
video sequence, and experiments, the percentage of subjects that were correctly identified and tracked was better than
that of two currently widely-used methods, PCA and nearest-neighbor. In the second experiment was the same as the
first experiment but consisted of multiple-cameras with discontinuous views. The second experiment (human tracking
across discontinuous images), shows the potential validity of the proposed method in a typical surveillance system.
This paper proposes a Tensor Decomposition Based method that can recognize an unknown person's action
from a video sequence, where the unknown person is not included in the database (tensor) used for the recognition. The
tensor consists of persons, actions and time-series image features. For the observed unknown person's action, one of the
actions stored in the tensor is assumed. Using the motion signature obtained from the assumption, the unknown person's
actions are synthesized. The actions of one of the persons in the tensor are replaced by the synthesized actions. Then, the
core tensor for the replaced tensor is computed. This process is repeated for the actions and persons. For each iteration, the difference between the replaced and original core tensors is computed. The assumption that gives the minimal
difference is the action recognition result. For the time-series image features to be stored in the tensor and to be extracted
from the observed video sequence, the human body silhouette's contour shape based feature is used. To show the validity
of our proposed method, our proposed method is experimentally compared with Nearest Neighbor rule and Principal
Component analysis based method. Experiments using 33 persons' seven kinds of action show that our proposed method
achieves better recognition accuracies for the seven actions than the other methods.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.