part-aware lstm is proposed in 《NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis》, which is used for skeleton-based action recognition. It splits the whole body into 5 body part context. The specially designed lstm cell could better extract the context features than the normal one. The cell diagram is shown following:
Comparing with the normal lstm cell, it is not difficult to find that plstm has seperated i, g and g gate corresponding to different body part.
It is inspired. So let us implement it in TensorFlow.
Folder network
includes three files, PartAwareLSTMCell.py
,DataLoader.py
and PLSTM.py
.
PartAwareLSTMCell.py
is the Part-Aware LSTM cell.DataLoader.py
is the data loader used to load and preprocess the skeleton datas. Note that all skeleton data are formatted like Array [Number_clips, {'mat':data, 'view': v, 'class':c, 'actor':a}] data with shape of [Frames, 25, 3]PLSTM.py
is the main train and evaluation entry.
Folder utils
include one file, gendata.py
which uses to generate the formatted data in numpy arrays. Note that all raw skeleton is stored in txt file and i transform them to mat file in MATLAB and i only save the (x,y,z) information.
2018 4.13, add a jupyter notebook script in app
folder used for training and evaluation. The code has not clear yet but could be a reference.
2018 5.7, add a .mat file sample in datas
folder to give a typical example of the .mat data formation.
2020 1.11, add the NTU RGBD 120 skeleton data parser with python in folder ntu_rgbd_parser
, for more detail, go to my another repository [2].
[1]. Shahroudy A, Liu J, Ng T T, et al. Ntu rgb+ d: A large scale dataset for 3d human activity analysis[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 1010-1019.
[2]. https://github.com/FesianXu/NTU_RGBD120_Parser_python
[3]. Liu J, Shahroudy A, Perez M L, et al. NTU RGB+ D 120: A Large-Scale Benchmark for 3D Human Activity Understanding[J]. IEEE transactions on pattern analysis and machine intelligence, 2019.