EN.601.661 Final Project: Exploring Multiple Visual Cues for Human Action Recognition

Introduction

This is the code repository of the final project for the 19Fall Computer Vision Course (EN.601.661) at JHU. The team members are Heather Han, Zili Huang, Yingda Xia and Yi Zhang. The code is based on MMAction.

The master branch is for RGB modality. Checkout the optical_flow and rgb+kp branch for optical flow modality and human 2D keypoint modality.

Installation

Please refer to INSTALL.md for installation.

Data Preparation

We use a subset of NTU RGB+D dataset. We provide a script to process the dataset and generate necessary files for training and testing.

bash prepare_nturgbd.sh

Test Pretrained Model

We provide pretrained models for testing. Download them to modelzoo/,

bash download_models.sh

Test models on the testset,

bash test_rgb.sh

Ensemble Different Modalities

To ensemble the results of different modalities, we use late fusion which averages the logits from the output of different models. The output results in our experiment are saved in results/. Ensembling using the following script,

python ensemble.py

Training

We provide a script for training RGB network.

bash train_rgb.sh

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
configs		configs
data		data
data_tools		data_tools
mmaction		mmaction
modelzoo		modelzoo
results		results
third_party		third_party
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
.style.yapf		.style.yapf
CONTRIBUTING.md		CONTRIBUTING.md
DATASET.md		DATASET.md
GETTING_STARTED.md		GETTING_STARTED.md
INSTALL.md		INSTALL.md
ISSUES.md		ISSUES.md
LICENSE		LICENSE
MODEL_ZOO.md		MODEL_ZOO.md
README.md		README.md
README_original.md		README_original.md
analyze_result.py		analyze_result.py
compile.sh		compile.sh
download_models.sh		download_models.sh
ensemble.py		ensemble.py
get_result.py		get_result.py
prepare_nturgbd.sh		prepare_nturgbd.sh
read_ntu_skeleton.py		read_ntu_skeleton.py
setup.py		setup.py
test_rgb.sh		test_rgb.sh
train_rgb.sh		train_rgb.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EN.601.661 Final Project: Exploring Multiple Visual Cues for Human Action Recognition

Introduction

Installation

Data Preparation

Test Pretrained Model

Ensemble Different Modalities

Training

About

Releases

Packages

Languages

License

edz-o/VideoActionCues

Folders and files

Latest commit

History

Repository files navigation

EN.601.661 Final Project: Exploring Multiple Visual Cues for Human Action Recognition

Introduction

Installation

Data Preparation

Test Pretrained Model

Ensemble Different Modalities

Training

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages