R(2+1)D and Mixed-Convolutions for Action Recognition

If you find this work helpful for your research, please cite our following paper:

D. Tran, H. Wang, L. Torresani, J. Ray, Y. LeCun and M. Paluri. A Closer Look at Spatiotemporal Convolutions for Action Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

@inproceedings{r2plus1d_cvpr18,
    title = {A Closer Look at Spatiotemporal Convolutions for Action Recognition},
    author = {Du Tran and Heng Wang and Lorenzo Torresani and Jamie Ray and Yann LeCun and
               Manohar Paluri},
    booktitle = {CVPR},
    year = 2018
}

If you have any question or feedback about the code, please contact: [email protected], [email protected].

Requirements

R2Plus1D requires the following dependencies:

OpenCV (tested on 3.4.1) and ffmpeg.
Caffe2 and its dependencies.
- You will need to build from source and install with USE_OPENCV=1 USE_FFMPEG=1 USE_LMDB=1 python setup.py install for OpenCV, ffmpeg, and lmdb support.
And lmdb, python-lmdb, and pandas.

Installation

You need to install ffmpeg, OpenCV, and caffe2. Caffe2 source build instructions can be found here but make sure you install with USE_OPENCV=1 USE_FFMPEG=1 USE_LMDB=1 python setup.py install. You also need to install lmdb, python-lmdb, and pandas.

Tutorials

We provide some basic tutorials for you to get familar with the code and tools.

License

R2Plus1D is Apache 2.0 licensed, as found in the LICENSE file.

Acknowledgements

The authors would like to thank Ahmed Taei, Aarti Basant, Aapo Kyrola, and the Facebook Caffe2 team for their help in implementing ND-convolution, in optimizing video I/O, and in providing support for distributed training. We are grateful to Joao Carreira for sharing I3D results on the Kinetics validation set.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
data		data
lib		lib
scripts		scripts
tools		tools
tutorials		tutorials
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R(2+1)D and Mixed-Convolutions for Action Recognition

Requirements

Installation

Tutorials

License

Acknowledgements

About

Releases

Packages

Languages

License

loomlike/VMZ

Folders and files

Latest commit

History

Repository files navigation

R(2+1)D and Mixed-Convolutions for Action Recognition

Requirements

Installation

Tutorials

License

Acknowledgements

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages