seq2seq_temporal_attention is a tool for automatic video captioning. It is an implementation of the paper *Generating Video Description using Sequence-to-sequence Model with Temporal Attention* (PDF).
- Python 2 or Python 3
  To train a model, Python 2 is required.
- OpenCV
  Make sure that the modules for video are included. If you encounter an error while extracting frames, you may find helpful information in: OpenCV video capture from file fails on Linux. A quick sanity check is sketched below this list.
- Chainer
- youtube-dl
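If you want to verify your OpenCV build before running the pipeline, the following minimal check (not part of this repo; the file name is a placeholder) confirms that video decoding works:

```python
# Sanity check: confirm OpenCV can open and decode a video file,
# i.e. that it was built with video (FFmpeg) support.
import cv2

cap = cv2.VideoCapture("sample.mp4")  # placeholder: any local video file
if not cap.isOpened():
    raise RuntimeError("OpenCV could not open the video; it may lack FFmpeg support")
ok, frame = cap.read()  # ok is False if no frame could be decoded
print("decoded one frame: %s" % ok)
cap.release()
```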
For Windows-specific requirements, read docs/requirements-windows.md.
To test out the tool, run `example.sh`. It generates a caption for an excerpt of the video titled *playing wool ball with my cat : )*. Our models were trained on the Microsoft Video Description Dataset.
```
git clone git@github.com:aistairc/seq2seq_temporal_attention.git --recursive
./download.sh
./example.sh --gpu GPU_ID  # It will generate the caption "a cat is playing with a toy"
```
Note: In most cases, setting `GPU_ID` to `0` will work. If you want to run without a GPU, set the parameter to `-1`.
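For reference, this GPU-ID convention comes from Chainer: a non-negative ID selects that GPU, while `-1` keeps computation on the CPU. A minimal sketch of the convention (illustrative only, not code from this repo; the exact API name varies across Chainer versions):

```python
import chainer

def setup_device(model, gpu_id):
    """Move `model` to GPU `gpu_id`, or leave it on the CPU if gpu_id < 0."""
    if gpu_id >= 0:
        chainer.cuda.get_device_from_id(gpu_id).use()  # select the GPU
        model.to_gpu()                                 # copy parameters to it
    return model
```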
This is an example command to train:

```
cd code
python chainer_seq2seq_att.py \
    --mode train \
    --gpu GPU_ID \
    --batchsize 40 \
    --dropout 0.3 \
    --align ('dot'|'bilinear'|'concat'|'none') \
    --feature feature_file_name \
    output_folder
```
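The `--align` option selects the attention (alignment) score function, and `none` disables attention. Assuming these names correspond to the standard Luong-style scores (an assumption; the variable names and shapes below are illustrative, not the repo's), the three functions compare a decoder state against each encoder state like this:

```python
import numpy as np

def align_score(h_dec, h_enc, mode, W=None, v=None):
    """Score one encoder state against the current decoder state.

    h_dec, h_enc: 1-D state vectors. W and v stand in for learned
    parameters; their shapes differ per mode.
    """
    if mode == 'dot':       # plain dot product of the two states
        return h_dec.dot(h_enc)
    if mode == 'bilinear':  # h_dec^T W h_enc, with W: (d_dec, d_enc)
        return h_dec.dot(W).dot(h_enc)
    if mode == 'concat':    # v^T tanh(W [h_dec; h_enc]), W: (k, d_dec + d_enc)
        return v.dot(np.tanh(W.dot(np.concatenate([h_dec, h_enc]))))
    raise ValueError('unknown alignment: %s' % mode)
```

The scores are then softmax-normalized over the video's time steps to form the temporal attention weights.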
There are two modes for testing: `test` and `test-batch`. The latter runs much faster, but it does not use beam search (see the decoding sketch at the end of this section). Be careful to specify which alignment model you want to use: it has to match the one your pre-trained model was trained with, or decoding will not work correctly.
```
cd code
python chainer_seq2seq_att.py \
    --mode ('test'|'test-batch') \
    --gpu GPU_ID \
    --model path_to_model_file \
    --align ('dot'|'bilinear'|'concat'|'none') \
    --feature feature_file_name \
    output_folder
```
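To see why `test-batch` is faster, compare greedy decoding, which commits to the single best word at every step, with beam search, which keeps several partial captions alive. Below is a minimal sketch of both, assuming a hypothetical `step(token, state)` function that returns a log-probability vector over the vocabulary and the next decoder state (none of these names come from this repo):

```python
import numpy as np

def greedy_decode(step, state, bos, eos, max_len=20):
    """Greedy decoding (test-batch style): keep only the best token per step."""
    tokens, tok = [], bos
    for _ in range(max_len):
        log_probs, state = step(tok, state)
        tok = int(np.argmax(log_probs))
        if tok == eos:
            break
        tokens.append(tok)
    return tokens

def beam_decode(step, state, bos, eos, beam=5, max_len=20):
    """Beam search (test style): keep the `beam` best partial captions."""
    hyps = [(0.0, [bos], state)]                   # (log prob, tokens, state)
    finished = []
    for _ in range(max_len):
        candidates = []
        for lp, toks, st in hyps:
            log_probs, new_st = step(toks[-1], st)
            for tok in np.argsort(log_probs)[-beam:]:   # top-k expansions
                cand = (lp + float(log_probs[tok]), toks + [int(tok)], new_st)
                if tok == eos:
                    finished.append(cand)
                else:
                    candidates.append(cand)
        if not candidates:
            break
        hyps = sorted(candidates, key=lambda c: c[0])[-beam:]  # prune
    best = max(finished or hyps, key=lambda c: c[0])
    return [t for t in best[1][1:] if t != eos]    # drop BOS/EOS markers
```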