Stars
Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)
A simplified implemention of Faster R-CNN that replicate performance from origin paper
Deep Learning 101 with PaddlePaddle (『飞桨』深度学习框架入门教程)