GitHub - justin4ai/image-captioning: RNN/LSTM/Transformer image caption generator implementation in PyTorch.


Image Captioning

This repository contains RNN, LSTM, and Transformer-based image captioning models implemented during the Computer Vision 2024 course.

The models are trained on the COCO dataset and evaluated on the NICE Challenge dataset.

Note

Beam search is implemented in Transformer_Load.ipynb and load_checkpoint.py. It allows my Transformer to achieve higher BLEU@k and CIDEr scores.
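For readers unfamiliar with beam search, here is a minimal, framework-free sketch of the idea. It is not the code from Transformer_Load.ipynb or load_checkpoint.py: `beam_search`, `step_fn`, and the toy transition table are hypothetical names made up for illustration, and hypotheses are ranked by total log-probability with no length normalization. In a real captioner, `step_fn` would wrap the decoder's forward pass.

```python
import math
from typing import Callable, Dict, List, Tuple

def beam_search(
    step_fn: Callable[[List[int]], Dict[int, float]],
    bos_id: int,
    eos_id: int,
    beam_size: int = 3,
    max_len: int = 20,
) -> List[int]:
    """Return the best token sequence found, ranked by summed log-probability.

    step_fn is an assumed interface: it maps a token prefix to
    {next_token_id: log_probability}.
    """
    beams: List[Tuple[float, List[int]]] = [(0.0, [bos_id])]
    finished: List[Tuple[float, List[int]]] = []

    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates = [
            (score + logp, seq + [tok])
            for score, seq in beams
            for tok, logp in step_fn(seq).items()
        ]
        if not candidates:
            break
        candidates.sort(key=lambda c: c[0], reverse=True)

        # Keep the top beam_size; hypotheses ending in EOS leave the beam.
        beams = []
        for score, seq in candidates[:beam_size]:
            (finished if seq[-1] == eos_id else beams).append((score, seq))
        if not beams:
            break

    finished.extend(beams)  # fall back to unfinished hypotheses if needed
    return max(finished, key=lambda c: c[0])[1]

# Toy "model": the next-token distribution depends only on the last token.
# Token ids: 0 = <bos>, 1 = <eos>, 2 and 3 = ordinary words.
TOY_PROBS = {
    0: {2: 0.6, 3: 0.4},
    2: {1: 0.45, 3: 0.55},
    3: {1: 0.9, 2: 0.1},
}

def toy_step(seq: List[int]) -> Dict[int, float]:
    return {tok: math.log(p) for tok, p in TOY_PROBS[seq[-1]].items()}

greedy = beam_search(toy_step, bos_id=0, eos_id=1, beam_size=1)
wider = beam_search(toy_step, bos_id=0, eos_id=1, beam_size=2)
print(greedy)  # [0, 2, 3, 1], probability 0.297
print(wider)   # [0, 3, 1], probability 0.36
```

With `beam_size=1` the search reduces to greedy decoding and commits to the locally best first word. A beam of two keeps the second-best first word alive long enough to find a caption with higher overall probability.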

I also found that applying n-gram blocking together with beam search helped achieve higher CIDEr scores.
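N-gram blocking prevents the decoder from emitting an n-gram that already appears in the caption generated so far. The sketch below shows one common way to do this, under my own assumptions: `banned_next_tokens` and `block_repeats` are hypothetical helper names, not functions from this repository, and the block size `n` is illustrative.

```python
from typing import Dict, List, Set

def banned_next_tokens(seq: List[int], n: int = 2) -> Set[int]:
    """Tokens that would complete an n-gram already present in seq."""
    if n < 1 or len(seq) < n:
        return set()
    # The last n-1 tokens form the start of the next n-gram.
    prefix = tuple(seq[len(seq) - (n - 1):])
    banned: Set[int] = set()
    for i in range(len(seq) - n + 1):
        if tuple(seq[i:i + n - 1]) == prefix:
            banned.add(seq[i + n - 1])
    return banned

def block_repeats(
    log_probs: Dict[int, float], seq: List[int], n: int = 2
) -> Dict[int, float]:
    """Drop candidates that would repeat an n-gram (same as scoring them -inf)."""
    banned = banned_next_tokens(seq, n)
    return {tok: lp for tok, lp in log_probs.items() if tok not in banned}

# The trigram (5, 6, 7) already occurred, so after "... 5 6" token 7 is blocked.
print(banned_next_tokens([5, 6, 7, 5, 6], n=3))  # {7}
print(block_repeats({7: -0.1, 8: -2.0}, [5, 6, 7, 5, 6], n=3))  # {8: -2.0}
```

In a decoding loop, the filter would be applied to each hypothesis's next-token scores before the top candidates are selected.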

Evaluation

My implementation of the CIDEr evaluation metric is available in the cider-python3 repository.

Summary Report

A summary report of my work is attached as ./report/image_captioning.pdf.
