8000 xiaoqian-shen's list / QA · GitHub

More Web Proxy on the site http://driver.im/

xiaoqian-shen

Follow

Xiaoqian Shen xiaoqian-shen

Follow

66 followers · 2 following

Highlights

Pro

Stars

QA

5 repositories

davidnvq / visdial

Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)

Python 29 5 Updated Aug 5, 2021

batra-mlp-lab / visdial

[CVPR 2017] Torch code for Visual Dialog

Lua 228 69 Updated Nov 29, 2018

fawazsammani / nlxgpt

NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)

Python 48 10 Updated Jan 30, 2024

kohjingyu / fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 482 37 Updated Oct 30, 2023

Vision-CAIR / ChatCaptioner

Official Repository of ChatCaptioner

Jupyter Notebook 464 30 Updated Apr 13, 2023

0