-
KAUST
- Saudi Arabia
-
17:14
(UTC +03:00) - https://xiaoqian-shen.github.io
- @xiaoqian_shen
- in/xiaoqian-shen-759991264
Highlights
- Pro
Stars
QA
5 repositories
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Official Repository of ChatCaptioner