8000 xiaoqian-shen (Xiaoqian Shen) / Starred · GitHub

More Web Proxy on the site http://driver.im/

xiaoqian-shen

Follow

Xiaoqian Shen xiaoqian-shen

Follow

66 followers · 2 following

Highlights

Pro

Lists (22)

Sort

3d

audio

backbone

datasets

difussion

15 repositories

edit

evaluation

face

22 repositories

interpretability

LLM

18 repositories

QA

quant

RLHF

stock

story

10 repositories

style

T2I

T2Ibenchmark

video

41 repositories

vqgan

webpage

ZSL

Stars

2U1 / Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 803 104 Updated Jun 2, 2025

parthmodi152 / alpha-gpt

Python 24 9 Updated Mar 14, 2025

MrFadiAi / ai-agents-for-trading

Forked from daydy-dev/moon-dev-ai-agents-for-trading

ai agents for trading

Python 22 5 Updated Jan 6, 2025

Monkfishare / Scientific_American

Scientific American epub/pdf 科学美国人

53 4 Updated May 20, 2025

Vision-CAIR / LongVU

[ICML 2025] Official PyTorch implementation of LongVU

Python 380 28 Updated May 8, 2025

metauto-ai / agent-as-a-judge

⚖️ The First Coding Agent-as-a-Judge

Python 541 82 Updated May 14, 2025

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,864 571 Updated Apr 24, 2024

microsoft / PhiCookBook

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,342 420 Updated Jun 2, 2025

facebookresearch / open-eqa

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 285 25 Updated Sep 20, 2024

mutonix / Vript

Python 149 3 Updated Jan 16, 2025

mira-space / MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 446 14 Updated Sep 2, 2024

mira-space / Mira

Python 359 15 Updated Oct 21, 2024

apple / ml-mgie

Python 3,883 250 Updated Mar 15, 2024

omriav / blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

Jupyter Notebook 575 44 Updated Jun 4, 2024

timothybrooks / instruct-pix2pix

Python 6,680 563 Updated Mar 3, 2024

baaivision / Emu

Emu Series: Generative Multimodal Models from BAAI

Python 1,724 85 Updated Sep 27, 2024

google / storybench

Python 49 3 Updated Oct 16, 2023

Fantasy-Studio / Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Python 1,192 104 Updated Nov 28, 2023

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 886 48 Updated Nov 23, 2024

bytedance / Shot2Story

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Python 134 7 Updated Jan 30, 2025

allenai / unified-io-2

Python 614 31 Updated Feb 15, 2024

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,108 274 Updated Jan 10, 2025

xiaoqian-shen / StoryGPT-V

[CVPR 2025] Official PyTorch implementation of StoryGPT-V

Jupyter Notebook 37 3 Updated Mar 13, 2025

Doubiiu / DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,868 227 Updated Sep 8, 2024

Vchitect / LaVie

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 928 63 Updated Nov 13, 2024

YingqingHe / LVDM

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Python 482 21 Updated Nov 16, 2024

yonseivnl / cmota

Python 10 1 Updated Sep 12, 2024

Vchitect / SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 939 63 Updated Nov 13, 2024

AILab-CVC / FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Python 410 25 Updated Jul 11, 2024

AILab-CVC / TaleCrafter

[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters

261 13 Updated Mar 22, 2024

0