8000 xiaoqian-shen (Xiaoqian Shen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xiaoqian-shen's full-sized avatar

Highlights

  • Pro

Block or report xiaoqian-shen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 803 104 Updated Jun 2, 2025
Python 24 9 Updated Mar 14, 2025

ai agents for trading

Python 22 5 Updated Jan 6, 2025

Scientific American epub/pdf 科学美国人

53 4 Updated May 20, 2025

[ICML 2025] Official PyTorch implementation of LongVU

Python 380 28 Updated May 8, 2025

⚖️ The First Coding Agent-as-a-Judge

Python 541 82 Updated May 14, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,864 571 Updated Apr 24, 2024

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,342 420 Updated Jun 2, 2025

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 285 25 Updated Sep 20, 2024
Python 149 3 Updated Jan 16, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 446 14 Updated Sep 2, 2024
Python 359 15 Updated Oct 21, 2024
Python 3,883 250 Updated Mar 15, 2024

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

Jupyter Notebook 575 44 Updated Jun 4, 2024

Emu Series: Generative Multimodal Models from BAAI

Python 1,724 85 Updated Sep 27, 2024
Python 49 3 Updated Oct 16, 2023

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Python 1,192 104 Updated Nov 28, 2023

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 886 48 Updated Nov 23, 2024

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Python 134 7 Updated Jan 30, 2025
Python 614 31 Updated Feb 15, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,108 274 Updated Jan 10, 2025

[CVPR 2025] Official PyTorch implementation of StoryGPT-V

Jupyter Notebook 37 3 Updated Mar 13, 2025

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,868 227 Updated Sep 8, 2024

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 928 63 Updated Nov 13, 2024

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Python 482 21 Updated Nov 16, 2024
Python 10 1 Updated Sep 12, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 939 63 Updated Nov 13, 2024

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Python 410 25 Updated Jul 11, 2024

[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters

261 13 Updated Mar 22, 2024
Next
0