xdshang

Xindi Shang xdshang

Multimodal Research Scientist

62 followers · 89 following

Stars

bytedance / MegaTTS3

Python 5,155 358 Updated May 11, 2025

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 30,875 3,838 Updated Aug 6, 2024

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 4,621 450 Updated May 9, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,385 1,306 Updated May 12, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 59,752 6,555 Updated May 12, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,618 753 Updated May 12, 2025

OpenMOSS / VLABench

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 217 5 Updated Apr 30, 2025

conghui1002 / UCDIR

Python 11 Updated Aug 8, 2022

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,831 479 Updated Mar 22, 2025

reworkd / tarsier

Vision utilities for web interaction agents 👀

Jupyter Notebook 1,669 105 Updated Nov 25, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 26,402 2,551 Updated Apr 30, 2025

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,999 272 Updated Jun 4, 2024

teacherpeterpan / self-correction-llm-papers

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

520 30 Updated Oct 28, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,271 102 Updated May 4, 2025

aisingapore / sealion

South-East Asia Large Language Models

Shell 305 23 Updated May 6, 2025

ytongbai / LVM

Python 1,807 60 Updated Jun 28, 2024

01-ai / Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,830 493 Updated Nov 27, 2024

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 898 106 Updated May 12, 2025

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,836 2,867 Updated Apr 4, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 13,666 1,872 Updated May 9, 2025

OpenBMB / BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,787 257 Updated Dec 5, 2023

cleanlab / cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,518 826 Updated Apr 10, 2025

OpenGVLab / CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Python 369 19 Updated Jun 1, 2023

RetroCirce / HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 407 68 Updated Aug 16, 2024

juncongmoo / chatllama

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT

Python 1,203 135 Updated Jan 18, 2025

chenfei-wu / TaskMatrix

Python 34,482 3,292 Updated Jan 6, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,983 4,052 Updated Jul 17, 2024

szx503045266 / VidVRD-MHA

Forked from xdshang/VidVRD-helper

Video Relation Detection via Multiple Hypothesis Association (ACM MM 2020)

Python 1 Updated Oct 21, 2021

lllyasviel / ControlNet

Let us control diffusion models!

Python 32,258 2,885 Updated Feb 25, 2024

chenhaoxing / DiffusionInst

This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).

Python 241 15 Updated Jan 10, 2025
