xdshang (Xindi Shang) / Starred · GitHub
Python 5,155 358 Updated May 11, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 30,875 3,838 Updated Aug 6, 2024

Agent S: an open agentic framework that uses computers like a human

Python 4,621 450 Updated May 9, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,385 1,306 Updated May 12, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 59,752 6,555 Updated May 12, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,618 753 Updated May 12, 2025

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 217 5 Updated Apr 30, 2025
Python 11 Updated Aug 8, 2022

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,831 479 Updated Mar 22, 2025

Vision utilities for web interaction agents 👀

Jupyter Notebook 1,669 105 Updated Nov 25, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 26,402 2,551 Updated Apr 30, 2025

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,999 272 Updated Jun 4, 2024

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

520 30 Updated Oct 28, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,271 102 Updated May 4, 2025

South-East Asia Large Language Models

Shell 305 23 Updated May 6, 2025
Python 1,807 60 Updated Jun 28, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,830 493 Updated Nov 27, 2024

FAIR Sequence Modeling Toolkit 2

Python 898 106 Updated May 12, 2025

Generative Models by Stability AI

Python 25,836 2,867 Updated Apr 4, 2025

Train transformer language models with reinforcement learning.

Python 13,666 1,872 Updated May 9, 2025

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,787 257 Updated Dec 5, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,518 826 Updated Apr 10, 2025

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Python 369 19 Updated Jun 1, 2023

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 407 68 Updated Aug 16, 2024

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT

Python 1,203 135 Updated Jan 18, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,983 4,052 Updated Jul 17, 2024

Video Relation Detection via Multiple Hypothesis Association (ACM MM 2020)

Python 1 Updated Oct 21, 2021

Let us control diffusion models!

Python 32,258 2,885 Updated Feb 25, 2024

This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).

Python 241 15 Updated Jan 10, 2025