-
ByteDance
- Singpapore
- https://xdshang.github.io
- https://orcid.org/0000-0002-9308-2927
Stars
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Agent S: an open agentic framework that uses computers like a human
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
DeepEP: an efficient expert-parallel communication library
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Vision utilities for web interaction agents 👀
Open-Sora: Democratizing Efficient Video Production for All
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
A series of large language models trained from scratch by developers @01-ai
Generative Models by Stability AI
Train transformer language models with reinforcement learning.
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
Code and documentation to train Stanford's Alpaca models, and generate the data.
szx503045266 / VidVRD-MHA
Forked from xdshang/VidVRD-helperVideo Relation Detection via Multiple Hypothesis Association (ACM MM 2020)
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).