Stars
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"
✨✨Latest Advances on Multimodal Large Language Models
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Code Release for DiffusionRig (CVPR 2023)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Nightly release of ControlNet 1.1
A high-performance implementation of Unsupervised Image Segmentation by Backpropagation - Asako Kanezaki
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation