Starred repositories
🦜🔗 Build context-aware reasoning applications
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A High-Quality Real Time Upscaler for Anime Video
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.