aFewThings (EunBeen Kim) / Starred · GitHub
  • Multimedia Information Lab.
  • South Korea

Starred repositories

The official Python SDK for Model Context Protocol servers and clients

Python 11,424 1,253 Updated May 4, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

Python 14,286 1,257 Updated May 5, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 8,217 825 Updated May 6, 2025

10 Lessons to Get Started Building AI Agents

Jupyter Notebook 18,558 4,775 Updated May 5, 2025

Open-source simulator for autonomous driving research.

C++ 12,411 3,993 Updated May 5, 2025

PyTorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)

Python 8 1 Updated Feb 20, 2025

An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"

Python 31 Updated May 31, 2023

Pre-trained models for bioacoustic classification tasks

Python 43 7 Updated Apr 21, 2025

A benchmark dataset collection for bird sound classification

Jupyter Notebook 47 15 Updated Apr 30, 2025

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,221 166 Updated Aug 19, 2024

iNatSounds Datasets

20 1 Updated Dec 13, 2024

[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

Python 98 4 Updated Oct 5, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 337 31 Updated Sep 29, 2024

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,638 162 Updated Mar 16, 2024

Minimal implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on CIFAR-10 and MNIST

Python 51 5 Updated Dec 16, 2023

Karras et al. (2022) diffusion models for PyTorch

Python 2,452 385 Updated Jan 7, 2025

PyTorch implementation of AudioLCM (ACM-MM'24): efficient and high-quality text-to-audio generation with a latent consistency model.

Python 1,068 151 Updated Apr 3, 2025

OpenMusic: SOTA Text-to-music (TTM) Generation

Python 557 55 Updated Apr 28, 2025

A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline

Python 129 2 Updated Dec 13, 2024

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 562 40 Updated Apr 23, 2024

Code for Fast Training of Diffusion Models with Masked Transformers

Python 401 14 Updated May 15, 2024

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 183 11 Updated Jul 25, 2024

Vector (and Scalar) Quantization, in PyTorch

Python 3,197 258 Updated May 3, 2025

A family of diffusion models for text-to-audio generation.

Python 1,163 99 Updated Dec 31, 2024

Python 14 2 Updated Jun 25, 2024

Generative models for conditional audio generation

Python 3,049 308 Updated Apr 30, 2025

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 214 20 Updated Apr 18, 2025

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Python 644 42 Updated Jul 17, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,025 711 Updated Apr 12, 2025

The dataset and baseline code for Text-to-Audio Grounding (TAG)

Python 42 1 Updated Jan 14, 2025