Multimedia Information Lab., South Korea
Starred repositories
The official Python SDK for Model Context Protocol servers and clients
An open protocol enabling communication and interoperability between opaque agentic applications.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
10 Lessons to Get Started Building AI Agents
Open-source simulator for autonomous driving research.
PyTorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)
An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"
Pre-trained models for bioacoustic classification tasks
A benchmark dataset collection for bird sound classification
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
This toolbox aims to unify audio generation model evaluation for easier comparison.
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
Minimum implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on cifar10 and mnist
Karras et al. (2022) diffusion models for PyTorch
PyTorch implementation of AudioLCM (ACM-MM'24): efficient, high-quality text-to-audio generation with a latent consistency model.
OpenMusic: SOTA Text-to-music (TTM) Generation
A 6-Million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
Code for Fast Training of Diffusion Models with Masked Transformers
Refactored and updated version of `stable-audio-tools`, an open-source codebase for audio/music generative models originally by Stability AI.
Vector (and Scalar) Quantization, in PyTorch
A family of diffusion models for text-to-audio generation.
Generative models for conditional audio generation
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research.
The dataset and baseline code for Text-to-Audio Grounding (TAG)