8000 sripathisridhar (Sripathi Sridhar) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View sripathisridhar's full-sized avatar
🏠
Mountain climb
🏠
Mountain climb

Highlights

  • Pro

Block or report sripathisridhar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

Python 465 27 Updated Apr 29, 2025

AudioLDM training, finetuning, evaluation and inference.

Python 248 47 Updated Dec 13, 2024

Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"

Python 65 17 Updated Mar 9, 2024

Audio Large Language Models

Python 514 30 Updated Mar 9, 2025

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Python 45 4 Updated Oct 13, 2024

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 349 19 Updated Sep 1, 2023

Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"

Python 38 Updated Jun 6, 2024

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 198 15 Updated Mar 7, 2025

kmeans using PyTorch

Jupyter Notebook 6 1 Updated Mar 9, 2024

Serverless IMDB API powered by Cloudflare Worker

JavaScript 289 318 Updated Jul 6, 2024

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,168 168 Updated Dec 22, 2022

📄 🤖 Semantic search and workflows for medical/scientific papers

Python 1,396 110 Updated Apr 21, 2025

Implementation of Slot Attention from GoogleAI

Python 426 33 Updated Aug 20, 2024

Official implementation of MW-MAE in Jax

Python 4 1 Updated Feb 14, 2024

Collection of audio-focused loss functions in PyTorch

Python 774 72 Updated Jul 30, 2024
Python 23 2 Updated Aug 26, 2023

Actionable and opinionated no-bs ideas, frameworks and resources from successful operators in crypto to help build, grow and scale web3 products

16 Updated Jun 18, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 139,677 11,661 Updated May 6, 2025

PaSh: Light-touch Data-Parallel Shell Processing

Shell 572 44 Updated Apr 14, 2025

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,420 137 Updated Apr 28, 2025

End-to-End Object Detection with Transformers

Python 14,300 2,553 Updated Mar 12, 2024

Mamba SSM architecture

Python 14,774 1,287 Updated Apr 1, 2025

Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization

Python 16 2 Updated Jan 7, 2025

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 431 40 Updated Apr 24, 2024

A technical report on convolution arithmetic in the context of deep learning

TeX 14,315 2,291 Updated Jun 8, 2023

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

672 39 Updated Aug 3, 2024

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 225 12 Updated Jul 25, 2024
Next
0