8000 nauman-daw (Nauman Dawalatabad) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View nauman-daw's full-sized avatar
:octocat:
:octocat:

Block or report nauman-daw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of AWESOME things about domian adaptation

5,286 885 Updated Oct 14, 2024
Python 305 69 Updated Feb 28, 2020

Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert

Python 18 4 Updated Jun 14, 2024

Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!

25,158 431 Updated Jun 3, 2025
Python 295 20 Updated Jun 14, 2024

Voice Conversion With Just Nearest Neighbors

Python 490 68 Updated Mar 18, 2024
Python 103 23 Updated Sep 2, 2021

A python package to analyze and compare voices with deep learning

Python 3,001 448 Updated Oct 12, 2023

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,123 183 Updated Jun 6, 2025

End-to-End Speech Processing Toolkit

Python 9,205 2,280 Updated Jun 16, 2025

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 698 121 Updated Apr 11, 2024
Python 10 5 Updated Mar 21, 2018

In defence of metric learning for speaker recognition

Python 1,109 283 Updated Mar 26, 2024

Mamba SSM architecture

Python 15,106 1,336 Updated May 25, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,454 242 Updated Feb 13, 2025
Python 24 2 Updated Dec 14, 2021

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,522 751 Updated Mar 19, 2025

A resource for learning about Machine learning & Deep Learning

Python 8,122 2,765 Updated Aug 17, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,715 123 Updated Jul 5, 2024

Example code for a neural transducer model.

Jupyter Notebook 61 18 Updated Feb 10, 2024

Learn System Design concepts and prepare for interviews using free resources.

Java 23,844 5,726 Updated Jun 17, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

55,657 5,958 Updated Jun 4, 2025

Python interface to the WebRTC Voice Activity Detector

C 2,268 417 Updated Jul 4, 2024
Python 8,631 509 Updated Oct 9, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,158 862 Updated Jul 6, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,034 4,518 Updated Aug 19, 2024

All Algorithms implemented in Python

Python 201,457 46,894 Updated Jun 9, 2025

Align word sequences and calculate metrics like word error rate (WER)

Java 22 12 Updated Dec 23, 2011

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 939 143 Updated May 19, 2025

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 2,034 196 Updated Apr 10, 2024
Next
0