8000 qwer55252 (Sangheon Jeong) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View qwer55252's full-sized avatar

Highlights

  • Pro

Block or report qwer55252

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

Python 3,662 276 Updated May 9, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,393 499 Updated Mar 11, 2025

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 700 116 Updated Oct 23, 2023

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Python 104 19 Updated Feb 27, 2022

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,426 6,521 Updated Jan 9, 2025
Python 1,100 335 Updated May 12, 2025

Fast and memory-efficient exact attention

Python 17,349 1,680 Updated May 8, 2025

Digital Signal Processing - Theory and Computational Examples

Jupyter Notebook 826 211 Updated Jan 13, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,499 1,133 Updated Mar 29, 2025

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,287 631 Updated Sep 26, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,211 3,312 Updated Apr 19, 2025

컴퓨터 전공생을 위한 ‘AI 기반 맞춤형’ 모의 면접 어플

Kotlin 7 Updated Oct 12, 2023

A latent text-to-image diffusion model

Jupyter Notebook 70,615 10,434 Updated Jun 18, 2024

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 926 241 Updated Apr 13, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,511 572 Updated Jul 17, 2024

Faster Whisper transcription with CTranslate2

Python 15,978 1,323 Updated Apr 29, 2025

한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

403 22 Updated Apr 23, 2025

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,911 3,444 Updated May 18, 2024

머신러닝 입문자 혹은 스터디를 준비하시는 분들에게 도움이 되고자 만든 repository입니다. (This repository is intented for helping whom are interested in machine learning study)

Jupyter Notebook 2,734 879 Updated Apr 5, 2024

final-project-level2-cv-04 created by GitHub Classroom

Python 4 4 Updated Feb 18, 2023

level2_objectdetection_cv-level2-cv-04 created by GitHub Classroom

Jupyter Notebook 4 4 Updated Dec 9, 2022
0