8000 Jiaxin-Ye (Jiaxin Ye) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Jiaxin-Ye's full-sized avatar
💭
Keep Improving
💭
Keep Improving

Block or report Jiaxin-Ye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 147,405 12,491 Updated Jul 24, 2025

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 92,398 47,680 Updated Jul 24, 2025

 Now we have become very big, Different from the original idea. Collect premium software in various categories.

JavaScript 85,257 6,606 Updated Jul 23, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 62,105 6,289 Updated Jul 20, 2025

🙌 OpenHands: Code Less, Make More

Python 61,067 7,206 Updated Jul 24, 2025

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,746 9,038 Updated May 30, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 41,585 5,419 Updated Aug 16, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,245 4,563 Updated Aug 19, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 33,424 3,561 Updated Apr 19, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,664 6,574 Updated Jun 10, 2025

Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brig CD27 htness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!

25,918 452 Updated Jun 30, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,883 2,360 Updated Jul 23, 2025

Fully open reproduction of DeepSeek-R1

Python 25,121 2,341 Updated Jul 21, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 23,068 1,561 Updated Jul 23, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,568 2,644 Updated Jul 3, 2025

CVPR 2025 论文和开源项目合集

20,554 2,713 Updated Jul 2, 2025

Fast and memory-efficient exact attention

Python 18,523 1,826 Updated Jul 24, 2025

🔠Foreign language reading and translation assistant based on copy and translate.

TypeScript 17,434 1,935 Updated Nov 29, 2024

PyTorch implementations of Generative Adversarial Networks.

Python 17,175 4,098 Updated Jun 18, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,897 1,037 Updated Jul 11, 2025

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15,503 3,052 Updated Jun 13, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 15,364 1,629 Updated Jul 16, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,193 3,010 Updated Jul 24, 2025

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,895 3,282 Updated Nov 14, 2024

📋 A list of open LLMs available for commercial use.

12,211 885 Updated Feb 13, 2025

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,209 2,605 Updated Jun 22, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 11,659 1,179 Updated Jul 23, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,607 1,143 Updated Nov 14, 2024

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,816 1,104 Updated Jun 21, 2024
Next
0