AI
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Stable Diffusion web UI
Using Low-rank adaptation to quickly fine-tune diffusion models.
An implementation of MTCNN with no framework; requires only OpenCV and OpenBLAS, and supports Linux and Windows.
A tool that generates character samples for OCR training.
🚀 AI voice cloning: clone your voice within 5 seconds and generate arbitrary speech content in real time
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A generative speech model for daily dialogue.
GLM-4 series: Open Multilingual Multimodal Chat LMs
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Fixes the issue where Cursor shows the following messages during the free subscription period: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
An Open Source text-to-speech system built by inverting Whisper.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
An AI plugin that assists reverse-engineering analysis in IDA by quickly summarizing what the code's functions do, speeding up analysis.
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.