8000 ai-all-in repositories · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Change the repository type filter

All

    Repositories list

    • dia

      Public
      A TTS model capable of generating ultra-realistic dialogue in one pass.
      Python
      Apache License 2.0
      1.4k000Updated May 15, 2025May 15, 2025
    • KrillinAI

      Public
      A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容
      Go
      GNU General Public License v3.0
      616000Updated May 14, 2025May 14, 2025
    • Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
      Python
      258000Updated Apr 25, 2025Apr 25, 2025
    • This repository contains the Hugging Face Agents Course.
      MDX
      Apache License 2.0
      1.4k000Updated Feb 12, 2025Feb 12, 2025
    • A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
      MIT License
      436000Updated Jan 30, 2025Jan 30, 2025
    • TEN-Agent

      Public
      TEN Agent is a realtime conversational AI agent powered by TEN. It seamlessly integrates the OpenAI Realtime API, RTC capabilities, and advanced features like weather updates, web search, computer vision, and Retrieval-Augmented Generation (RAG).
      Python
      Apache License 2.0
      739000Updated Dec 14, 2024Dec 14, 2024
    • PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
      Python
      GNU Affero General Public License v3.0
      2.1k000Updated Dec 13, 2024Dec 13, 2024
    • docling

      Public
      Get your documents ready for gen AI
      Python
      MIT License
      2.1k000Updated Dec 9, 2024Dec 9, 2024
    • gradio

      Public
      Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
      Python
      Apache License 2.0
      3k0 10000 00Updated Nov 21, 2024Nov 21, 2024
    • Let your Claude able to think
      JavaScript
      1.8k000Updated Nov 14, 2024Nov 14, 2024
    • mini-omni

      Public
      open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
      Python
      MIT License
      283000Updated Nov 5, 2024Nov 5, 2024
    • candle

      Public
      Minimalist ML framework for Rust
      Rust
      Apache License 2.0
      1.1k000Updated Oct 17, 2024Oct 17, 2024
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      MIT License
      1.8k000Updated Oct 16, 2024Oct 16, 2024
    • real time face swap and one-click video deepfake with only a single image
      Python
      GNU Affero General Public License v3.0
      10k000Updated Aug 16, 2024Aug 16, 2024
    • one-api

      Public
      OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
      JavaScript
      MIT License
      5.3k000Updated Aug 7, 2024Aug 7, 2024
    • lobe-chat

      Public
      🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.
      TypeScript
      Other
      13k000Updated Aug 7, 2024Aug 7, 2024
    • BrainyAI

      Public
      a free and open-source browser sidebar plugin that offers a cost-free alternative to products like Sider, Monica, and Merlin.
      TypeScript
      GNU General Public License v3.0
      100000Updated Jun 17, 2024Jun 17, 2024
    • Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript
      C++
      Apache License 2.0
      742000Updated Jun 15, 2024Jun 15, 2024
    • A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
      C#
      MIT License
      443000Updated Jun 14, 2024Jun 14, 2024
    • Inference and training library for high-quality TTS models.
      Python
      Apache License 2.0
      566000Updated Jun 14, 2024Jun 14, 2024
    • piper

      Public
      A fast, local neural text to speech system
      C++
      MIT License
      747000Updated Jun 5, 2024Jun 5, 2024
    • 🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
      JavaScript
      Apache License 2.0
      449000Updated May 16, 2024May 16, 2024
    • polyglot

      Public
      🤖️ Cross-platform AI language practice app (跨平台AI语言练习应用)
      TypeScript
      GNU General Public License v3.0
      274000Updated Mar 14, 2024Mar 14, 2024
    • 使用Torch VITS语音合成,结合OpenAI ChatGPT进行互动
      Python
      3000Updated Feb 19, 2024Feb 19, 2024
    • paper2gui

      Public
      Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
      Jupyter Notebook
      MIT License
      878000Updated Jul 29, 2023Jul 29, 2023
    0