Lists (21)
AI Agent
Audio Enhancement
Chrome Extension Learning
Data Extraction
DataGemma
Full Stack Framework
Fully written in Python, YC W23 batch
Gen AI
IT Automation
JVM Learning
ML
News Feed
PaaS
Platform as a Service
Productivity
Python Data Science
Useful for a Streamlit project around price-to-rent ratio or other data from Zillow/Redfin for real estate
SaaS
Tools
TTS
UI Development
Visual
Web Task Automation
Stars
Self-hosted audiobook and podcast server
A Conversational Speech Generation Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Evolution API is an open-source WhatsApp integration API
🤗 smolagents: a barebones library for agents that think in code.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers).
first base model for full-duplex conversational audio
🪄 Create rich visualizations with AI
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Awesome RSS feeds - A curated list of RSS feeds (and OPML files) used in Recommended Feeds and local news sections of Plenary - an RSS reader, article downloader and a podcast player app for android
A community-maintained Python framework for creating mathematical animations.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Detect whether or not an audio file was generated by NotebookLM
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks
Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.replicate.dev`
A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
Implementing the 4 agentic patterns from scratch
Easily train a good VC model with voice data <= 10 mins!
React UI + elegant infrastructure for AI Copilots, AI chatbots, and in-app AI agents. The Agentic last-mile 🪁
Foundational model for human-like, expressive TTS
Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, and Llama-3.2 models.
Composable building blocks to build Llama Apps
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key