Starred repositories
Your CrewAI Powered Video Editing Assistant
Vision utilities for web interaction agents 👀
A data analysis agent powered by an LLM for querying databases and visualizing results
A framework for Claude Opus to intelligently orchestrate subagents.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Mispronunciation detection using a pretrained and fine-tuned wav2vec2 model for phoneme recognition, with diagnosis and feedback from a large language model
A programmable version of Neil Thapen's Pink Trombone
Build your own generative UI chatbot using the Vercel AI SDK and Google Gemini
🐻 Bear necessities for state management in React
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Crawl a site to generate knowledge files to create your own custom GPT from a URL
LLM deployment on Modal based on llama.cpp, part of a series on Model as a Service (MaaS)
Whisper realtime streaming for long speech-to-text transcription and translation
Fast audio/video transcription using OpenAI's Whisper and Modal; an hour-long audio/video file can be transcribed in ~1 minute
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.
🤖 Build voice-based LLM agents. Modular + open source.
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.