Stars
Easily crop and trim videos, play multiple at once, and ... just download it.
ACE-Step: A Step Towards Music Generation Foundation Model
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
A TTS model capable of generating ultra-realistic dialogue in one pass.
Wan: Open and Advanced Large-Scale Video Generative Models
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by the Qwen team at Alibaba Cloud.
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
Transcription using Parakeet or Whisper: either drop in an audio file or speak into your microphone. The text is copied to your clipboard, and Ollama can be prompted to summarize or translate it.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Local Deep Research achieves ~95% on the SimpleQA benchmark (tested with GPT-4.1-mini) and includes benchmark tools to test on your own setup. Searches 10+ sources: arXiv, PubMed, GitHub, web, and you…
What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain chosen by your LLM. Also allows chat Assistants to be powere…
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Use your locally running AI models to assist you in your web browsing.