Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.

Jupyter Notebook 3,338 256 Updated Jun 12, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,646 860 Updated May 15, 2025

QwenLM / Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 5,095 411 Updated Jun 20, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,603 1,530 Updated Jun 26, 2025

pointave / Antistentorian

Transcription using parakeet or whisper; either drop in an audio file or speak into your microphone. The text is copied to your clipboard and can be passed to Ollama with a command to summarize or translate it.

Python 1 Updated Jul 9, 2025

kijai / ComfyUI-WanVideoWrapper

Python 3,356 246 Updated Jul 18, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,709 996 Updated Jul 8, 2025

LearningCircuit / local-deep-research

Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini) and includes benchmark tools to test on your own setup. Searches 10+ sources - arXiv, PubMed, GitHub, web, and you…

Python 3,169 316 Updated Jul 19, 2025

SomeOddCodeGuy / WilmerAI

What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain chosen by your LLM. Also allows chat Assistants to be powere…

Python 727 47 Updated Jul 13, 2025

kijai / ComfyUI-HunyuanVideoWrapper

Python 2,504 196 Updated May 12, 2025

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,569 233 Updated Jul 9, 2025

theJayTea / WritingTools

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

Swift 1,584 89 Updated Jul 14, 2025

bmaltais / kohya_ss

Python 11,082 1,457 Updated Jul 18, 2025

KwaiVGI / LivePortrait

Bring portraits to life!

Python 16,633 1,716 Updated Jun 14, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 103,180 13,836 Updated Jul 19, 2025

JarodMica / audiobook_maker

Python 472 82 Updated Jun 12, 2025

TMElyralab / MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,571 194 Updated Mar 5, 2025

n4ze3m / page-assist

Use your locally running AI models to assist you in your web browsing

TypeScript 6,876 614 Updated Jul 19, 2025

point.aveugle pointave

Stars

pointave / SharkBit

ace-step / ACE-Step

spotDL / spotify-downloader

nari-labs / dia

Wan-Video / Wan2.1

QwenLM / Qwen2.5-Omni