Stars
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Cursor Talk To Figma MCP
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
An open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations. Genki…
A simple screen parsing tool towards pure vision based GUI agent
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A simple animated circular menu for Flutter, Adjustable radius, colors, alignment, animation curve and animation duration.
This is a travel demo built in Flutter using Firebase Data Connect and Firebase Genkit to find ideal itineraries from a database of travel plans.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…
Speech-to-text server framework with next-gen Kaldi
Awesome LLMs on Device: A Comprehensive Survey
[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
A collection of guides and examples for the Gemma open models from Google.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Build LLM-powered Dart/Flutter applications.
Outfit Anyone(最新修复版): Ultra-high quality virtual try-on for Any Clothing and Any Person
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
Silero VAD: pre-trained enterprise-grade Voice Activity Detector