Stars
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
The most comprehensive authentication framework for TypeScript
[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"
Your memories are in ChatGPT... But nowhere else. Universal Memory MCP makes your memories available to every single LLM. No logins or paywall. One command to set it up.
🚀 ClipServe: A fast API server for embedding text, images, and performing zero-shot classification using OpenAI’s CLIP model. Powered by FastAPI, Redis, and CUDA for lightning-fast, scalable AI app…
Let your GStreamer pipelines describe what they see! 👁️‍🗨️ GstGeminiVision brings Google's Gemini Vision AI to your media streams for some serious (and fun!) video analysis. 🎥🤖✨
Using LLMs to implement an open source YouTube video recommendation system.
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
KBY-AI Empowered KYC Verification For Digital Onboarding Process Including Face Liveness, Face Recognition And ID Card Recognition
State-of-the-art 2D and 3D Face Analysis Project
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrastive cross-modality dependency encoding to achieve superior p…
The Video Creation Engine: Edit videos with code, featuring the fastest WebCodecs renderer for in-browser video processing.
BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr
🐶 Pegada is a beautiful Dog Dating App made with Expo and Next.js
A workbench for learning and practicing on-device AI technology in real scenarios with online TV on Android phones, powered by ggml (llama.cpp, whisper.cpp...), FFmpeg, and opencv-mobile
A cross-platform tun/tap interface infrastructure powered by Rust
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A nearly-live implementation of OpenAI's Whisper.
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Original reference implementation of "VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality"
An app that brings language models directly to your phone.
Multiprotocol (SRT, RTMP and others) live streaming libraries for Android
A TTS model capable of generating ultra-realistic dialogue in one pass.
Everything about the SmolLM2 and SmolVLM family of models
Real-time webcam demo with SmolVLM and llama.cpp server
Have a natural, spoken conversation with AI!