10000 johan149 (Johan Romero) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View johan149's full-sized avatar

Block or report johan149

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"

Jupyter Notebook 431 28 Updated Apr 30, 2025

The most comprehensive authentication framework for TypeScript

TypeScript 15,218 1,069 Updated Jun 23, 2025

[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"

Python 109 8 Updated Jun 13, 2025

Your memories are in ChatGPT... But nowhere else. Universal Memory MCP makes your memories available to every single LLM. No logins or paywall. One command to set it up.

TypeScript 1,026 92 Updated Jun 22, 2025

🚀 ClipServe: A fast API server for embedding text, images, and performing zero-shot classification using OpenAI’s CLIP model. Powered by FastAPI, Redis, and CUDA for lightning-fast, scalable AI app…

Python 6 1 Updated Sep 29, 2024

Let your GStreamer pipelines describe what they see! 👁️‍🗨️ GstGeminiVision brings Google's Gemini Vision AI to your media streams for some serious (and fun!) video analysis. 🎥🤖✨

C 2 Updated May 10, 2025

Gorse open source recommender system engine

Go 8,998 821 Updated Jun 8, 2025

Using LLMs to implement an open source YouTube video recommendation system.

Python 63 3 Updated Mar 9, 2024

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 11,531 816 Updated Jun 18, 2025

Open-source unified multimodal model

Python 4,272 358 Updated Jun 17, 2025

KBY-AI Empowered KYC Verification For Digital Onboarding Process Including Face Liveness, Face Recognition And ID Card Recognition

21 8 Updated Mar 20, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 25,552 5,632 Updated Jun 16, 2025

Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrastive cross-modality dependency encoding to achieve superior p…

Python 13 Updated Aug 18, 2023

The Video Creation Engine: Edit videos with code, featuring the fastest WebCodecs renderer for in-browser video processing.

TypeScript 783 78 Updated Mar 21, 2025

SoTA open-source TTS

Python 8,630 925 Updated Jun 13, 2025

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 6,191 510 Updated Jun 23, 2025

🐶 Pegada is a beautiful Dog Dating App made with Expo and Next.js

TypeScript 13 4 Updated Apr 19, 2025

workbench for learning and practicing on-device AI technology in real scenario with online-TV on Android phone, powered by ggml(llama.cpp,whisper.cpp...) and FFmpeg and opencv-mobile

C++ 166 22 Updated Jun 12, 2025

SmolVLM2 Demo

Swift 154 15 Updated Mar 20, 2025

A cross-platform tun/tap interface infrastructure powered by Rust

Rust 79 14 Updated Jun 23, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 7,875 651 Updated Jun 17, 2025

A nearly-live implementation of OpenAI's Whisper.

Python 3,009 405 Updated Jun 2, 2025

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

Python 228 35 Updated Jun 15, 2025

Original reference implementation of "VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality"

Python 29 4 Updated Apr 26, 2025

An app that brings language models directly to your phone.

TypeScript 3,886 364 Updated Jun 21, 2025

Multiprotocol (SRT, RTMP and others) live streaming libraries for Android

Kotlin 256 80 Updated Jun 2, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 17,087 1,384 Updated May 28, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 2,583 159 Updated Mar 31, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 3,956 561 Updated May 12, 2025

Have a natural, spoken conversation with AI!

Python 2,620 237 Updated Jun 17, 2025
Next
0