Lists (1)
Sort Name ascending (A-Z)
Stars
- All languages
- Arduino
- Blade
- C
- C#
- C++
- CMake
- CSS
- Dart
- Dockerfile
- Eagle
- FreeMarker
- Go
- Groovy
- HTML
- IDL
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Objective-C
- Objective-C++
- PHP
- Perl
- PowerShell
- Processing
- Pug
- Python
- Rich Text Format
- Ruby
- Rust
- SCSS
- Shell
- Swift
- TypeScript
- Vim Script
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Supercharge Git inside VS Code a 10000 nd unlock untapped knowledge within each repository — Visualize code authorship at a glance via Git blame annotations and CodeLens, seamlessly navigate and explore G…
Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
SwinIR: Image Restoration Using Swin Transformer (official repository)
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and power…
Your android camera streamed on your desktop: use as a source for OBS, or as a webcam with v4l2. Free✅, No Ads✅, Open Source✅
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Retrieval-Augmented Generation (RAG) combines retrieval of information from a document database with generative AI to provide accurate and contextually aware answers. In this article, we'll walk th…
The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"
Python tool for converting files and office documents to Markdown.
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Fast and accurate automatic speech recognition (ASR) for edge devices
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
A lightweight Truecaller alternative app utilizing Truecaller’s API for enhanced call identification and spam protection. Efficient, reliable, and user-friendly for seamless communication management.
App update framework for Windows, inspired by Sparkle for macOS
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
The Meilisearch API client written for Dart