- United Kingdom
-
10:12
(UTC +01:00) - @PTudosiu
Lists (10)
Sort Name ascending (A-Z)
Stars
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
AeroSpace is an i3-like tiling window manager for macOS
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
SIGGRAPH 2024 Conference Paper: Deep Fourier-based Arbitrary-scale Super-resolution for Real-time Rendering
FastVideo is a unified framework for accelerated video generation.
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution'
Python bindings to the Zstandard (zstd) compression library
[CVPR 2025] Official implementation of the paper "Generative Inbetweening through Frame-wise Conditions-Driven Video Generation"
Enhance-A-Video: Better Generated Video for Free
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
A generative world for general-purpose robotics & embodied AI learning.
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Port of OpenAI's Whisper model in C/C++
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
A natural language interface for computers
Create a Conda environment file from a Python project using uv.
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
Simple, unified interface to multiple Generative AI providers
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Efficient Triton Kernels for LLM Training