Lists (31)
1m
1M-AI-Fluencer
camera
chatbot
CV
database
Deployment
etc_diarization
example repo to learn
export
funny
Generative
head_detect
kalapa
langs: cpp, nim, rust, mojo
math
mlops-project
multitask
nextframe
NLP
Programming
puzzle
rubic
SDK demo
segment-challenge
Speech
startup
stable diffusion
Tools
Trending Demo
UI
Starred repositories
Audio segmentation powered by speaker diarization
A TTS model capable of generating ultra-realistic dialogue in one pass.
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Feather is a free on-device iOS application manager/installer that uses certificates that are part of the Apple Developer Program.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
Hunyuan3D-v2 + TreLLIS for Fast and Memory-Efficient Textured 3D Mesh Generation
A nano two-tower recommendation system
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …
HunyuanDiT with TensorRT and libtorch
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
Dippy Synthetic Speech Subnet
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
Reimplementation of the paper "A reinforcement learning approach for optimizing multiple traveling salesman problems over graphs"
1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
[ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution
🇺🇦 Speech Recognition & Synthesis for Ukrainian