Gpwner (Gpwner) / Starred · GitHub

Python 5,076 343 Updated Apr 12, 2025

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,390 182 Updated Mar 31, 2025

The code repository of paper "PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts"

Shell 5 Updated Mar 28, 2025

Python 697 31 Updated Apr 18, 2025

The python library for real-time communication

JavaScript 3,828 330 Updated Apr 23, 2025

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 186 10 Updated May 6, 2025

Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.

164 12 Updated Nov 10, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 555 51 Updated Jun 9, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,162 677 Updated May 5, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,716 129 Updated Apr 21, 2025

Drag & drop UI to build your customized LLM flow

TypeScript 37,904 19,741 Updated May 6, 2025

Python 239 25 Updated Feb 25, 2025

fork of ConcurrentLogHandler

Python 331 58 Updated Dec 10, 2023

Using modified BiSeNet for face parsing in PyTorch

Python 2,436 473 Updated May 21, 2023

Fast neural radiance field training with free camera trajectories

C 944 69 Updated Feb 28, 2024

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Python 726 90 Updated Jan 6, 2024

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,421 1,072 Updated Apr 28, 2025

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Python 341 40 Updated Jan 10, 2023

🔥 General Radiance Field (ICCV, 2021)

Python 282 18 Updated Oct 4, 2021

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.

Python 2,981 520 Updated Feb 2, 2024

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python 487 34 Updated Apr 15, 2024

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 5,754 1,099 Updated Jul 25, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,600 296 Updated Oct 18, 2024

[ICCV2023] Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement

Python 941 90 Updated Nov 11, 2024

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Python 917 155 Updated Apr 4, 2024

WebRTC and ORTC implementation for Python using asyncio

Python 4,630 805 Updated Apr 6, 2025

Apache JMeter open-source load testing tool for analyzing and measuring the performance of a variety of services

Java 8,751 2,173 Updated Apr 23, 2025

Munkres algorithm for Python

Python 229 81 Updated Jul 3, 2024