thomas-yanxin

Regular bencher

It is not that I dote on you in front of everyone; you are at once wild, chivalrous, and gentle.

283 followers · 189 following

Achievements

x2 x3

Highlights

Developer Program Member

Organizations

Lists (13)

Starred repositories

TeleHuman / PBHC

Official Implementation of "KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills"

Python 235 26 Updated Jun 25, 2025

facebookresearch / vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,547 121 Updated Jun 20, 2025

OpenDriveLab / RoboDual

RoboDual: Dual-System for Robotic Manipulation

Python 81 2 Updated Apr 28, 2025

modelscope / MCPBench

The evaluation benchmark on MCP servers

Python 133 5 Updated May 21, 2025

wisent-ai / wisent-guard

This is an open-source version of the representation engineering framework for stopping harmful outputs or hallucinations on the level of activations. 100% free, self-hosted and open-source.

Python 315 20 Updated Jun 24, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 357 19 Updated May 12, 2025

huggingface / yourbench

Forked from sumukshashidhar/yourbench

🤗 Benchmark Large Language Models Reliably On Your Data

Python 331 30 Updated Jun 25, 2025

cyfyifanchen / one-person-company

When in doubt, Vibe mechanics! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!

2,117 176 Updated May 8, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,335 444 Updated Jun 25, 2025

Roblox / cube

Roblox Foundation Model for 3D Intelligence

Jupyter Notebook 738 63 Updated May 15, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 25,341 2,280 Updated Jun 25, 2025

pzhren / InfiniteWorld

Python 62 12 Updated May 30, 2025

UMass-Embodied-AGI / 3D-Mem

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Python 135 7 Updated Jun 11, 2025

Psi-Robot / DexGraspVLA

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 286 18 Updated Jun 17, 2025

fuse-model / FuSe

Python 53 1 Updated Jan 13, 2025

valeoai / VideoActionModel

VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).

Jupyter Notebook 94 4 Updated Jun 11, 2025

gradio-app / fastrtc

The python library for real-time communication

JavaScript 4,043 366 Updated Jun 13, 2025

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,160 100 Updated Mar 2, 2025

Tencent-Hunyuan / Hunyuan3D-1

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,402 266 Updated Jan 21, 2025

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 70,244 8,672 Updated Jun 18, 2025

fishaudio / audio-preprocess

Preprocess Audio for training

Python 351 62 Updated Mar 3, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,960 252 Updated Dec 5, 2024

opendilab / CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. A prototype agent for streaming, full-duplex voice interaction, implemented in just one file!

Python 444 40 Updated Jun 16, 2025

g-battaglia / kerykeion

Data-Driven Astrology 💫 Kerykeion is a Python library for astrology. It generates SVG charts and extracts detailed structured data for birth charts, synastry, transits, composite charts, and more.

Python 420 140 Updated Jun 19, 2025

theriftlab / immanuel-python

Quickly produce both human-readable and JSON-formatted astrology chart data based on the Swiss Ephemeris and astro.com.

Python 78 19 Updated May 8, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,562 9,011 Updated May 30, 2025

chenfei-wu / TaskMatrix

Python 34,440 3,288 Updated Jan 6, 2024

TEN-framework / ten-framework

Open-source framework for all AI agents.

C 6,281 739 Updated Jun 25, 2025

thomas-yanxin / LLM-Inference

5 Updated Sep 24, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,944 198 Updated May 19, 2025