Cuiunbo (Cui Junbo) / Starred · GitHub
🏠
Working from home

Starred repositories

TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of untrimmed videos.

Python 12 Updated May 8, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 906 61 Updated Jan 31, 2025

Speech-to-text transcription VST3/ARA plugin

C++ 42 1 Updated May 21, 2025

Standard Open Arm 100

2,145 154 Updated May 23, 2025

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 1,633 278 Updated May 23, 2025

The Fast Cross-Platform Package Manager

C++ 7,386 388 Updated May 23, 2025

[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)

5,350 344 Updated May 19, 2025

Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence

Python 428 18 Updated May 16, 2025

Audio Dataset for training CLAP and other models

Python 682 57 Updated Feb 5, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 1,568 153 Updated Jul 15, 2024

Utility for Feetech SCS/STS servo

C++ 30 3 Updated Aug 20, 2023

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 100 3 Updated May 22, 2025

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 296 37 Updated Jan 15, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 9,216 1,024 Updated May 25, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,704 1,655 Updated May 25, 2025

LeKiwi - Low-Cost Mobile Manipulator

698 78 Updated May 20, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,519 1,134 Updated Mar 29, 2025

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python 71 14 Updated Mar 1, 2025

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 259 7 Updated Dec 29, 2024

The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.

Python 131 2 Updated Apr 14, 2025

My learning notes/codes for ML SYS.

Python 2,264 141 Updated May 25, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 565 39 Updated Apr 8, 2025

Build & Optimize your RAG.

Python 662 50 Updated May 13, 2025

Human-Pose-and-Motion History

54 7 Updated Jul 5, 2018

An aggregation of human motion understanding research.

138 8 Updated May 24, 2025

A spoken question answering dataset on SQuAD

Shell 48 6 Updated May 3, 2025

Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"

Python 51 1 Updated Jan 27, 2025