-
The University of Texas at Austin
- Austin, TX
-
04:26
(UTC -05:00) - cuijiaxun.github.io
- https://orcid.org/0009-0009-1987-9549
- @cuijiaxun
Stars
A Pokemon battle engine that can search through Pokemon states
SGLang is a fast serving framework for large language models and vision language models.
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
verl: Volcano Engine Reinforcement Learning for LLMs
MLGym A New Framework and Benchmark for Advancing AI Research Agents
Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.
TradingAgents: Multi-Agents LLM Financial Trading Framework
The official implementation of flow Q-learning (FQL)
Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows.
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
[IEEE-TVT(2025)]Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-making Framework
Model Predictive Path Integral (MPPI) with approximate dynamics implemented in pytorch
ExORL: Exploratory Data for Offline Reinforcement Learning
Interaction Dataset Python Scripts
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Simple language-driven navigation tasks for studying compositional learning
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Booster Gym is a reinforcement learning (RL) framework designed for humanoid robot locomotion developed by Booster Robotics.
The Booster T1 Robocup official demo allows the robot to make autonomous decisions to kick the ball and complete the full Robocup match. It includes three programs: vision, brain, and game_controller.
Unified framework for robot learning built on NVIDIA Isaac Sim
A Telegram bot to recommend arXiv papers
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).