Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.
-
Updated
Mar 28, 2025 - Python
8000
Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
Test-Time Memory Framework: Control Hallucinations in Foundation Models
A Framework Enabling Web Agents to Master Workflows From Human Demonstration
An experimental project using MCTS to refine LLM responses for better accuracy and decision-making.
Add a description, image, and links to the test-time-compute topic page so that developers can more easily learn about it.
To associate your repository with the test-time-compute topic, visit your repo's landing page and select "manage topics."