8000 Add entropy based filtering inside the GRPOTrainer. by pramodith · Pull Request #3563 · huggingface/trl · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Add entropy based filtering inside the GRPOTrainer.#3563

Open
pramodith wants to merge 18 commits intohuggingface:mainfrom
pramodith:pramodith/grpo_entropy_filter
Open

Add entropy based filtering inside the GRPOTrainer.#3563
pramodith wants to merge 18 commits intohuggingface:mainfrom
pramodith:pramodith/grpo_entropy_filter

Commits

Commits on Jun 10, 2025

Commits on Jun 12, 2025

Commits on Jun 13, 2025

Commits on Jun 15, 2025

Commits on Jun 22, 2025

0