More Web Proxy on the site http://driver.im/

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Any plan to release the GRPO code? #6

Open

Viper403 opened this issue Aug 12, 2024 · 2 comments

Viper403 commented

Congratulations to Qwen team! Another outstanding job!

I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?

Thank you very much!

RayWang-iat commented

+1

fzyzcjy commented

+1 Any updates?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment