We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Congratulations to Qwen team! Another outstanding job!
I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?
Thank you very much!
The text was updated successfully, but these errors were encountered:
+1
Sorry, something went wrong.
+1 Any updates?
No branches or pull requests
Congratulations to Qwen team! Another outstanding job!
I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?
Thank you very much!
The text was updated successfully, but these errors were encountered: