8000
We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SFT时候,请问你们是将整个question加answer都作为label吗(除image和video pad部分),我看源码中是这样写的。但是常见的不应该是将整个question部分都-100掩码住,只用answer部分参与计算loss吗?
The text was updated successfully, but these errors were encountered:
您好,我们主要是参考R1-V和TRL里面的代码来写的,我目前也不太清楚为啥会是这样,参考代码如下:
https://github.com/huggingface/trl/blob/main/examples/scripts/sft_video_llm.py
https://github.com/Deep-Agent/R1-V/blob/main/src/r1-v/src/open_r1/sft.py
Sorry, something went wrong.
No branches or pull requests
SFT时候,请问你们是将整个question加answer都作为label吗(除image和video pad部分),我看源码中是这样写的。但是常见的不应该是将整个question部分都-100掩码住,只用answer部分参与计算loss吗?
The text was updated successfully, but these errors were encountered: