-
Notifications
You must be signed in to change notification settings - Fork 41
Issues: NVIDIA/NeMo-RL
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Wrong generation length for HFPolicy with stop strings
bug
Something isn't working
#499
opened Jun 10, 2025 by
KiddoZhu
CUDA OOM when running deepscaler tutorial on 1 A6000
bug
Something isn't working
#493
opened Jun 9, 2025 by
okuchaiev
fix
base
target to install prescribed python and ray version so users can start cluster
#472
opened Jun 3, 2025 by
terrykong
OOM when trying to reproduce the grpo-deepscaler run
bug
Something isn't working
#456
opened May 29, 2025 by
LeonMalteW
gemma-3-4b-it failed on 'Gemma3TextModel' object has no attribute 'model'
bug
Something isn't working
#453
opened May 29, 2025 by
yuki-666
Create a separate
eval
function to replace train
+ eval_mode=True
#448
opened May 27, 2025 by
ashors1
Support
exp_manager.max_time_per_run
to guarantee save before deadline
#444
opened May 27, 2025 by
terrykong
init_process_group timeout on 32B model 16 nodes
bug
Something isn't working
#443
opened May 27, 2025 by
yuki-666
Allow saving checkpoints in sft without running validation
enhancement
New feature or request
#441
opened May 23, 2025 by
yfw
Qwen3 Moe with Megatron backend
help wanted
Extra attention is needed
#424
opened May 20, 2025 by
terrykong
gemma-3-4b-it got nan probs_ratio in both FSDP1/FSDP2
bug
Something isn't working
#419
opened May 20, 2025 by
yuki-666
[Feature] Explicit failure for unmatched model and checkpoints
bug
Something isn't working
#415
opened May 19, 2025 by
KiddoZhu
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.