uw-esfrankel
Popular repositories Loading
-
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Python
-
-
reward-bench
reward-bench PublicForked from allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
Python
-
Repositories
- ppi-rm-training Public
uw-esfrankel/ppi-rm-training’s past year of commit activity - OpenRLHF Public Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
uw-esfrankel/OpenRLHF’s past year of commit activity - reward-bench Public Forked from allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
uw-esfrankel/reward-bench’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…