8000 uw-esfrankel · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
@uw-esfrankel

uw-esfrankel

Popular repositories Loading

  1. ppi-rm-training ppi-rm-training Public

    Python

  2. OpenRLHF OpenRLHF Public

    Forked from OpenRLHF/OpenRLHF

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

    Python

  3. PPE PPE Public

    Forked from lmarena/PPE

    Jupyter Notebook

  4. reward-bench reward-bench Public

    Forked from allenai/reward-bench

    RewardBench: the first evaluation tool for reward models.

    Python

  5. evalchemy evalchemy Public

    Forked from mlfoundations/evalchemy

    Automatic evals for LLMs

    HTML

Repositories

Showing 5 of 5 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…

0