Highlights
- Pro
RLHFlow
RLHFlow
Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)
United States of America
BigScience Workshop
bigscience-workshop
Research workshop on large language models - The Summer of Language Models 21
WangLab @ U of T
bowang-lab
BoWang's Lab at University of Toronto
190 Elizabeth St, Toronto, ON M5G 2C4 Canada