My name is Angela Yuan. PhD in statistics from Peking University, master's in computer science from UCLA. Research interests: diffusion models, RL, optimization.
- SPPO Public
  Forked from uclaml/SPPO. The official implementation of Self-Play Preference Optimization (SPPO).
- MARS_ Public
  Forked from AGI-Arena/MARS. The official implementation of MARS (forked from AGI-Arena/MARS).
  Python, Apache License 2.0, updated Nov 18, 2024
- SPIN Public
  Forked from uclaml/SPIN. The official implementation of Self-Play Fine-Tuning (SPIN).
  Python, Apache License 2.0, updated Jul 9, 2024
- alpaca_eval Public
  Forked from tatsu-lab/alpaca_eval. An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
  Jupyter Notebook, Apache License 2.0, updated Jun 29, 2024
- SELM Public
  Forked from shenao-zhang/SELM. The official implementation of Self-Exploring Language Models (SELM).
  Python, updated Jun 4, 2024
- TinyLlama Public
  Forked from jzhang38/TinyLlama. The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
  Python, Apache License 2.0, updated May 3, 2024
- trl Public
  Forked from huggingface/trl. Train transformer language models with reinforcement learning.
  Python, Apache License 2.0, updated May 3, 2024
- CS180-Programming-Assignment Public
  Forked from Eydcao/CS180-Programming-Assignment.
  Python, updated Aug 14, 2023