angelahzyuan (angelahzyuan) / Repositories · GitHub

  • SPPO Public

    Forked from uclaml/SPPO

    The official implementation of Self-Play Preference Optimization (SPPO)

    Python 2 1 Apache License 2.0 Updated Mar 27, 2025
  • HTML Updated Dec 9, 2024
  • MARS_ Public

    Forked from AGI-Arena/MARS

    The official implementation of MARS (forked from AGI-Arena/MARS)

    Python Apache License 2.0 Updated Nov 18, 2024
  • SPIN Public

    Forked from uclaml/SPIN

    The official implementation of Self-Play Fine-Tuning (SPIN)

    Python Apache License 2.0 Updated Jul 9, 2024
  • An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

    Jupyter Notebook Apache License 2.0 Updated Jun 29, 2024
  • SELM Public

    Forked from shenao-zhang/SELM

    The official implementation of Self-Exploring Language Models (SELM)

    Python Updated Jun 4, 2024
  • TinyLlama Public

    Forked from jzhang38/TinyLlama

    The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

    Python Apache License 2.0 Updated May 3, 2024
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated May 3, 2024
  • v202 Public

    Forked from mlresearch/v202

    Proceedings of ICML 2023

    TeX Updated Aug 19, 2023
  • Python Updated Aug 14, 2023