codestar12 (Cody Blakeney) / Starred · GitHub
  • Austin

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,273 304 Updated May 13, 2025

🤗 Benchmark Large Language Models Reliably On Your Data

Python 306 23 Updated May 20, 2025

An Open Math Pre-training Dataset with 370B Tokens.

XSLT 86 5 Updated Apr 4, 2025

DataComp for Language Models

HTML 1,301 118 Updated Mar 19, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,860 192 Updated Aug 17, 2024

DSIR large-scale data selection framework for language model training

Python 248 19 Updated Apr 7, 2024

Multithreaded Python without the GIL

Python 2,915 105 Updated May 20, 2025

the first library to let you embed a developer agent in your own app!

Python 11,966 1,061 Updated Apr 7, 2024

Dataflow is a data processing library, primarily for machine learning.

Rust 21 Updated Jun 6, 2023

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Go 579 21 Updated Jul 2, 2024
Python 695 73 Updated Jan 10, 2025

A framework for few-shot evaluation of language models.

Python 8,969 2,402 Updated May 21, 2025

Inference code for Llama models

Python 58,255 9,770 Updated Jan 26, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,808 677 Updated Apr 23, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,298 160 Updated May 20, 2025

Supercharge Your Model Training

Python 5,356 440 Updated May 21, 2025

Joining the modern data stack with the modern ML stack

Python 197 22 Updated May 16, 2023

Meta Optimal Transport

Python 102 6 Updated May 25, 2023

Official code for "A Normalized Gaussian Wasserstein Distance for Tiny Object Detection"

Python 227 24 Updated Jun 21, 2022

For optimization algorithm research and development.

Python 517 39 Updated May 21, 2025

Official implementation of the Generalized Wasserstein Dice Loss in PyTorch

Python 80 9 Updated Jul 5, 2022

Optimal Transport Dataset Distance

Python 164 46 Updated May 23, 2022

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,865 493 Updated Jul 30, 2024

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

Python 69 5 Updated Dec 20, 2021

Knowledge distillation (KD) transfers discriminative knowledge from a large and complex model (known as Teacher) to a smaller and faster one (known as Student). Existing advanced knowledge distilla…

Python 6 Updated Jun 3, 2021

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,015 350 Updated Apr 25, 2025

【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.

Python 3,197 718 Updated Jul 2, 2024

Pytorch domain library for recommendation systems

Python 2,177 512 Updated May 21, 2025

Factorization Machine models in PyTorch

Python 1,062 228 Updated Apr 8, 2024

A tool for measuring energy consumption of Intel CPUs

C 334 31 Updated Nov 2, 2023