Highlights
-
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJun 20, 2025 -
-
OLMo Public
Forked from allenai/OLMoModeling, training, eval, and inference code for OLMo
Python Apache License 2.0 UpdatedMay 9, 2025 -
OLMoE Public
Forked from allenai/OLMoEOLMoE: Open Mixture-of-Experts Language Models
Jupyter Notebook Apache License 2.0 UpdatedMay 9, 2025 -
prime Public
Forked from PrimeIntellect-ai/primeprime is a framework for efficient, globally distributed training of AI models over the internet.
Python Apache License 2.0 UpdatedMay 5, 2025 -
lighteval Public
Forked from huggingface/lightevalLighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Python MIT License UpdatedApr 28, 2025 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedApr 18, 2025 -
open-r1 Public
Forked from huggingface/open-r1Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 UpdatedJan 25, 2025 -
smollm Public
Forked from huggingface/smollmEverything about the SmolLM & SmolLM2 family of models
Python Apache License 2.0 UpdatedJan 7, 2025 -
transformers-1 Public
Forked from bigcode-project/transformers -
PaperScraper Public
Forked from NLPatVCU/PaperScraperA web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
Python GNU General Public License v3.0 UpdatedAug 22, 2023 -
beir Public
Forked from beir-cellar/beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Python Apache License 2.0 UpdatedAug 15, 2023 -
-
ggml Public
Forked from ggml-org/ggmlTensor library for machine learning
C MIT License UpdatedMay 20, 2023 -
bloomz.cpp Public
C++ implementation for BLOOM
-
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 20, 2023 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python MIT License UpdatedDec 29, 2022 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedDec 29, 2022 -
optimum Public
Forked from huggingface/optimumποΈ Accelerate training and inference of π€ Transformers with easy to use hardware optimization tools
Python Apache License 2.0 UpdatedDec 22, 2022 -
Megatron-DeepSpeed Public
Forked from TurkuNLP/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
-
bigscience Public
Forked from bigscience-workshop/bigscienceCentral place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Shell Other UpdatedNov 30, 2022 -
fast-stable-diffusion Public
Forked from TheLastBen/fast-stable-diffusionfast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth
-
api-inference-community Public
Forked from huggingface/api-inference-communityPython Apache License 2.0 UpdatedNov 17, 2022 -
diffusers Public
Forked from huggingface/diffusersπ€ Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
-
-
-
-
transformers Public
Forked from huggingface/transformersπ€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedOct 13, 2022 -
accelerate Public
Forked from huggingface/accelerateπ A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedOct 5, 2022