Stars
Data and tools for generating and inspecting OLMo pre-training data.
Knowledge transfer from high-resource to low-resource programming languages for Code LLMs
Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4, ...)
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
High-level asynchronous concurrency and networking framework that works on top of either trio or asyncio (usage sketch after this list)
Aidan Bench attempts to measure <big_model_smell> in LLMs.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Helpful tools and examples for working with flex-attention (see the sketch after this list)
A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton.
A PyTorch native platform for training generative AI models
Ring attention implementation with flash attention
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
This repository combines the CPO and SimPO methods for better reference-free preference learning.
go-trafilatura is a Go port of the trafilatura Python library.
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Arena-Hard-Auto: An automatic LLM benchmark.
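For the AnyIO entry above, a minimal sketch of structured concurrency with a task group; it runs on the default asyncio backend and, if trio is installed, on trio as well. The worker names and delays here are illustrative, not part of the library.

```python
import anyio

async def worker(name: str, delay: float) -> None:
    # simulate some I/O-bound work
    await anyio.sleep(delay)
    print(f"{name} finished after {delay}s")

async def main() -> None:
    # the task group waits for all spawned tasks before exiting
    async with anyio.create_task_group() as tg:
        tg.start_soon(worker, "fast", 0.1)
        tg.start_soon(worker, "slow", 0.5)

anyio.run(main)                     # asyncio backend (default)
# anyio.run(main, backend="trio")   # same code on trio, if installed
```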
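And for the flex-attention entry, a minimal sketch of PyTorch's `flex_attention` with a causal block mask, assuming PyTorch >= 2.5; the tensor shapes are arbitrary example values.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

def causal(b, h, q_idx, kv_idx):
    # keep only keys at or before the query position
    return q_idx >= kv_idx

B, H, S, D = 2, 4, 128, 64
# CPU support for flex_attention depends on the PyTorch version; GPU is the common path
device = "cuda" if torch.cuda.is_available() else "cpu"
q, k, v = (torch.randn(B, H, S, D, device=device) for _ in range(3))

block_mask = create_block_mask(causal, B, H, S, S, device=device)
out = flex_attention(q, k, v, block_mask=block_mask)  # shape (B, H, S, D)
```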