8000 C-dessert / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View C-dessert's full-sized avatar

Block or report C-dessert

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,349 6,341 Updated Jun 12, 2025

A framework for few-shot evaluation of language models.

Python 9,283 2,464 Updated Jun 16, 2025

Building DeepSeek R1 from Scratch

Jupyter Notebook 626 101 Updated Mar 21, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,590 3,222 Updated Jun 12, 2025

dpo算法实现

Python 38 2 Updated Jun 12, 2024

O1 Replication Journey

1,992 65 Updated Jan 14, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,497 78 Updated May 30, 2025

MonoCoder

Python 6 Updated May 20, 2024

A Parallel Code Evaluation Benchmark

C++ 33 9 Updated Jun 11, 2025

A benchmark suite containing 1 million compilable programs, mined from the largest public C repositories on GitHub.

108 22 Updated Dec 18, 2019

MutAP: A prompt_based learning technique to automatically generate test cases with Large Language Model

Python 40 10 Updated Mar 7, 2025

Decision Making in Non-Stationary Environments with Policy-Augmented Search

Python 5 2 Updated Mar 17, 2024

Automatic AI-powered test suite generator

Python 70 9 Updated Jun 10, 2025

Bandit is a tool designed to find common security issues in Python code.

Python 7,067 654 Updated Jun 15, 2025

The official repo for the paper Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation (AAAI'24).

Java 16 3 Updated Feb 27, 2024

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,483 158 Updated May 24, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,136 101 Updated Aug 5, 2024

Reformer, the efficient Transformer, in Pytorch

Python 2,170 257 Updated Jun 21, 2023

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Python 1,190 102 Updated Oct 22, 2023

Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.

Jupyter Notebook 226 27 Updated Mar 12, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 56,428 6,762 Updated Jun 13, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 16,311 1,675 Updated Jun 15, 2025

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…

Python 2,951 323 Updated Apr 24, 2025

LongBench v2 and LongBench (ACL 25'&24')

Python 900 88 Updated Jan 15, 2025

试一试

PHP 1 Updated Aug 17, 2017
0