LinesHogan (CengYuanhao) / Starred · GitHub

A collection of papers on discrete diffusion models

118 2 Updated May 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,842 1,107 Updated Jun 2, 2025

Scrape articles from WeChat official accounts

Python 818 147 Updated Feb 21, 2025

Pretraining code for a large-scale depth-recurrent language model

Python 771 65 Updated May 29, 2025

NanoGPT (124M) in 3 minutes

Python 2,605 313 Updated May 27, 2025

✔ (Completed) The most comprehensive deep learning notes [Tudui PyTorch] [Li Mu, Dive into Deep Learning] [Andrew Ng, Deep Learning]

Jupyter Notebook 10,505 1,279 Updated May 29, 2025

Synthetic Patient Population Simulator

Java 2,563 736 Updated May 7, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,841 1,489 Updated Apr 24, 2025

Maps between a 1-D space-filling Hilbert curve and N-D coordinates

Python 259 38 Updated Apr 28, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 768 83 Updated Apr 1, 2025

[arXiv 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"

Python 377 11 Updated Mar 10, 2025

[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts

Python 29 1 Updated Sep 26, 2024

A benchmark for emotional intelligence in large language models

Python 302 23 Updated Jul 26, 2024

An Open Large Reasoning Model for Real-World Solutions

Python 1,494 79 Updated May 30, 2025

ChatGPT for wechat https://github.com/AutumnWhj/ChatGPT-wechat-bot

TypeScript 4,726 978 Updated Aug 19, 2024

Jupyter Notebook 931 112 Updated May 9, 2025

An Open Source Toolkit For LLM Distillation

Python 615 77 Updated Jun 1, 2025

O1 Replication Journey

1,990 65 Updated Jan 14, 2025

Numbers every LLM developer should know

4,232 140 Updated Jan 16, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 9,534 943 Updated May 16, 2025

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,381 647 Updated Jan 8, 2025

Fast inference from large language models via speculative decoding

Python 744 70 Updated Aug 22, 2024

Code for Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models

Python 8 Updated Aug 31, 2024

The first autonomous computer program that can do anything to earn money without human operators.

Python 100 7 Updated Mar 1, 2025

LLM101n: Let's build a Storyteller

33,537 1,829 Updated Aug 1, 2024

A plug-and-play tool for visualizing attention-score heatmaps in generative LLMs. Easy to customize for your own needs.

Python 37 3 Updated May 16, 2024

Turn your numbers into 牢大 (Lao Da)!

Typst 38 Updated Sep 18, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

417 15 Updated Apr 18, 2024

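MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.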

3,869 274 Updated Jun 2, 2025