Lists (1)
Sort Name ascending (A-Z)
Stars
A collection of papers on discrete diffusion models
verl: Volcano Engine Reinforcement Learning for LLMs
Pretraining code for a large-scale depth-recurrent language model
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
Minimal reproduction of DeepSeek R1-Zero
maps between 1-D space filling hilbert curve and N-D coordinates
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
[arXiv 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts
A benchmark for emotional intelligence in large language models
An Open Large Reasoning Model for Real-World Solutions
ChatGPT for wechat https://github.com/AutumnWhj/ChatGPT-wechat-bot
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Fast inference from large lauguage models via speculative decoding
Code for Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models
The first autonomous computer program that can do anything to earn money without human operators.
A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。