8000 xiongchenyan (Chenyan Xiong) / Starred · GitHub

More Web Proxy on the site http://driver.im/

xiongchenyan

Follow

Chenyan Xiong xiongchenyan

Follow

Associate Professor at CMU

224 followers · 19 following

Seattle
www.cs.cmu.edu/~cx/

Achievements

Achievements

Highlights

Pro

Stars

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,247 1,502 Updated Jun 26, 2025

jacobdunefsky / transcoder_circuits

Jupyter Notebook 136 26 Updated Nov 17, 2024

safety-research / circuit-tracer

Python 2,066 202 Updated Jun 25, 2025

OSU-NLP-Group / Mind2Web-2

Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

Python 16 1 Updated Jun 28, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,088 1,665 Updated Jun 28, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,222 824 Updated Jun 27, 2025

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

Python 437 46 Updated Jun 1, 2025

cmu-flame / FLAME-MoE

Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models

Jupyter Notebook 20 2 Updated May 28, 2025

assafelovic / gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 22,105 2,904 Updated Jun 26, 2025

cxcscmu / Craw4LLM

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

Python 630 57 Updated Feb 24, 2025

RUC-NLPIR / WebThinker

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,087 116 Updated Jun 20, 2025

hao-ai-lab / FastVideo

FastVideo is a unified framework for accelerated video generation.

Python 1,560 105 Updated Jun 28, 2025

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,191 1,634 Updated Apr 26, 2025

MadryLab / context-cite

Attribute (or cite) statements generated by LLMs back to in-context information.

Jupyter Notebook 242 18 Updated Oct 8, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,908 2,312 Updated Jun 26, 2025

nflverse / nflverse-data

Automated nflverse data repository

R 236 21 Updated Jun 1, 2025

qmaclean / all22_computer_vision

Computer Vision tags on all 22 football film

Jupyter Notebook 2 Updated Oct 7, 2022

jstrieb / panopto-download

Script to facilitate batch downloading of lecture videos from Panopto

Python 52 2 Updated Aug 7, 2021

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,156 81 Updated Mar 17, 2025

thunlp / Optima

Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"

Python 59 4 Updated Nov 14, 2024

bytedance / HLLM

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 405 51 Updated Oct 4, 2024

Yu-Group / clinical-rule-vetting

Learning clinical-decision rules with interpretable models.

Jupyter Notebook 20 11 Updated Aug 10, 2023

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,746 919 Updated Jun 17, 2025

openai / sparse_autoencoder

Python 495 54 Updated Jul 19, 2024

mingze-yuan / Awesome-LLM-Healthcare

The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review".

246 25 Updated Dec 23, 2023

aiwaves-cn / agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,633 436 Updated Sep 26, 2024

RulinShao / retrieval-scaling

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Python 206 18 Updated Jun 5, 2025

logix-project / logix

AI Logging for Interpretability and Explainability🔬

Python 123 8 Updated Jun 7, 2024

TRAIS-Lab / dattri

`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.

Python 77 17 Updated Jun 9, 2025

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 1,953 197 Updated Jun 25, 2025

0