- Dartmouth College
- Hanover, NH
- Time zone: UTC -12:00
- https://yefanzhou.github.io/
-
cs-paper-checklist Public
Forked from yzhao062/cs-paper-checklist
A final sanity checklist to help your CS paper get accepted, not desk rejected.
MIT License · Updated May 7, 2025
-
GRPO-Zero Public
Forked from policy-gradient/GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
Python · Apache License 2.0 · Updated Apr 18, 2025
-
TempBalance Public
[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
-
Awesome-Efficient-LLM Public
Forked from horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
-
Autonomous-Agents Public
Forked from tmgthb/Autonomous-Agents
Autonomous Agents (LLMs) research papers. Updated Daily.
MIT License · Updated Jan 26, 2025
-
id-llm-abstraction Public
Forked from chengemily1/id-llm-abstraction
Code for ICLR 2025 paper "Emergence of a High-Dimensional Abstraction Phase in Language Transformers"
-
Awesome-Code-LLM Public
Forked from codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
Updated Jan 17, 2025
-
llm-reasoners Public
Forked from maitrix-org/llm-reasoners
A library for advanced large language model reasoning
Python · Apache License 2.0 · Updated Jan 10, 2025
-
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Jupyter Notebook · Other · Updated Jan 8, 2025
-
YefanZhou.github.io Public
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
HTML · Updated Dec 8, 2024
-
LightZero Public
Forked from opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Python · Apache License 2.0 · Updated Oct 30, 2024
-
rl4co Public
Forked from ai4co/rl4co
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
Python · MIT License · Updated Sep 18, 2024
-
early-exit-papers Public
Forked from falcon-xu/early-exit-papers
A curated list of early exiting (LLM, CV, NLP, etc)
MIT License · Updated Aug 21, 2024
-
nanoGPT Public
Forked from karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python · MIT License · Updated Aug 19, 2024
-
attention-learn-to-route Public
Forked from wouterkool/attention-learn-to-route
Attention based model for learning to solve different routing problems
Jupyter Notebook · MIT License · Updated Aug 4, 2024
-
ModelDiagnosis Public
[ICML 2024] MD tree: a model-diagnostic tree grown on loss landscape
-
in-context-learning Public
Forked from dtsip/in-context-learning
Jupyter Notebook · MIT License · Updated May 10, 2024
-
zenodo-upload Public
Forked from jhpoelen/zenodo-upload
upload big files to Zenodo using cURL, jq and bash
Shell · MIT License · Updated Feb 23, 2024
-
awesome-llm-interpretability Public
Forked from JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
Updated Feb 13, 2024
-
ThreeRegimePruning Public
[ICML 2023] Code for "A Three-regime Model of Network Pruning"
-
peft Public
Forked from huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python · Apache License 2.0 · Updated Sep 18, 2023
-
LLM-Adapters Public
Forked from AGI-Edgerunners/LLM-Adapters
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Python · Apache License 2.0 · Updated Jul 16, 2023
-
deep-learning-dynamics-paper-list Public
Forked from xie-lab-ml/deep-learning-dynamics-paper-list
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …
MIT License · Updated May 2, 2023
-
Awesome-ScalingLaws Public
Forked from RZFan525/Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
Updated Apr 10, 2023
-
pytorch-cifar100 Public
Forked from weiaicunzai/pytorch-cifar100
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…
Python · Updated Nov 6, 2022