Pulse · Lightning-AI/litgpt

May 18, 2025 – May 25, 2025

11 Active pull requests
- 11 Merged pull requests
- 0 Open pull requests

2 Active issues
- 2 Closed issues
- 0 New issues

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

[WIP] Simplified preparation of pretraining datasets
#1057 commented on May 23, 2025 • 0 new comments
Add LongLora for both full and lora fine-tuning
#1350 commented on May 23, 2025 • 0 new comments
WIP: TensorParallel with new strategy
#1421 commented on May 23, 2025 • 0 new comments
OpenCoder series
#1880 commented on May 23, 2025 • 0 new comments
OLMo 2
#1897 commented on May 23, 2025 • 0 new comments
Raise error if disk is full before downloading weights
#1903 commented on May 23, 2025 • 0 new comments
qwen2.5 long context
#1933 commented on May 23, 2025 • 0 new comments
Support for KV caching and batched inference
#1934 commented on May 23, 2025 • 0 new comments
Add Multi-head Latent Attention (DeepSeekv2)
#1945 commented on May 23, 2025 • 0 new comments
wandb logger args
#1973 commented on May 24, 2025 • 0 new comments
(WIP) DeepseekV3 (and Multi-Head Latent Attention)
#2012 commented on May 23, 2025 • 0 new comments
LLaMAMoE fixes
#2014 commented on May 23, 2025 • 0 new comments
Qwen3 Dense
#2044 commented on May 24, 2025 • 0 new comments
Qwen3 MoE Preliminary: add intermediate_size argument to MLP modules
#2046 commented on May 23, 2025 • 0 new comments
phi-4 reasoning models
#2047 commented on May 23, 2025 • 0 new comments