-
Notifications
You must be signed in to change notification settings - Fork 410
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SimpleFSDP] Add support for hsdp+tp
CLA Signed
This label is managed by the Meta Open Source bot.
#1343
opened Jun 26, 2025 by
ruisizhang123
Loading…
Only calls destroy_process_group if the trainer exist successfully
CLA Signed
This label is managed by the Meta Open Source bot.
#1342
opened Jun 26, 2025 by
fegin
Loading…
[DSV3] Apply TP on DSV3
CLA Signed
This label is managed by the Meta Open Source bot.
#1341
opened Jun 26, 2025 by
wwwjn
Loading…
Always ignore freqs_cis
CLA Signed
This label is managed by the Meta Open Source bot.
#1338
opened Jun 25, 2025 by
fegin
Loading…
missing dependency in pyproject for tyro
CLA Signed
This label is managed by the Meta Open Source bot.
#1335
opened Jun 24, 2025 by
wesleytruong
Loading…
[WIP] Refactor Tokenizer -> BaseTokenizer
CLA Signed
This label is managed by the Meta Open Source bot.
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1327
opened Jun 22, 2025 by
lessw2020
Loading…
Support finetuning from a pretrained model
CLA Signed
This label is managed by the Meta Open Source bot.
#1321
opened Jun 20, 2025 by
vwxyzjn
Loading…
[float8] add _auto_filter_for_recipe for float8 training
CLA Signed
This label is managed by the Meta Open Source bot.
#1319
opened Jun 18, 2025 by
danielvegamyhre
Loading…
Support different tokenizers
CLA Signed
This label is managed by the Meta Open Source bot.
#1318
opened Jun 18, 2025 by
H-Huang
Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
This label is managed by the Meta Open Source bot.
#1317
opened Jun 18, 2025 by
vkuzo
Loading…
Do not submit: Multinode training seems to be working
CLA Signed
This label is managed by the Meta Open Source bot.
#1314
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
This label is managed by the Meta Open Source bot.
#1313
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
This label is managed by the Meta Open Source bot.
#1304
opened Jun 16, 2025 by
hann-wang
Loading…
Finetune from pre-trained models
CLA Signed
This label is managed by the Meta Open Source bot.
#1300
opened Jun 15, 2025 by
vwxyzjn
Loading…
[not for land] Use new AC
CLA Signed
This label is managed by the Meta Open Source bot.
#1294
opened Jun 13, 2025 by
soulitzer
Loading…
WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1288
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
Titan changes to use DCP ZOC instead of titan default Async + Pinned Memory
CLA Signed
This label is managed by the Meta Open Source bot.
#1287
opened Jun 12, 2025 by
Saiteja64
Loading…
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1286
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1274
opened Jun 8, 2025 by
lessw2020
Loading…
[llama4] enable expert parallel on the same device mesh as tp (tp2ep)
CLA Signed
This label is managed by the Meta Open Source bot.
#1269
opened Jun 6, 2025 by
hann-wang
Loading…
Enable ROCm CI support.
ciflow/rocm
CLA Signed
This label is managed by the Meta Open Source bot.
module: rocm
#1260
opened Jun 4, 2025 by
akashveramd
•
Draft
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass
CLA Signed
This label is managed by the Meta Open Source bot.
#1256
opened Jun 3, 2025 by
lessw2020
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.