-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] CUDA OOM when DP attention enabled, maybe due to incorrect acceptable length estimation.
#6027
opened May 5, 2025 by
junliu-mde
5 tasks done
[Bug] Document is different with source of --reasoning-parser value
#6023
opened May 5, 2025 by
engchina
5 tasks done
Throughput Discrepancy Between SGLang Server Logs and LLMPerf Tool for Qwen3 Models
#6022
opened May 5, 2025 by
nskpro-cmd
Instruction for Running DeepSeek with Large-scale PD and EP
collaboration
deepseek
#6017
opened May 5, 2025 by
fzyzcjy
[Bug] AttributeError: 'MHATokenToKVPoolHost' object has no attribute 'start_layer' when using --enable-hierarchical-cache
#6005
opened May 4, 2025 by
Simon-Li
5 tasks done
[Bug] OutOfResources: out of resource: shared memory, Required: 196608, Hardware limit: 65536 when run deepseek-r1 on sglang 0.4.6 with AMD Mi308
#6001
opened May 4, 2025 by
GuoxiangZu
5 tasks done
[Bug] Tensor model parallel group is not initialized when deploying Qwen3-30B-A3B-AWQ
#6000
opened May 4, 2025 by
SecretSettler
5 tasks done
[Bug] importError: libharpcuda.so.0: cannot open shared object file: No such file or directory
#5999
opened May 3, 2025 by
auroraontheway
1 of 5 tasks
[Bug] out of resource: shared memory with 0.4.6 Qwen3-30B-A3B-FP8-Dynamic
#5995
opened May 3, 2025 by
bash99
5 tasks done
[Feature] high performance multi node custom all reduce
high priority
#5994
opened May 3, 2025 by
zhyncs
2 tasks
[Performance Tuning Help] Enabling DP Attention + JIT DeepGEMM + PyTorch Compile Underperforms Baseline
#5985
opened May 2, 2025 by
kimbochen
[Bug] Requests with logprobs throws Internal server 500 error
#5984
opened May 2, 2025 by
om6-prakash
2 of 5 tasks
[Feature] Support for
tool_calls
in Assistant Messages Input
#5983
opened May 2, 2025 by
kibitzing
2 tasks done
[document link is broken] from main page, click installation link, it is broken
#5982
opened May 2, 2025 by
gaowayne
[Bug] Qwen3 14B runs very slowly on a GPU with the SM75 architecture for inference.
#5978
opened May 2, 2025 by
maxin9966
5 tasks done
[Feature] microsoft/Phi-4-multimodal-instruct
help wanted
Extra attention is needed
high priority
microsoft
#5972
opened May 2, 2025 by
avinash31d
2 tasks done
[Feature] Support more multi-modal input for VLM
feature
#5964
opened May 2, 2025 by
JustinTong0323
2 tasks
[Feature] support abort ongoing request
high priority
RLHF
Using SGLang for post training
#5963
opened May 2, 2025 by
zhuzilin
2 tasks done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.