-
-
Notifications
You must be signed in to change notification settings - Fork 25
Insights: InftyAI/llmaz
Overview
Could not load contribution data
Please try again later
11 Pull requests merged by 4 people
-
feat: add TensorRT-LLM as backend
#392 merged
May 6, 2025 -
Polish website
#399 merged
May 3, 2025 -
Update logo
#398 merged
May 3, 2025 -
fix README.md link
#397 merged
May 3, 2025 -
fix: doc links and makefile
#396 merged
May 2, 2025 -
fix build error
#395 merged
May 1, 2025 -
Update the helm install command
#391 merged
May 1, 2025 -
Remove all files
#394 merged
May 1, 2025 -
Cleanup for website deploy
#393 merged
May 1, 2025 -
Fix diagram links
#390 merged
May 1, 2025 -
Initialize documentation site
#388 merged
May 1, 2025
2 Pull requests opened by 2 people
-
init LOADER_IMAGE with env variable MODEL_LOADER_IMAGE
#389 opened
Apr 30, 2025 -
feat: support speculative decoding for llamacpp
#402 opened
May 6, 2025
3 Issues closed by 1 person
-
Add TensorRT-LLM support as another backend
#205 closed
May 6, 2025 -
Support speculative decoding with llama.cpp
#197 closed
May 5, 2025 -
Add doc website
#355 closed
May 1, 2025
3 Issues opened by 1 person
-
Add T/$ as indicator to measure the cost efficiency
#401 opened
May 6, 2025 -
Add blog section to website navigate bar
#400 opened
May 3, 2025 -
Manage the ai gateway resources automatically
#387 opened
Apr 30, 2025
3 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Support speculative decoding with llama.cpp
#240 commented on
May 2, 2025 • 0 new comments -
Support different GPU accelerators for fungibility
#62 commented on
May 6, 2025 • 0 new comments -
[Umbrella] advanced traffic load balancing algorithms
#376 commented on
May 6, 2025 • 0 new comments