-
-
Notifications
You must be signed in to change notification settings - Fork 32
Insights: InftyAI/llmaz
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
v0.1.4
published
Jun 10, 2025
28 Pull requests merged by 8 people
-
build(deps): bump the kubernetes group with 5 updates
#468 merged
Jun 25, 2025 -
add more integration test for model webhook(set custom createdAt)
#467 merged
Jun 21, 2025 -
chore: update golangci-lint to 1.64.8 for go 1.24
#466 merged
Jun 19, 2025 -
Polish the architecture of metrics aggregator
#465 merged
Jun 19, 2025 -
feature: add some field for OpenModel Playground Service
#464 merged
Jun 19, 2025 -
Update Karpenter image repository
#461 merged
Jun 18, 2025 -
Update lws dependency to 0.6.2
#459 merged
Jun 17, 2025 -
Add Discord
#460 merged
Jun 17, 2025 -
Update ci workflow to 0.1.18, which go version is 1.24
#457 merged
Jun 16, 2025 -
add miss integration test case for tensorrt-llm backend
#455 merged
Jun 16, 2025 -
Add reviewers
#454 merged
Jun 13, 2025 -
feat: add ownedBy and createdAt for OpenModel
#438 merged
Jun 13, 2025 -
fix: add ut for modelhub.
#434 merged
Jun 13, 2025 -
Release v0.1.4
#450 merged
Jun 10, 2025 -
Update documentation
#449 merged
Jun 10, 2025 -
Add Karpenter integration docs
#448 merged
Jun 10, 2025 -
Proposal for karpenter intergation
#439 merged
Jun 10, 2025 -
Add inftyai-scheduler support and config updates
#447 merged
Jun 10, 2025 -
feat: support runai streamer for vllm
#423 merged
Jun 10, 2025 -
Update helm chart
#444 merged
Jun 7, 2025 -
Update slack link
#442 merged
Jun 6, 2025 -
fix logo url
#441 merged
Jun 5, 2025 -
Add dispatcher & memoryStore & latencyAwarePlugin
#440 merged
Jun 4, 2025 -
chore: fix generate-apiref in Makefile
#437 merged
Jun 3, 2025 -
Update Installation Path in README.md
#436 merged
Jun 2, 2025 -
fix: add ut for backend runtime.
#428 merged
Jun 2, 2025 -
Add global configmap
#431 merged
Jun 2, 2025 -
Add ci test with helm chart
#432 merged
Jun 2, 2025
3 Pull requests opened by 2 people
-
feat: support configuring init container image
#443 opened
Jun 6, 2025 -
feature: use watch instead of client get in Reconciler
#452 opened
Jun 12, 2025 -
feat: add more case in playground webhook
#470 opened
Jun 27, 2025
12 Issues closed by 1 person
-
Upadate lws to v0.6.2
#458 closed
Jun 17, 2025 -
Reviewers Nomination
#453 closed
Jun 13, 2025 -
Add fields ownedBy and createdAt for OpenModel
#435 closed
Jun 13, 2025 -
Release v0.1.4
#445 closed
Jun 10, 2025 -
Support scaling with Spot instances for cost saving
#106 closed
Jun 10, 2025 -
Support runai model streamer for fast model loading
#352 closed
Jun 10, 2025 -
[Umbrella] inference engine metrics installation
#375 closed
Jun 8, 2025 -
[Umbrella] Metrics Aggregator Implementation
#421 closed
Jun 4, 2025 -
Support splitwise with multiModelsClaims
#15 closed
Jun 2, 2025 -
e2e test with ai gateway enabled
#430 closed
Jun 2, 2025 -
[Umbrella] grafana support with inference engines
#385 closed
May 28, 2025 -
Support different GPU accelerators for fungibility
#62 closed
May 28, 2025
5 Issues opened by 4 people
-
feat: Add priority field to Flavor
#469 opened
Jun 23, 2025 -
Support Envoy AI gateway v0.2.0
#463 opened
Jun 17, 2025 -
`golangci-lint` v1.63.4 failed to load config with Go 1.24
#462 opened
Jun 17, 2025 -
Add adopters list and blog/video list page
#451 opened
Jun 12, 2025 -
Milestone v0.3.0
#433 opened
Jun 2, 2025
7 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Introduce inference reserve config for standby instance
#265 commented on
May 28, 2025 • 0 new comments -
Add popular open source models as in-tree support
#268 commented on
May 28, 2025 • 0 new comments -
Enable envoy token rate limiting by configuration
#412 commented on
Jun 3, 2025 • 0 new comments -
Can this initcontainer image be configurable?
#350 commented on
Jun 5, 2025 • 0 new comments -
Lora multiplexing support
#27 commented on
Jun 8, 2025 • 0 new comments -
Milestone v0.2.0
#259 commented on
Jun 10, 2025 • 0 new comments -
Enabling Efficient Model and Container Image Distribution in LLMaz with Dragonfly
#361 commented on
Jun 17, 2025 • 0 new comments