Insights: ray-project/kuberay
Overview
1 Release published by 1 person
-
v1.4.0 published
Jun 21, 2025
79 Pull requests merged by 19 people
-
Add RayCluster YAML for verl example
#3833 merged
Jun 25, 2025 -
[cherry-pick] Cherry-pick #3826 into release-1.4 branch
#3828 merged
Jun 24, 2025 -
Fix ray nightly image env var setup
#3826 merged
Jun 24, 2025 -
chore: remove unnecessary empty `rayStartParams`
#3586 merged
Jun 24, 2025 -
[Test][Release] Change upgrade test version to test upgrade from 1.3.2 to 1.4.0
#3825 merged
Jun 23, 2025 -
[Fix] changelog-generator.py failed to parse some commit messages
#3818 merged
Jun 21, 2025 -
[Fix][Release] Fix Krew release indenetation error
#3823 merged
Jun 21, 2025 -
[Chore] Remove CHANGELOG.md
#3819 merged
Jun 21, 2025 -
[Release] Update KubeRay version references for 1.4.0
#3816 merged
Jun 20, 2025 -
[cherry-pick] Cherry-pick #3809 into release-1.4 branch
#3814 merged
Jun 20, 2025 -
[cherry-pick] Cherry-pick #3804 into release-1.4 branch
#3813 merged
Jun 20, 2025 -
[kubeclt-plugin] fix get cluster all namespace
#3809 merged
Jun 20, 2025 -
[Docs] Add kubectl plugin create cluster sample yaml config files
#3804 merged
Jun 20, 2025 -
[Cherry-pick][Helm Chart] Set honorLabel of serviceMonitor to true (#3805)
#3810 merged
Jun 19, 2025 -
[Helm Chart] Set honorLabel of serviceMonitor to `true`
#3805 merged
Jun 19, 2025 -
[cherry-pick] Cherry-pick #3795 into release-1.4 branch
#3798 merged
Jun 18, 2025 -
[Metrics] Remove serviceMonitor.yaml
#3795 merged
Jun 18, 2025 -
[cherry-pick] Cherry-pick #3796 into release-1.4 branch
#3797 merged
Jun 18, 2025 -
[Chore][Sample-yaml] Upgrade pytorch-lightning to 1.8.5 for `ray-job.pytorch-distributed-training.yaml`
#3796 merged
Jun 18, 2025 -
[RayJob] Support deletion policies based on job status
#3731 merged
Jun 17, 2025 -
Use ImplementationSpecific in ray-cluster.separate-ingress.yaml (#3781)
#3790 merged
Jun 17, 2025 -
Use ImplementationSpecific in ray-cluster.separate-ingress.yaml
#3781 merged
Jun 17, 2025 -
[cherry-pick] Cherry-pick #3786 into release-1.4 branch
#3789 merged
Jun 17, 2025 -
Remove vLLM examples in favor of Ray Serve LLM
#3786 merged
Jun 17, 2025 -
[cherry-pick] Cherry-pick #3782 into release-1.4 branch
#3788 merged
Jun 17, 2025 -
[cherry-pick] Cherry-pick #3779 into release-1.4 branch
#3787 merged
Jun 17, 2025 -
Update update-ray-job.kueue-toy-sample.yaml
#3782 merged
Jun 16, 2025 -
[Feat] Add e2e test for applying `ray-job.interactive-mode.yaml`
#3779 merged
Jun 16, 2025 -
[Release] Update KubeRay version references for 1.4.0-rc.2
#3784 merged
Jun 15, 2025 -
[cherry-pick] Cherry-pick #3780 into release-1.4 branch
#3783 merged
Jun 15, 2025 -
[Doc][Fix] correct the indention of storageClass in ray-cluster.persistent-redis.yaml
#3780 merged
Jun 15, 2025 -
[cherry-pick][doc] Improve APIServer v2 doc (#3773)
#3777 merged
Jun 13, 2025 -
[doc] Improve APIServer v2 doc
#3773 merged
Jun 13, 2025 -
[Doc] Reference helm chart version in `helm-chart/kuberay-operator/README.md.gotmpl` with go template
#3763 merged
Jun 13, 2025 -
[Release] Reset ray-operator version in root go.mod to v0.0.0
#3774 merged
Jun 13, 2025 -
[cherry-pick] Cherry-pick #3771 into release-1.4
#3772 merged
Jun 13, 2025 -
Revert "Fix issue where unescaped semicolons caused task execution failures. (#3691)"
#3771 merged
Jun 13, 2025 -
[cherry-pick] support scheduler plugins (#3612)
#3766 merged
Jun 13, 2025 -
[cherry-pick] Added Ray-Serve Config For LLMs (#3517)
#3767 merged
Jun 13, 2025 -
[DOCS] KubeRay APIServer V2 document
#3594 merged
Jun 12, 2025 -
support scheduler plugins
#3612 merged
Jun 12, 2025 -
fix ray-service.different-port.yaml
#3721 merged
Jun 11, 2025 -
Remove `ray-pod.tls.yaml`
#3762 merged
Jun 10, 2025 -
[doc] Update GitHub pages's home page
#3761 merged
Jun 10, 2025 -
[doc] Remove the HA document in favor of the Ray doc
#3760 merged
Jun 10, 2025 -
[Release] Fix helm chart tag missing "v" prefix and release rc1
#3757 merged
Jun 10, 2025 -
Added Ray-Serve Config For LLMs
#3517 merged
Jun 9, 2025 -
[Release] Update KubeRay version references for 1.4.0-rc.0
#3698 merged
Jun 9, 2025 -
Improve Grafana Dashboard
#3734 merged
Jun 7, 2025 -
[Fix][CI] Fix ray operator image build error by setting up docker buildx
#3750 merged
Jun 6, 2025 -
[Test][Autoscaler] deflaky unexpected dead actors in tests by setting max_restarts=-1
#3700 merged
Jun 6, 2025 -
add go.mod for operator
#3735 merged
Jun 5, 2025 -
[fix][operator] RayJob.Status.RayJobStatusInfo.EndTime nil deref error
#3742 merged
Jun 4, 2025 -
[operator] fix TPU multi-host RayJob and RayCluster samples
#3733 merged
Jun 3, 2025 -
[chore] upgrade Ray to 2.46.0 in remaining places
#3724 merged
Jun 3, 2025 -
chore: run yamlft pre-commit hook
#3729 merged
Jun 2, 2025 -
[Grafana] Update Grafana dashboard
#3726 merged
Jun 1, 2025 -
[Test][Autoscaler] deflaky autoscaler idle timeout e2e tests by a longer timeout
#3727 merged
Jun 1, 2025 -
[Chore] Upgrade Ray to 2.46.0 follow-up
#3722 merged
Jun 1, 2025 -
[doc] Update API server v1 doc
#3723 merged
Jun 1, 2025 -
feat: upgrade to Ray 2.46.0
#3547 merged
Jun 1, 2025 -
[Test][Autoscaler] deflaky unexpected dead actors in tests by higher resource requests
#3707 merged
Jun 1, 2025 -
[Doc] add ray cluster uv sample yaml
#3720 merged
Jun 1, 2025 -
[apiserver] Use ClusterIP instead of NodePort for KubeRay API server service
#3708 merged
Jun 1, 2025 -
Bump next from 15.2.3 to 15.2.4 in /dashboard
#3709 merged
May 30, 2025 -
[Feat][apiserver] Support CORS config
#3711 merged
May 30, 2025 -
Add kuberay operator servicemonitor
#3717 merged
May 30, 2025 -
[CI] Split Autoscaler e2e tests into 2 buildkite runners
#3715 merged
May 30, 2025 -
Add Grafana Dashboard for KubeRay Operator
#3676 merged
May 29, 2025 -
[Fix][Release] Fix KubeRay dahsboard image build pipeline
#3702 merged
May 29, 2025 -
Fix issue where unescaped semicolons caused task execution failures.
#3691 merged
May 28, 2025 -
[refactor] Refactor enable login shell
#3704 merged
May 28, 2025 -
[chore] Update user to `kuberay` instead of a contributor's name
#3706 merged
May 28, 2025 -
[DOCS] Apiserver improve docs readability
#3564 merged
May 28, 2025 -
[Ray-operator] Feature flag login bash
#3679 merged
May 28, 2025 -
[Grafana] Add flag for enabling auto load dashboards
#3689 merged
May 28, 2025 -
[Doc] Fix broken link in documentation
#3697 merged
May 27, 2025 -
[kubectl-plugin] Generate `submission_id` in `job_submit.go`
#3693 merged
May 27, 2025 -
[Doc] Update README
#3695 merged
May 26, 2025
23 Pull requests opened by 17 people
-
Add default Ray node label info to Ray Pod environment
#3699 opened
May 27, 2025 -
Add priorityClassName for kuberay-operator helm chart
#3703 opened
May 28, 2025 -
[Helm] Refactor kuberay-operator chart
#3716 opened
May 30, 2025 -
Remove fmt.Println, convert to log
#3718 opened
May 30, 2025 -
[Refactor][kubectl-plugin] Share common struct for cluster-related CLI options to reduce duplication
#3719 opened
May 30, 2025 -
[Test][Autoscaler] deflaky unexpected dead actors in tests by more resources
#3728 opened
Jun 1, 2025 -
test: enable upgrade to image built from source
#3736 opened
Jun 3, 2025 -
[RayService][Test] create curl pod waiting until running
#3740 opened
Jun 3, 2025 -
[CI][precommit] Fix instruction command in validate helm hook
#3747 opened
Jun 5, 2025 -
[feat][CRD] allow RayJob to have RayCluster template
#3753 opened
Jun 6, 2025 -
[kubectl-plugin] Use a more Golang-native approach to retrieve the CR status for testing
#3775 opened
Jun 13, 2025 -
pass client when call batchscheduler.New()
#3785 opened
Jun 16, 2025 -
[Test][Autoscaler] add fake single-host TPU tests
#3792 opened
Jun 17, 2025 -
chore: reduce memory allocation on handling http response
#3800 opened
Jun 18, 2025 -
[Bug] Add default value for entrypoint flags in job_submit.go
#3808 opened
Jun 19, 2025 -
[Fix] kubectl ray create cluster config file CPU overwrites the whole resource requests and limits
#3811 opened
Jun 19, 2025 -
[apiserver] Add migration doc from v1 to v2
#3812 opened
Jun 20, 2025 -
[kubeclt-plugin] use solid value as default value in get and create
#3815 opened
Jun 20, 2025 -
[kubectl-plugin] Remove ephemeral storage check
#3821 opened
Jun 21, 2025 -
[feat][python-client]: add rayjob support to kuberay python-client
#3830 opened
Jun 24, 2025 -
[feat][operator] validate Ray resource metadata in webhook
#3831 opened
Jun 24, 2025 -
Use Go 1.24.0 in go module
#3835 opened
Jun 26, 2025 -
Feature/cron scheduling rayjob 2426
#3836 opened
Jun 26, 2025
25 Issues closed by 7 people
-
[Bug] Exiting because this node manager has mistakenly been marked as dead by the GCS
#3827 closed
Jun 24, 2025 -
[release] Update upgrade test during release process
#3824 closed
Jun 23, 2025 -
[Bug] kubectl-plugin get cluster without all-namespace still shows all-namespace.
#3806 closed
Jun 20, 2025 -
[Feature] Fix dependency upgrade for structured-merge-diff
#3542 closed
Jun 19, 2025 -
[Feat] Add e2e test for applying `ray-job.interactive-mode.yaml`
#3778 closed
Jun 16, 2025 -
[Bug] `ray-job.use-existing-raycluster.yaml` entrypoint error
#3764 closed
Jun 13, 2025 -
[Feature] Support PodGroup API
#3611 closed
Jun 12, 2025 -
[Bug] Minor inconsistency in RayJob submitter retries in v1.3
#3211 closed
Jun 12, 2025 -
[Feature] Improve Grafana dashboard for kuberay operator
#3710 closed
Jun 10, 2025 -
[Bug][CI] Multi-platform build fails with docker driver in GitHub Actions
#3568 closed
Jun 6, 2025 -
[CI] Deflaky Autoscaler V2 e2e tests
#3701 closed
Jun 6, 2025 -
[CI] Add a `go.mod` for `ray-operator`
#3730 closed
Jun 5, 2025 -
[Bug] Allow deleting pods in Role generated by rbac.go, to let autoscaler free up pod resources
#3737 closed
Jun 5, 2025 -
[Bug] Ray command not found when installed using uv.
#3247 closed
Jun 1, 2025 -
Use uv in sample YAML files
#3039 closed
Jun 1, 2025 -
[apiserver] Use ClusterIP instead of NodePort for KubeRay API server service
#3705 closed
Jun 1, 2025 -
[SLI Metric] onboarding experience study
#3622 closed
May 31, 2025 -
[Grafana] Auto load dashboard json when using kuberay metrics
#3650 closed
May 31, 2025 -
[Epic][Feature] KubeRay v1.4.0 - Operator SLI Tracking
#3171 closed
May 31, 2025 -
[Feature] Add kuberay operator serviceMonitor
#3207 closed
May 30, 2025 -
[CI][release] Verify KubeRay dashboard image release process
#3588 closed
May 29, 2025 -
[Bug] kubectl ray job submit command shadows the error of ray job sumit
#3675 closed
May 27, 2025 -
[kubectl-plugin] Handle multiple jobs in `ray job list`
#3624 closed
May 27, 2025
34 Issues opened by 20 people
-
Avoid requiring specific Go patch version in go module
#3834 opened
Jun 26, 2025 -
[Feature] [kubectl-plugin] Improve support for autoscaling clusters
#3832 opened
Jun 24, 2025 -
[Feature] Add RayJob support to Kuberay python-client
#3829 opened
Jun 24, 2025 -
KubeRay v1.4.0 default non-login BASH shell issue tracking
#3822 opened
Jun 21, 2025 -
[Feature] Add prometheus metrics reset support
#3820 opened
Jun 21, 2025 -
[Doc] Update RayJob Quick Start Job When V1.5 Release
#3817 opened
Jun 20, 2025 -
[Bug] Wrong default value for entrypoint flags in job_submit.go
#3807 opened
Jun 19, 2025 -
[Bug] kubectl ray create cluster config file CPU overwrites the whole resource requests and limits
#3803 opened
Jun 19, 2025 -
[Bug] Wrong default value for head and worker ray start params in create_cluster.go
#3801 opened
Jun 18, 2025 -
[Bug] RayCluster unavailable due to health probe failure
#3799 opened
Jun 18, 2025 -
Question: Autoscaler v1 vs v2 configuration and performance
#3794 opened
Jun 17, 2025 -
[Feature] Support `runtimeClassName` in values.yaml
#3793 opened
Jun 17, 2025 -
[Bug] why readinessProbe port 8000 check
#3791 opened
Jun 17, 2025 -
[Feature] gcsFaultToleranceOptions spec support in RayCluster Helm Chart
#3776 opened
Jun 13, 2025 -
[Umbrella] Scheduler plugins
#3770 opened
Jun 12, 2025 -
[scheduler-plugins] Support second scheduler mode
#3769 opened
Jun 12, 2025 -
[scheduler-plugins] Take multi-host into consideration when creating PodGroup
#3768 opened
Jun 12, 2025 -
[Feature] Document and provide a easy way to detect CRD / KubeRay mismatch
#3765 opened
Jun 12, 2025 -
[Bug] Custom resources are badly parsed and show error on
#3756 opened
Jun 9, 2025 -
[Feature] Include CR UID in kuberay metrics
#3754 opened
Jun 7, 2025 -
[Umbrella] Remove the max_restart in autoscaler e2e tests
#3752 opened
Jun 6, 2025 -
[Feature][kuberay-operator][Helm] Make reconcile concurrency configurable
#3751 opened
Jun 6, 2025 -
[Bug] Remove max_restarts=-1 from detached actor for autoscaler e2e tests
#3748 opened
Jun 6, 2025 -
[scheduler-plugins] Kuberay should pass client when call batchscheduler.New()
#3746 opened
Jun 5, 2025 -
[CI] kubectl plugin CI is flaky
#3743 opened
Jun 4, 2025 -
[Feature] Explore the integration with DraNet
#3739 opened
Jun 3, 2025 -
[Bug] [Autoscaler-v2] Workers pods don't completely scale-down
#3738 opened
Jun 3, 2025 -
[doc] Update uv feature flag description after 2.47 is released
#3732 opened
Jun 2, 2025 -
[Feature] Update RayJob DeletionPolicy API to differentiate success/failure scenarios
#3714 opened
May 29, 2025 -
[Feature][Helm] Add missing values used in kuberay-operator helm templates
#3712 opened
May 29, 2025 -
[Serve] RayServe Pods Stuck in Unready State Causing API Outages
#3696 opened
May 27, 2025
24 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[RayService] Support Incremental Zero-Downtime Upgrades
#3166 commented on
Jun 5, 2025 • 16 new comments -
[Feature][APIServer] add retry for http client
#3551 commented on
Jun 8, 2025 • 15 new comments -
Refactor Apiserver e2e run in cluster
#3529 commented on
May 29, 2025 • 2 new comments -
Support custom resource configuration for the submit pod
#3690 commented on
May 27, 2025 • 0 new comments -
[Feature] Add ValidateRayClusterSpec to Webhook
#2739 commented on
Jun 26, 2025 • 0 new comments -
[RayJob][Feature] add light weight job submitter in kuberay image
#2587 commented on
Jun 26, 2025 • 0 new comments -
Add Fake TPU e2e Autoscaling Test Cases
#2279 commented on
Jun 18, 2025 • 0 new comments -
[Feature] Support cron scheduling for RayJob
#2426 commented on
Jun 26, 2025 • 0 new comments -
[Umbrella] Autoscaler improvements
#2600 commented on
Jun 25, 2025 • 0 new comments -
[Feature] Support to generate PersistentVolumeClaim for Pod
#59 commented on
Jun 24, 2025 • 0 new comments -
[Bug] RayCluster K8s event for creating worker Pods
#3056 commented on
Jun 21, 2025 • 0 new comments -
[Feature] Zero downtime upgrade for long-running requests.
#3602 commented on
Jun 21, 2025 • 0 new comments -
Follow up of #3202: update doc
#3580 commented on
Jun 20, 2025 • 0 new comments -
[Feature] [apiserver] Provide migration doc from v1 to v2
#3607 commented on
Jun 19, 2025 • 0 new comments -
[Feature] [apiserver] Add timeout and retry for apiserver v2
#3606 commented on
Jun 16, 2025 • 0 new comments -
[Feature] Provide a better experience to manage the KubeRay with ArgoCD and GitOps
#3659 commented on
Jun 16, 2025 • 0 new comments -
[Roadmap] KubeRay (or anything for Ray on K8s) v1.4.0 Wishlist
#2999 commented on
Jun 13, 2025 • 0 new comments -
[Feature][RayService] Handle serve deployment delete during the cluster destroy.
#647 commented on
Jun 8, 2025 • 0 new comments -
[Feature] Support networking.k8s.io.IngressSpec as ingress config for ray head
#1475 commented on
Jun 5, 2025 • 0 new comments -
[Feature] Support all HTTP request in apiserver V2
#3496 commented on
May 31, 2025 • 0 new comments -
[Umbrella] Ray Autoscaling tests
#2173 commented on
May 31, 2025 • 0 new comments -
[Umbrella] Document the breaking changes for CR names in v1.4
#3271 commented on
May 31, 2025 • 0 new comments -
[Bug] KubeRay Operator pod fails to start when using --enable-metrics with helm chart v1.3.2
#3657 commented on
May 29, 2025 • 0 new comments -
[Feature] Support node selector for running kuberay-operator via helm chart
#3625 commented on
May 28, 2025 • 0 new comments