1:1 job to topology assignment #27

ahg-g · 2023-04-14T21:28:22Z

Consider the case where in a replicatedJob each job needs to land on a separate topology, for example one per network domain, like a rack.

This can be achieved using a combination of affinity and anti-affinity constraints on the pod template:

      affinity:
        podAffinity:  # ensures the pods of this job land on the same rack
          requiredDuringSchedulingIgnoredDuringExecution:
          - labelSelector:
              matchExpressions:
              - key: job-name
                operator: In
                values:
                - my-job
              topologyKey: rack
        podAntiAffinity: # ensures only this job lands on the rack
          requiredDuringSchedulingIgnoredDuringExecution:
          - labelSelector:
              matchExpressions:
              - key: job-name
                operator: NotIn
                values:
                - my-job
              - key: job-name
                operator: Exists
            namespaceSelector: {}
            topologyKey: rack

The problem is that the affinity terms above require setting the value of the job-name label explicitly, which in a replicatedJob is different for each job replica.

The MatchLabelKeys enhancement in upstream k8s should address this problem, but it is not yet available.

Until this is available, we can have this injected by JobSet, triggered by an api we add to ReplicatedJob type that look like :

exclusive
  topologyKey: rack

The text was updated successfully, but these errors were encountered:

danielvegamyhre · 2023-04-14T21:57:19Z

/assign

ahg-g mentioned this issue Apr 14, 2023

Introduce MatchLabelKeys to Pod Affinity and Pod Anti Affinity kubernetes/enhancements#3633

Open

8 tasks

k8s-ci-robot assigned danielvegamyhre Apr 14, 2023

danielvegamyhre mentioned this issue Apr 17, 2023

Add support for 1:1 job per topology assignment #36

Merged

k8s-ci-robot closed this as completed in #36 Apr 18, 2023

ahg-g mentioned this issue Apr 26, 2023

A different API to express colocation and exclusiveness #75

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1:1 job to topology assignment #27

1:1 job to topology assignment #27

1:1 job to topology assignment #27

1:1 job to topology assignment #27

Comments