propose a way to get service account token for csi driver #1855
Conversation
Hi @zshihang. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with `/ok-to-test` on its own line. Once the patch is verified, the new status will be reflected by the `ok-to-test` label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@zshihang: GitHub didn't allow me to request PR reviews from the following users: seh, kfox1111. Note that only kubernetes members and repo collaborators can review this PR, and authors cannot review their own PRs. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
> 4. The token is not guaranteed to be available (e.g.
>    `automountServiceAccountToken=false`).
>
> ### User stories
AWS would like this for granting projected tokens to our EFS CSI driver so the driver can get a pod's AWS credentials (https://github.com/kubernetes-sigs/aws-efs-csi-driver & aws/efs-utils#60)
added.
@micahhausler can you clarify if efs driver would need this at CreateVolume time, or is NodePublish sufficient?
NodePublish
> 5. Every 0.1 second, the reconciler component of volume manager will remount
>    the volume in case the vault secrets expire and re-login is required.
What if one of these remounts (NodePublish?) fails? What if kubelet is not able to refresh a token after expiration? There is no facility in kubelet to kill the affected pod; do you plan to add one?
(Starting separate "thread")
Does this mean that kubelet must cache "vault tokens" of all running pods and their lifetime, to be able to refresh a token when it expires?
> What if one of these remounts (NodePublish?) fails?

return errors? do you have any suggestion?

> What if kubelet is not able to refresh a token after expiration?

the token will be refreshed regularly before expiration.

> Does this mean that kubelet must cache "vault tokens" of all running pods and their lifetime, to be able to refresh a token when it expires?

the token is already being cached and refreshed before expiration.
Returning errors only produces events to users. A pod keeps running, possibly with expired / stale data, without really knowing about it. Is that acceptable? Or are there cases when a pod that uses a secret must be killed when it loses access to the secret? In that case you should add a note here, plus code to kill such pods.
looking at the code, the pod will not be killed if the first volume mount fails. i expect the remounts will adopt the same behavior. @seh what will happen if the mounted secrets expire? will the secrets be deleted so that the users can be notified?
> killing the container if they ever get stale

if the secret can ever get stale, the secrets provider (e.g. vault) should revoke those secrets, rather than relying on the secrets consumers (e.g. k8s). we cannot rely on the credential consumers not to use expired secrets; e.g. the pod may cache the secret in memory even if the secret on disk is replaced with a fresh one.
Those are fine ideas, but that's not how all the secret engines in Vault work.
Consider an administrator who stores an API key in a Vault K/V secret. Perhaps that administrator usually rotates the key about every other month, leaving a couple of weeks of overlap before revoking the previous key. She might set an advisory TTL on the secret read from Vault for one week, advising consuming applications that they should probably check for a fresh key every week or so. An old key might still work for a long while after that week.
going back to the original question on what to do when the remount fails: i would still like to keep the current behavior where the error is only logged (a small sketch of this behavior follows the list), because:
- stale secrets might still work, and it is not security-critical for the consumer of credentials (k8s) to stop using them.
- the repeated remount will overcome transient errors.
- there is no guarantee that killing the container/pod will solve fatal errors.
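A minimal sketch of that log-and-retry loop, assuming a hypothetical `remount` callback standing in for the volume manager's actual NodePublishVolume plumbing:

```go
package main

import (
	"context"
	"log"
	"time"
)

// reconcileLoop retries the remount on every tick; failures are only logged,
// so the pod keeps running with its (possibly stale) previously mounted data.
func reconcileLoop(ctx context.Context, remount func() error) {
	ticker := time.NewTicker(100 * time.Millisecond) // the 0.1s reconciler period
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			if err := remount(); err != nil {
				// Transient errors are expected to be overcome by later retries.
				log.Printf("remount failed, will retry: %v", err)
			}
		}
	}
}

func main() {
	ctx, cancel := context.WithTimeout(context.Background(), time.Second)
	defer cancel()
	reconcileLoop(ctx, func() error { return nil }) // stub remount
}
```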
A lot of good discussion here. Can we capture all this somewhere in the KEP?
done.
> `RequiresRemount` is optional, which should be only set when the mounted volumes
> by the CRI driver have TTL and require re-validation on the token.
Can you please elaborate on the interaction between `RequiresRemount` and token lifetime? What should happen when `RequiresRemount==false` and a token expires? Will kubelet keep running a pod that used the expired token? I may be missing some basic knowledge, but why is that OK?
for vault, the token is used to authenticate as the pod's service account so that the driver can fetch the secrets stored in vault. however, the secrets have a TTL, so the driver will authenticate again to fetch newer secrets; in this case, `RequiresRemount==true`.
for other csi drivers that don't need re-authentication, `RequiresRemount==false`; e.g. the cert-manager csi driver wants to create a CertificateRequest on behalf of the pod, whose credential is only needed on the first mount if the certificate doesn't expire.
Does this mean remounting the volume set up by the driver that consumes the service account token? In my driver that refreshes its service account tokens, I don't remount the volume; I use (a port of) `AtomicWriter` to replace the files holding the secrets fetched from Vault.
With secrets (or X.509 certificates) acquired from Vault, having authenticated to Vault with a Kubernetes service account token, there are three things that can expire:
- The secret data (by way of an optional advisory TTL).
- The Vault token (usually you can refresh it, but when that offer runs out, you need to authenticate again).
- The Kubernetes service account token (minimum validity period from the TokenRequest API is ten minutes).

Even when the secret data changes, though, it's not necessary to unmount and remount volumes. The CSI driver can replace that data on disk "atomically".
If I've misunderstood what is being remounted here, feel free to ignore my critique.
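For context, a simplified sketch of the symlink-swap idea behind `AtomicWriter`; the real implementation in k8s.io/kubernetes/pkg/volume/util does more bookkeeping, and names like `atomicPublish` here are illustrative, not the actual API:

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
	"time"
)

// atomicPublish writes the payload into a fresh timestamped directory and then
// swaps a "..data" symlink to point at it. Readers that follow the per-file
// symlinks targetDir/<name> -> ..data/<name> (created once, not shown here)
// never observe a half-written state.
func atomicPublish(targetDir string, payload map[string][]byte) error {
	tsDir := filepath.Join(targetDir, fmt.Sprintf("..%d", time.Now().UnixNano()))
	if err := os.MkdirAll(tsDir, 0o755); err != nil {
		return err
	}
	for name, data := range payload {
		if err := os.WriteFile(filepath.Join(tsDir, name), data, 0o644); err != nil {
			return err
		}
	}
	dataLink := filepath.Join(targetDir, "..data")
	oldTarget, _ := os.Readlink(dataLink) // empty on the first publish

	// Create the new symlink under a temporary name, then rename it over
	// "..data": rename(2) replaces the old symlink atomically.
	tmpLink := filepath.Join(targetDir, "..data_tmp")
	if err := os.Symlink(filepath.Base(tsDir), tmpLink); err != nil {
		return err
	}
	if err := os.Rename(tmpLink, dataLink); err != nil {
		return err
	}
	if oldTarget != "" {
		return os.RemoveAll(filepath.Join(targetDir, oldTarget))
	}
	return nil
}

func main() {
	if err := atomicPublish("/tmp/demo-volume", map[string][]byte{"secret": []byte("v1")}); err != nil {
		panic(err)
	}
}
```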
> Does this mean remounting the volume set up by the driver that consumes the service account token?

the remount is triggered by kubelet, which means a sequence of `NodePublishVolume` calls. this way kubelet can ensure the token does not go stale, and the common token-fetching logic moves from the individual csi drivers into kubelet.

> I use (a port of) AtomicWriter to replace the files holding the secrets fetched from Vault.

this is good. many existing volumes which require remount (downwardAPI, projected service account token) also use AtomicWriter.
it is true that there is extra overhead incurred by the periodic remount in this approach, but you can still use AtomicWriter, right? what is the difference in overhead between writes via AtomicWriter and mounts for the secrets store csi driver? the difference seems minimal for the other in-tree volumes using AtomicWriter.
There's no remounting of volumes necessary, though. The process running in the container may be reading the files that would be unmounted; remounting introduces a gap during which the files are not available any longer, and in most cases slightly stale files are better than no files.
For most drivers, unmounting involves recursively deleting files from a directory, and mounting requires creating directories. There may be a mixture of files within such a mounted volume, with differing expiration times. Sweeping it all away and writing it again a short time later is not what these other drivers are doing with `AtomicWriter`, and it's not how my driver works. `AtomicWriter` helps facilitate in-place updates of a tree of files. Deleting all the files and writing them again later is not an atomic update.
fair enough, i can add the option.
IMO, the term "remount" comes from Kubernetes' `RequiresRemount` handling. It has never remounted anything; it refreshes atomic-writer volumes. Refactoring to "Refresh" would be better, but that's orthogonal to this enhancement.
@jsafrane do you think we need to clarify this new refresh behavior in the csi spec, or leave it as a Kubernetes-specific behavior? Although the drivers that support this would already have to be Kubernetes-specific anyway, since they need to make the TokenRequest API call. Right now I don't think it's expected that NodePublishVolume could get called repeatedly.
the csi driver would not make api calls. TokenRequest will be called by kubelet, and TokenReview is called by the storage provider.
Still, NodePublish will be called frequently. So far, NodePublish was called only once, when mounting the volume for the first time. As long as this repeated NodePublish applies only to ephemeral CSI volumes (secrets and similar stuff), they're a Kubernetes extension anyway and we don't need to update the CSI spec.
AWS EFS does not seem to be interested in periodic NodePublish when the token expires, right @micahhausler?
> This KEP proposes a way to obtain service account token for pods that the CSI
> drivers are provisioning volumes for. Since these tokens are valid only for a
The first sentence implies that the token is available in CreateVolume, while the token is passed in NodePublish (which makes more sense).
will reword it. :)
> `mountedPods` will have `requiresRemound=true` after `MarkRemountRequired`
> is called. `MarkRemountRequired` will call into `RequiresRemount` of the
> in-tree csi plugin to fetch the `CSIDriver` object.
> 3. Before `NodePublishVolume` call, kubelet will request token from
Some CSI drivers need ControllerPublish and NodeStage calls. Do you plan to add the token to these too? If not, can we safely assume that these calls won't ever need such a token?
> Do you plan to add the token to these too?

at this time it is a no. in the user stories collected so far, vault and cert-manager only need this token in `NodePublishVolume` calls to authenticate.

> can we safely assume that these calls won't ever need such a token?

i didn't find a good reason to use the pod's token in the controller RPC services and `NodeStage`/`NodeUnstage`, as they are cluster operations or early-stage node operations that are not much related to the pod.
If it's used in EFS, a real persistent volume that coincidentally uses only NodePublish, I think there is only a tiny step to other volume plugins that may require NodeStage / ControllerPublish.
I think NodeStage/ControllerPublish may add additional challenges because multiple pods can share the same volume at those operations.
I think an explicit warning in the KEP (and in subsequent docs in github.com/kubernetes-csi/docs) that tokens do not cover ControllerPublish/NodeStage will do for now.
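For concreteness, a hedged sketch of the kind of TokenRequest call kubelet would issue before `NodePublishVolume`, per step 3 of the quoted KEP text; the namespace, service account, audience, and function names are illustrative, not from the KEP:

```go
package main

import (
	"context"
	"fmt"

	authenticationv1 "k8s.io/api/authentication/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// requestToken asks the TokenRequest API for a token bound to the pod's
// service account and the driver's audience.
func requestToken(ctx context.Context, ns, sa, audience string) (string, error) {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		return "", err
	}
	client, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		return "", err
	}
	expiry := int64(600) // ten minutes, the TokenRequest minimum mentioned above
	tr, err := client.CoreV1().ServiceAccounts(ns).CreateToken(ctx, sa,
		&authenticationv1.TokenRequest{
			Spec: authenticationv1.TokenRequestSpec{
				Audiences:         []string{audience},
				ExpirationSeconds: &expiry,
			},
		}, metav1.CreateOptions{})
	if err != nil {
		return "", err
	}
	return tr.Status.Token, nil
}

func main() {
	tok, err := requestToken(context.Background(), "default", "my-sa", "vault")
	if err != nil {
		panic(err)
	}
	fmt.Println(len(tok))
}
```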
```go
... // existing fields

RequiresRemount *bool
ServiceAccountTokenAudience *string
```
What are the valid values for this string?
it is just an arbitrary non-empty identifier.
the SP is supposed to send a TokenReview request with that audience included to validate the token; see the sketch below.
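A hedged sketch of such a validation on the SP side, using the TokenReview API via client-go; the audience string and function name are illustrative:

```go
package sp

import (
	"context"
	"fmt"

	authenticationv1 "k8s.io/api/authentication/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// validateToken checks that a token presented by a pod is valid and bound to
// the provider's expected audience.
func validateToken(ctx context.Context, client kubernetes.Interface, receivedToken string) (bool, error) {
	review, err := client.AuthenticationV1().TokenReviews().Create(ctx,
		&authenticationv1.TokenReview{
			Spec: authenticationv1.TokenReviewSpec{
				Token:     receivedToken,
				Audiences: []string{"my-driver-audience"}, // illustrative audience
			},
		}, metav1.CreateOptions{})
	if err != nil {
		return false, err
	}
	if !review.Status.Authenticated {
		return false, fmt.Errorf("token rejected: %s", review.Status.Error)
	}
	return true, nil
}
```

A rejected or expired token shows up as `Status.Authenticated == false`, which is why the audience is the scoping mechanism discussed later in this thread.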
/ok-to-test
/assign @msau42 any more comments? shall we proceed with implementation?
> `TokenRequestStatus` will be passed to CSI driver.
>
> **`RequiresRemount`**: is optional, which should be only set when the mounted
> volumes by the CRI driver have TTL and require re-validation on the token.
typo: csi
done
```go
... // existing fields

RequiresRemount *bool
ServiceAccountTokenAudiences []string
```
Would it make sense to put the two service account related fields into another struct?
done
> plugin to do necessary validation and every csi driver needs to reimplement
> the same functionality.
>
> ## Implementation History
We also need to fill out the test plan, graduation criteria, and production readiness sections before we can move the kep to implementable.
regarding graduation criteria and production readiness, it seems they need a FeatureGate. i am not sure we need a feature gate for this one as it doesn't change existing behavior.
I think any API change we need to make would require going through the alpha/beta/ga feature gates so that we have the ability to make changes after testing and receiving feedback.
done
/approve
/hold
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: msau42, zshihang. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files. Approvers can indicate their approval by writing `/approve` in a comment.
@kubernetes/sig-auth-proposals
```go
// ServiceAccountToken contains parameters of a token.
type ServiceAccountToken struct {
	Audiences []string
```
I think this should probably only allow a single audience. I'd like @liggitt to weigh in on the API review. I don't think this needs to block though.
the use case to consider is the secrets store csi driver, which possibly has multiple providers. there are two ways:
- the current approach is to distribute one token with multiple audiences to the secrets store csi driver.
- another way would be to distribute multiple tokens (each with a single audience).

FYI @seh
My suggestion to allow multiple audiences was for parity with the TokenRequest API, and to allow operators greater flexibility in talking to, say, more than one Vault authentication role, where they each demand a different audience. Doing so would also allow changing the configuration of such a role, and allowing multiple CSI driver daemons to play along through the transition.
both ways can satisfy the multi-audience scenario in a csi driver. take the secrets store csi driver as an example: with the first approach, the csi driver only needs to pass the single token with multiple audiences to all the providers, while with the second approach the csi driver needs to hardcode a mapping between the tokens from kubelet and the providers. the advantage i can think of for the second approach is that every provider gets its own token, but i am not sure how much security benefit we get here. it seems to me that the convenience of the first approach outweighs the advantages the second approach provides.
> the csi driver only needs to pass the single token with multiple audiences to all the providers

passing replayable credentials which are considered valid by multiple providers to all the providers does not seem like a good idea

> allow operators greater flexibility in talking to, say, more than one Vault authentication role, where they each demand a different audience

I'm not very familiar with this... is this a scenario where you are presenting the token to a single vault server, using multiple audiences as some sort of scoping mechanism?
> is this a scenario where you are presenting the token to a single vault server, using multiple audiences as some sort of scoping mechanism?
Yes. A Vault operator can mount the Kubernetes auth method any number of times. Within each mounted auth method, you can create any number of roles. Each such role defines which service account names, within which namespaces, and bound to which audience (and only a single audience, questioned here and here, alas too late) to accept.
In my CSI driver, a pod can read multiple Vault KV secrets and request multiple X.509 certificates, each involving authenticating as a different Vault role. Each of these roles could demand that the principal present a service account token bound to a different audience. If a given service account token is bound to multiple audiences—perhaps the union of what these Vault roles demand—then a single token could suffice for authenticating as each Vault role.
> passing replayable credentials which are considered valid by multiple providers to all the providers does not seem like a good idea

agreed. when passing multiple tokens to the csi driver, in the case of the secrets store csi driver, it is required to implement a mapping between the tokens and the providers. my current idea is to pass a map in `VolumeAttributes`:

```
csi.storage.k8s.io/serviceAccount.tokens: {
  'audience-0': {
    'token': token-0,
    'expiry': expiry-0,
  },
  'audience-1': {
    'token': token-1,
    'expiry': expiry-1,
  },
  ...
}
```

regarding Vault roles which might have different audiences, the csi driver would pass all the tokens whose audience matches 'vault-*' to the Vault provider. of course, this is a non-goal in this KEP, but it is something that the csi driver needs to implement to accommodate the Vault case.
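A hedged sketch of how a driver might decode such a per-audience map; the field names mirror the illustrative example above, not a finalized API:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// tokenEntry mirrors the illustrative per-audience payload shown above.
type tokenEntry struct {
	Token  string `json:"token"`
	Expiry string `json:"expiry"`
}

func main() {
	// raw stands in for the value of the hypothetical
	// csi.storage.k8s.io/serviceAccount.tokens volume attribute.
	raw := `{"vault-kv":{"token":"token-0","expiry":"2020-11-01T00:00:00Z"},
	         "vault-pki":{"token":"token-1","expiry":"2020-11-01T00:00:00Z"}}`

	tokens := map[string]tokenEntry{}
	if err := json.Unmarshal([]byte(raw), &tokens); err != nil {
		panic(err)
	}
	// Route each token to the provider that demands its audience,
	// e.g. everything matching "vault-*" goes to the Vault provider.
	for audience, entry := range tokens {
		fmt.Printf("audience %q -> token of length %d, expires %s\n",
			audience, len(entry.Token), entry.Expiry)
	}
}
```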
```go
... // existing fields

RequiresRemount *bool
ServiceAccountToken *ServiceAccountToken
```
Is there ever a use case for multiple service account tokens? E.g. if I'm using the secrets driver with both AWS and Vault?
done
/lgtm
This is being implemented in kubernetes/kubernetes#93130 and is currently targeted for v1.20.
/milestone v1.20
@micahhausler: You must be a member of the kubernetes/milestone-maintainers GitHub team to set the milestone. If you believe you should be able to issue the /milestone command, please contact your and have them propose you as an additional delegate for this responsibility. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@micahhausler is there an enhancement issue open for this that is being tracked by the release team?
Ref: kubernetes/kubernetes#86448