
RLazarus (Reuven Lazarus) (rzl)
User

User Details

User Since
Oct 15 2019, 4:02 PM (267 w, 3 d)
Availability
Available
IRC Nick
rzl
LDAP User
RLazarus
MediaWiki User
RLazarus (WMF)

Recent Activity

Wed, Nov 27

RLazarus added a comment to T341553: Allow running one-off scripts manually.

Thanks @Clement_Goubert! Yeah, --sort-by=.metadata.creationTimestamp is my go-to for ordering.

Wed, Nov 27, 6:03 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T380925: Support passing env variables to maintenance scripts in mwscript-k8s.

You can definitely break maintenance scripts by changing the right env variable, but as long as the user needs to be explicit about which variables they want to set / copy, that seems like a "don't do that then" problem. From a security perspective, I don't think it makes any difference as you can already execute arbitrary code via shell.php (and mwscript also lets you run ad hoc PHP files).

Wed, Nov 27, 12:11 AM · MW-on-K8s, serviceops

Tue, Nov 26

RLazarus placed T376616: MWScript.php doesn't allow wikiless scripts without the .php suffix up for grabs.
Tue, Nov 26, 9:27 PM · MW-on-K8s, serviceops
RLazarus created T380925: Support passing env variables to maintenance scripts in mwscript-k8s.
Tue, Nov 26, 9:26 PM · MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

Do we plan to fix the race conditions related to --attach?

Tue, Nov 26, 8:31 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T380575: Make SUL3 authentication domain mode available from CLI.

Drive-by: The reason is that sudo doesn't preserve the original environment by default -- so in

sudo FOO=bar command

the command sees $FOO, but in

FOO=bar sudo command

it doesn't.
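
A minimal sketch of the same ordering effect, runnable without root. It uses `env -i` as a stand-in for sudo's environment reset. That substitution is an assumption made for illustration; what sudo actually preserves depends on the sudoers policy (for example `env_reset` and `env_keep`).

```shell
# `env -i` starts the command with an empty environment, standing in here
# for sudo's default environment reset. This is an illustration, not sudo itself.

# Assignment placed BEFORE the wrapper: FOO goes into the wrapper's own
# environment, which the wrapper then discards before running the command.
dropped=$(FOO=bar env -i /bin/sh -c 'echo "${FOO:-unset}"')

# Assignment placed AFTER the wrapper: the wrapper itself sets FOO for the command.
kept=$(env -i FOO=bar /bin/sh -c 'echo "${FOO:-unset}"')

echo "FOO=bar env -i command -> $dropped"
echo "env -i FOO=bar command -> $kept"
```

The first form loses the variable and the second keeps it, which mirrors the two sudo invocations above.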

Tue, Nov 26, 6:33 PM · MediaWiki-Platform-Team, SUL3
RLazarus added a comment to T341553: Allow running one-off scripts manually.
  1. Hitting control-c no longer terminates the job (so blindly following the script's guidance will execute the job)

[...]
Since mwscript-k8s now prints guidance on ctrl+c, the first issue is probably a low-priority thing (we likely do not want to hardcode stuff specific to mwscript-k8s to userOptions.php itself, and hitting ctrl+c will print the right guidance; the only thing that could be done is explicitly asking the user about their intention, as I suggested at T341553#10263470, but I understand the concerns previously raised by @RLazarus).

Tue, Nov 26, 6:21 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

Is there a way to pass an env variable? The need came up in T380575: Make SUL3 authentication domain mode available from CLI; old mwscript doesn't do it smoothly either, but it can be made to work, which I suspect won't be the case with mwscript-k8s.

Tue, Nov 26, 5:28 PM · Patch-For-Review, MW-on-K8s, serviceops

Thu, Nov 14

RLazarus updated the task description for T379901: Create tool to monitor and automatically delete misbehaving pods.
Thu, Nov 14, 4:42 PM · serviceops, Kubernetes

Fri, Nov 8

RLazarus added a comment to T376604: [PS] Update PropertySuggester update process for mwscript-k8s.

As currently implemented, if the shell dies (or the network is disconnected, or anything else interrupts the stream) then stdin is closed in the container.

Fri, Nov 8, 5:29 PM · MW-1.44-notes (1.44.0-wmf.6; 2024-12-03), Wikidata Dev Team (Wikidata.org Slice), MediaWiki-extensions-PropertySuggester, wmde-wikidata-tech, Wikidata, MW-on-K8s, serviceops

Tue, Nov 5

RLazarus closed T376230: Support bringing text files into the container for one-off maintenance scripts as Resolved.

This is now supported, and documented at https://wikitech.wikimedia.org/wiki/Maintenance_scripts#Input_from_a_file.

Tue, Nov 5, 12:51 AM · MW-on-K8s, serviceops
RLazarus closed T376230: Support bringing text files into the container for one-off maintenance scripts, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Tue, Nov 5, 12:49 AM · Patch-For-Review, MW-on-K8s, serviceops

Thu, Oct 31

RLazarus created T378754: Persistent logging of mwscript-k8s runs.
Thu, Oct 31, 5:55 PM · MW-on-K8s, serviceops

Oct 30 2024

Michael awarded T378429: Allow members of restricted to run maintenance scripts a Like token.
Oct 30 2024, 8:36 AM · MW-on-K8s, serviceops
RLazarus added a comment to T377497: Functional replacement for importImages.php on Kubernetes.

(For the avoidance of doubt: We'll need some form of solution to this problem before turning off the mwmaint hosts, but I'm not working on it as a mwscript-k8s feature.)

Oct 30 2024, 12:54 AM · serviceops, MW-on-K8s

Oct 29 2024

RLazarus added a comment to T341553: Allow running one-off scripts manually.

Recording here that I'm noticing myself still running one-off scripts on the maint-hosts because, as I understand it, for the new way of running them, I would need deployer-rights, and I do not have (and do not really want) those.

So going forward, I have to either ask for more privileges, which allow me to do many more things than I actually want to do, or ask someone else to run the maintenance scripts for me in the future. Neither is great, but maybe these tradeoffs are worth it.

Oct 29 2024, 12:47 AM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus created T378429: Allow members of restricted to run maintenance scripts.
Oct 29 2024, 12:41 AM · MW-on-K8s, serviceops

Oct 25 2024

RLazarus added a comment to T341553: Allow running one-off scripts manually.

at the very least, --help should have that link.

Oct 25 2024, 10:27 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

Suggestion: Catch ctrl-c and print out something like

⚠️ It looks like you may have wanted to stop your script's execution, use:
kube_env mw-script-deploy codfw; kubectl delete job mw-script.codfw.foo

then proceed to exit mwscript-k8s ?

Oct 25 2024, 5:11 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

[...] needing to remember to set up the correct credentials with kube_env mw-script-deploy codfw before being able to use kubectl ... commands is a cognitive burden that could be replaced by mwscript-k8s making that environmental change as part of its normal operation.

Oct 25 2024, 5:01 PM · Patch-For-Review, MW-on-K8s, serviceops

Oct 24 2024

RLazarus added a comment to T341553: Allow running one-off scripts manually.

Sorry this happened. Unfortunately it's kind of working as intended -- not because it's supposed to be hard to kill a job when you want to kill it, but because the job is supposed to keep working after the mwscript-k8s launcher terminates. (Thus preventing the "oops, I forgot to start it in a tmux and now I'm stuck" scenario.)

Oct 24 2024, 6:02 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T367118: Control mw-on-k8s periodic maintenance jobs with an etcd value.

The shape of this sounds right to me. Similarly we can have the mw-script helmfiles gate on the same conftool value for defense-in-depth, but also read it in the wrapper script and exit early with a friendlier error message.

Oct 24 2024, 12:22 AM · Datacenter-Switchover, serviceops, MW-on-K8s

Oct 22 2024

RLazarus added a comment to T376427: Update cirrus for mwscript-on-k8s.

Note that while the announcement is only for "manual maintenance scripts", it's probably safe to assume invoking mwscript from other scripts that invoke mwscript falls under a similar umbrella.

Oct 22 2024, 10:54 PM · Discovery-Search (Current work)
RLazarus closed T377292: Support machine-readable output for mwscript-k8s as Resolved.

This is ready to use, and documented (including the JSON output format) at https://wikitech.wikimedia.org/wiki/Maintenance_scripts#Shelling_out_to_mwscript-k8s. @EBernhardson please give this a try and let me know how it works for you! Happy to iterate as needed.

Oct 22 2024, 6:44 PM · MW-on-K8s, serviceops
RLazarus closed T377292: Support machine-readable output for mwscript-k8s, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Oct 22 2024, 6:43 PM · Patch-For-Review, MW-on-K8s, serviceops

Oct 17 2024

RLazarus added a comment to T377497: Functional replacement for importImages.php on Kubernetes.

Thanks -- that was in reference to:

Oct 17 2024, 7:13 PM · serviceops, MW-on-K8s
RLazarus updated the task description for T377497: Functional replacement for importImages.php on Kubernetes.
Oct 17 2024, 7:12 PM · serviceops, MW-on-K8s
RLazarus created T377497: Functional replacement for importImages.php on Kubernetes.
Oct 17 2024, 6:44 PM · serviceops, MW-on-K8s

Oct 16 2024

RLazarus created T377292: Support machine-readable output for mwscript-k8s.
Oct 16 2024, 1:15 AM · MW-on-K8s, serviceops

Oct 15 2024

RLazarus added a comment to T372337: High number of dangling search index results at fr.wikipedia or it.wikipedia.

Instead of

Oct 15 2024, 7:48 PM · MW-1.43-notes (1.43.0-wmf.26; 2024-10-08), User-Michael, Growth-Team (Current Sprint), Add-Link

Oct 10 2024

RLazarus renamed T376616: MWScript.php doesn't allow wikiless scripts without the .php suffix from mwscript-k8s no longer supports wikiless scripts to MWScript.php doesn't allow wikiless scripts without the .php suffix.
Oct 10 2024, 8:04 PM · MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

[...]
The mediafiles can be very large – I've certainly uploaded files that had dozens of GBs in total. As long as mwmaint had enough space and sufficient sleep was allowed for videoscalers to catch up, things worked well.

Oct 10 2024, 8:04 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T376099: --timeout flag for mwscript-k8s, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Oct 10 2024, 7:38 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T376099: --timeout flag for mwscript-k8s as Resolved.

This is now supported!

Oct 10 2024, 7:38 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T376230: Support bringing text files into the container for one-off maintenance scripts.

Ha, attachAccount.php specifically blocks this from working:

Oct 10 2024, 6:44 PM · MW-on-K8s, serviceops

Oct 9 2024

RLazarus added a comment to T376776: mw-scripts SAL integration.

A mwscript-k8s flag to log to SAL is on my to-do list -- I hadn't gotten around to filing a task, thanks.

Oct 9 2024, 8:48 PM · Sustainability (Incident Followup), MW-on-K8s, serviceops

Oct 8 2024

RLazarus added a comment to T376714: Evaluate running a statsd-exporter in the mw-script namespace.

And if for whatever reason, we end up with a different namespace from the currently implemented as systemd timers recurring scripts

Oct 8 2024, 5:45 PM · Patch-For-Review, MW-on-K8s, serviceops

Oct 7 2024

RLazarus added a comment to T376616: MWScript.php doesn't allow wikiless scripts without the .php suffix.

This is MWScript.php behavior, and it's actually unchanged:

Oct 7 2024, 4:16 PM · MW-on-K8s, serviceops
RLazarus claimed T376616: MWScript.php doesn't allow wikiless scripts without the .php suffix.

If this changed in mwscript-k8s, it's unintended (but definitely not impossible) -- I'll dig into it today and get back to you.

Oct 7 2024, 3:57 PM · MW-on-K8s, serviceops

Oct 4 2024

RLazarus added a comment to T341553: Allow running one-off scripts manually.

And a bug report. When running mwscript-k8s from an arbitrary directory it leaves behind a .kube directory. I often move to the current deploy dir, such as /srv/mediawiki/php-1.43.0-wmf.25, to get tab completion of the maintenance scripts. Running mwscript-k8s from that dir left a .kube directory behind. That directory caused scap to fail with a permission denied error since the dir is not world readable. My guess is that it should be created in $HOME/.kube. Is it plausible that $HOME isn't being passed down to the k8s commands?

Oct 4 2024, 7:10 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T341553: Allow running one-off scripts manually.

Could --follow also exit with the same return code as the script that was executed? The most notable example of this on our side is the CirrusSearch UpdateOneSearchIndexConfig.php maintenance script which signals to cirrus-reindex-orchestrator via the exit code one of three possible results of the update operation.
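
For illustration, passing a child's exit status through a wrapper might look like the sketch below. `run_and_propagate` is a hypothetical helper, not part of mwscript-k8s, and the `exit 3` child merely stands in for a maintenance script that signals a result through its exit code.

```shell
# Hypothetical helper: run a command, report its status on stderr, and
# return that same status so callers can branch on it.
run_and_propagate() {
    _status=0
    "$@" || _status=$?
    echo "child exited with status $_status" >&2
    return "$_status"
}

# Stand-in for a script that reports one of several outcomes via its exit code.
rc=0
run_and_propagate /bin/sh -c 'exit 3' || rc=$?
echo "wrapper exit status: $rc"
```

An orchestrating script could then dispatch on `$rc` just as it would on the status of the script itself.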

Oct 4 2024, 5:19 PM · Patch-For-Review, MW-on-K8s, serviceops

Oct 2 2024

RLazarus closed T369142: Show more useful information when mwscript-k8s fails to launch as Resolved.
rzl@deploy1003:~$ mwscript-k8s --attach -- shell.php
⏳ Starting shell.php on Kubernetes as job mw-script.codfw.9m47rjcq ...
⏳ Waiting for the container to start...
🚀 Job is running.
ℹ️ Expecting a prompt but don't see it? Due to a race condition, the beginning of the output might be missing. Try pressing enter.
📜 Attached to stdin/stdout:
error: unable to upgrade connection: container mediawiki-9m47rjcq-app not found in pod mw-script.codfw.9m47rjcq-fw2t7_mw-script
☠️ Command failed with status 1: /usr/bin/kubectl attach --quiet job/mw-script.codfw.9m47rjcq --container mediawiki-9m47rjcq-app -it
For logs (may not work) run:
K8S_CLUSTER=codfw KUBECONFIG=/etc/kubernetes/mw-script-deploy-codfw.config kubectl logs -f job/mw-script.codfw.9m47rjcq mediawiki-9m47rjcq-app
Oct 2 2024, 4:17 PM · MW-on-K8s, serviceops
RLazarus closed T369142: Show more useful information when mwscript-k8s fails to launch, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Oct 2 2024, 4:16 PM · Patch-For-Review, MW-on-K8s, serviceops

Oct 1 2024

RLazarus added a comment to T341553: Allow running one-off scripts manually.

Is it possible to bring a text file from disk into the container where a script runs? Some scripts (like extensions/CentralAuth/maintenance/attachAccount.php) use those for lists of things to process.

Oct 1 2024, 11:35 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus created T376230: Support bringing text files into the container for one-off maintenance scripts.
Oct 1 2024, 11:30 PM · MW-on-K8s, serviceops
RLazarus created T376099: --timeout flag for mwscript-k8s.
Oct 1 2024, 3:14 AM · Patch-For-Review, MW-on-K8s, serviceops

Sep 27 2024

RLazarus created T375910: Functional replacement for `sql` on Kubernetes.
Sep 27 2024, 5:22 PM · serviceops, MW-on-K8s

Sep 26 2024

RLazarus claimed T369142: Show more useful information when mwscript-k8s fails to launch.
Sep 26 2024, 3:10 AM · MW-on-K8s, serviceops
RLazarus closed T369143: Allow cleaning up specific mwscript-k8s runs as Declined.

Thanks! In general you shouldn't need to do this, even if the job was a mistake. Kubernetes cleans up the job automatically a week after it terminates -- whether it completed or failed -- and there's nothing wrong with leaving it until then.

Sep 26 2024, 2:37 AM · MW-on-K8s, serviceops
RLazarus closed T369143: Allow cleaning up specific mwscript-k8s runs, a subtask of T341553: Allow running one-off scripts manually, as Declined.
Sep 26 2024, 2:36 AM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T359127: MW image version for maintenance scripts as Resolved.

The image version is now copied from the mw-web deployment.

Sep 26 2024, 2:14 AM · MW-on-K8s, serviceops
RLazarus updated the task description for T359127: MW image version for maintenance scripts.
Sep 26 2024, 2:13 AM · MW-on-K8s, serviceops
RLazarus closed T359127: MW image version for maintenance scripts, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Sep 26 2024, 2:13 AM · Patch-For-Review, MW-on-K8s, serviceops

Sep 9 2024

RLazarus updated the task description for T359130: Update DC switchover cookbooks to handle maintenance scripts on k8s.
Sep 9 2024, 5:08 PM · Datacenter-Switchover, serviceops, MW-on-K8s

Sep 5 2024

RLazarus added a member for acl*sre-team: jasmine_.
Sep 5 2024, 9:16 PM

Sep 4 2024

RLazarus updated the task description for T359127: MW image version for maintenance scripts.
Sep 4 2024, 7:49 PM · MW-on-K8s, serviceops

Aug 29 2024

RLazarus closed T369676: `mwscript-k8s --attach` seems to terminate IO after a few seconds without input, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Aug 29 2024, 2:22 AM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T369676: `mwscript-k8s --attach` seems to terminate IO after a few seconds without input as Resolved.

So, mwscript-k8s is still in medium-early development; some snags are still expected, and it shouldn't be anyone's primary workflow yet. There'll be wider announcements when it's ready for adoption (Soon TM but Not Yet TM). In the meantime, thanks for giving it a try; feedback is still welcome, just don't panic when it isn't ready to use full-time.

Aug 29 2024, 2:22 AM · MW-on-K8s

Aug 22 2024

RLazarus added a comment to T372961: db1206 depooled, high replication lag.

Thanks both! Sorry for missing the earlier task.

Aug 22 2024, 5:06 PM · DBA

Aug 20 2024

RLazarus renamed T372961: db1206 depooled, high replication lag from db1206 repooled, high replication lag to db1206 depooled, high replication lag.
Aug 20 2024, 11:34 PM · DBA
RLazarus triaged T372961: db1206 depooled, high replication lag as High priority.
Aug 20 2024, 10:54 PM · DBA
RLazarus created T372961: db1206 depooled, high replication lag.
Aug 20 2024, 10:54 PM · DBA

Aug 16 2024

RLazarus added a comment to T367118: Control mw-on-k8s periodic maintenance jobs with an etcd value.

See also T359130 for the cookbook work. We aren't as far as I expected we'd be, so we can revisit which of those steps for cronjobs-on-k8s need to be accomplished before the switchover, but I agree it's a good idea to add this to mw-cli-wrapper.py in the meantime and start using it.

Aug 16 2024, 5:22 PM · Datacenter-Switchover, serviceops, MW-on-K8s

Aug 15 2024

RLazarus added a member for WMF-NDA: jasmine_.
Aug 15 2024, 12:29 AM

Aug 14 2024

RLazarus assigned T372521: PHP Notice: Undefined index: min_avail_workers in /srv/monitoring/lib.php on line 334 to hnowlan.

This looks related to https://gerrit.wikimedia.org/r/c/operations/docker-images/production-images/+/1060867 -- at a guess, should

Aug 14 2024, 10:56 PM · serviceops, Wikimedia-production-error
RLazarus updated subscribers of T372521: PHP Notice: Undefined index: min_avail_workers in /srv/monitoring/lib.php on line 334.
Aug 14 2024, 10:55 PM · serviceops, Wikimedia-production-error

Jul 30 2024

RLazarus added a comment to T371427: Transient httpbb errors from on mwdebug boxes.

Yeah, the usual standard, for any time you change something on a debug host, is a note in #wikimedia-operations to say something like "I'm grabbing mwdebug1002 to test an Apache config change" -- that ensures no one else is testing a conflicting change at the same time, such as (for example) a deployment during a deployment window. :)

Jul 30 2024, 9:49 PM · serviceops-radar, Release-Engineering-Team (Seen), Scap
RLazarus updated subscribers of T371427: Transient httpbb errors from on mwdebug boxes.

From -serviceops IRC logs, I think this was during the time @Ottomata was working on an unrelated Apache config change, and testing it on a mwdebug host. That would explain why the tests failed on one host only.

Jul 30 2024, 7:59 PM · serviceops-radar, Release-Engineering-Team (Seen), Scap

Jul 26 2024

RLazarus added a comment to T371069: Add helm rollback functionality to scap.

I haven't dug for logs, but workers were saturated and latency was up: https://grafana.wikimedia.org/goto/h67ix_uSg

Jul 26 2024, 3:25 PM · Release-Engineering-Team (Priority Backlog 📥), MW-on-K8s, Scap
RLazarus updated the task description for T371069: Add helm rollback functionality to scap.
Jul 26 2024, 12:51 AM · Release-Engineering-Team (Priority Backlog 📥), MW-on-K8s, Scap
RLazarus created T371069: Add helm rollback functionality to scap.
Jul 26 2024, 12:49 AM · Release-Engineering-Team (Priority Backlog 📥), MW-on-K8s, Scap

Jul 22 2024

RLazarus added a comment to T370118: Register Citoid as a "friendly bot" (or alternatively verified bot) with Cloudflare.

Someone in serviceops probably knows the answer to this but I don't, at least not confidently. Here's a rough stab:

Jul 22 2024, 8:20 PM · serviceops, Goal, VisualEditor-MediaWiki-References, Editing-team (Kanban Board), VisualEditor, Citoid

Jul 17 2024

Jelto awarded T317794: requestctl can't act on cache hits a Like token.
Jul 17 2024, 2:15 PM · SRE-Sprint-Week-Sustainability-March2023, Patch-For-Review, Traffic, Sustainability (Incident Followup), conftool

Jul 15 2024

kamila awarded T317794: requestctl can't act on cache hits a Love token.
Jul 15 2024, 12:25 PM · SRE-Sprint-Week-Sustainability-March2023, Patch-For-Review, Traffic, Sustainability (Incident Followup), conftool

Jul 11 2024

RLazarus triaged T369855: db1179 crashed - hardware issues as High priority.
Jul 11 2024, 5:54 PM · SRE, DC-Ops, ops-eqiad, DBA

Jul 9 2024

RLazarus added a comment to T341560: Migrate mwmaint server functionality to mw-on-k8s.

Script output is visible through kubectl logs, and mwscript-k8s can be invoked with -f to immediately start tailing the script output (under the hood, it just invokes that kubectl command). If you don't launch with -f it prints out the kubectl command so you can copy and paste it.

Jul 9 2024, 9:11 PM · serviceops, MW-on-K8s

Jul 4 2024

RLazarus closed T369175: mwscript-k8s --attach error: TypeError: 'NoneType' object is not iterable as Resolved.

This is fixed, thanks again for testing!

Jul 4 2024, 12:11 AM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T369175: mwscript-k8s --attach error: TypeError: 'NoneType' object is not iterable, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Jul 4 2024, 12:09 AM · Patch-For-Review, MW-on-K8s, serviceops

Jul 3 2024

RLazarus claimed T369175: mwscript-k8s --attach error: TypeError: 'NoneType' object is not iterable.

Thanks for the report! It's actually not because of the successful exit; the script handles that.

Jul 3 2024, 10:46 PM · Patch-For-Review, MW-on-K8s, serviceops

Jul 2 2024

RLazarus closed T368966: Pipe stdin into one-off maintenance scripts on Kubernetes as Resolved.
rzl@deploy1002:~$ echo 'https://office.wikimedia.org/wiki/User:RLazarus_(WMF)' | ./mwscript-k8s --attach -- purgeList.php
⏳ Starting purgeList.php on Kubernetes...
[snip]
⏳ Waiting for the container to start...
🚀 Job is running.
📜 Attaching to stdin/stdout:
Purging 1 urls
Done!
Jul 2 2024, 10:30 PM · MW-on-K8s, serviceops
RLazarus closed T368966: Pipe stdin into one-off maintenance scripts on Kubernetes, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Jul 2 2024, 10:29 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus added a comment to T369080: statsd-exporter in k8s is not configured to use its mapping configuration.

Disregard the above scap, I got too carried away with "never run helmfile across all mw deployments, use scap instead" but obviously that rule doesn't apply here. :)

Jul 2 2024, 10:08 PM · SRE, Observability-Metrics

Jul 1 2024

RLazarus updated the task description for T368966: Pipe stdin into one-off maintenance scripts on Kubernetes.
Jul 1 2024, 6:44 PM · MW-on-K8s, serviceops
RLazarus created T368966: Pipe stdin into one-off maintenance scripts on Kubernetes.
Jul 1 2024, 6:27 PM · MW-on-K8s, serviceops
RLazarus closed T348284: Handle sidecar containers in one-off Kubernetes jobs, a subtask of T292861: Find a better solution than `concurrencyPolicy: Replace` for sidecars in CronJob, as Resolved.
Jul 1 2024, 5:41 PM · User-bd808, Toolhub
RLazarus closed T348284: Handle sidecar containers in one-off Kubernetes jobs, a subtask of T341553: Allow running one-off scripts manually, as Resolved.
Jul 1 2024, 5:41 PM · Patch-For-Review, MW-on-K8s, serviceops
RLazarus closed T348284: Handle sidecar containers in one-off Kubernetes jobs as Resolved.
Jul 1 2024, 5:41 PM · MW-on-K8s, serviceops

Jun 10 2024

RLazarus created T367118: Control mw-on-k8s periodic maintenance jobs with an etcd value.
Jun 10 2024, 8:24 PM · Datacenter-Switchover, serviceops, MW-on-K8s

May 1 2024

RLazarus awarded T358636: etcdmirror does not recover from a cleared waitIndex a Barnstar token.
May 1 2024, 4:12 PM · serviceops

Apr 24 2024

RLazarus added a comment to T348284: Handle sidecar containers in one-off Kubernetes jobs.

Thanks. At present the controller monitors all namespaces, but ignores pods other than in mw-script. So if I were estimating memory usage I'd base it on the total number of pod events in the cluster, not just in the namespace.

Apr 24 2024, 5:42 PM · MW-on-K8s, serviceops

Apr 17 2024

RLazarus added a comment to T362717: scap should optionally display helmfile diffs for review.

That sounds reasonable! Note for the future that helm diff has a --suppress-output-line-regex which does exactly what you'd like it to do, but it's not available in the version we're currently running.

Apr 17 2024, 10:42 PM · serviceops, Release-Engineering-Team, Scap
RLazarus closed T57857: Unit tests for apache config/rewrites as Resolved.
Apr 17 2024, 7:54 PM · Wikimedia-Apache-configuration

Apr 16 2024

RLazarus added a comment to T362717: scap should optionally display helmfile diffs for review.

If we're really worried about that race condition, is it plausible to do this?

Apr 16 2024, 9:56 PM · serviceops, Release-Engineering-Team, Scap

Apr 4 2024

RLazarus triaged T361860: Old "Email this user" email is repeatedly resent as High priority.

Clinic duty SRE here -- I/F, can you start investigating this at the MTA end? Triaging this to High in case it's widespread, but feel free to decrease if it turns out it's not.

Apr 4 2024, 6:19 PM · Mail, Infrastructure-Foundations, MediaWiki-Email, SRE
RLazarus closed T361798: Grant Access to <LDAP/wmf> for <ospingou> as Resolved.
rzl@mwmaint1002:~$ ldapsearch -x cn=wmf | grep ospingou
member: uid=ospingou,ou=people,dc=wikimedia,dc=org
Apr 4 2024, 5:27 PM · Patch-For-Review, SRE, LDAP-Access-Requests
RLazarus added a member for WMF-NDA: Ospingou.
Apr 4 2024, 5:19 PM
RLazarus claimed T361742: Requesting access to shell access to analytics client servers for AndyRussG.
Apr 4 2024, 1:12 AM · Patch-For-Review, SRE, SRE-Access-Requests
RLazarus updated the task description for T361742: Requesting access to shell access to analytics client servers for AndyRussG.
Apr 4 2024, 1:12 AM · Patch-For-Review, SRE, SRE-Access-Requests

Apr 3 2024

RLazarus claimed T361665: Grant Access to wmf for AndyRussG.
Apr 3 2024, 6:15 PM · Patch-For-Review, SRE, LDAP-Access-Requests
RLazarus updated subscribers of T361665: Grant Access to wmf for AndyRussG.

@AndyRussG Welcome back!

Apr 3 2024, 6:15 PM · Patch-For-Review, SRE, LDAP-Access-Requests