User Details
User Details
- User Since
- Nov 9 2015, 9:18 PM (474 w, 1 d)
- Availability
- Available
- IRC Nick
- gehel
- LDAP User
- Gehel
- MediaWiki User
- GLederrey (WMF) [ Global Accounts ]
Yesterday
Yesterday
Gehel added a comment to T381707: Low available space on Hadoop / HDFS.
The current rate of capacity consumption seems to be around 10% / month since October 1. If this stays stable, we'll be below 10% of capacity before we are fully back from the end of year holiday. This seems to close to the limit to be comfortable.
Gehel placed T362922: Audit/consider enabling CPU performance governor on DPE SRE-owned hosts up for grabs.
Gehel added a comment to T381707: Low available space on Hadoop / HDFS.
https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/941 has been merged and should also reduce storage consumption (see T379024)
Gehel added a comment to T46581: Partial Wikidata dumps.
Would a split by "instance of" (P31) be useful?
Mon, Dec 9
Mon, Dec 9
Gehel archived Access requests.
Gehel edited projects for T258962: Investigate easier methods for WMF staff to access Superset, added: Data-Platform-SRE; removed Data Platform Access request and user management.
Gehel edited projects for T258962: Investigate easier methods for WMF staff to access Superset, added: Data Platform Access request and user management; removed Data-Platform-SRE.
Gehel created Access requests.
Gehel moved T381649: Add FactGrid to WDQS allowlist from Backlog - project to Backlog - operations on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Fri, Dec 6
Fri, Dec 6
Gehel updated the task description for T381707: Low available space on Hadoop / HDFS.
Gehel updated the task description for T381707: Low available space on Hadoop / HDFS.
Gehel updated the task description for T381707: Low available space on Hadoop / HDFS.
Gehel closed T377153: Migrate Glent to Gitlab for publication of artifacts, a subtask of T367405: Migrate existing Java packages to deploying to Gitlab, including new version of parent pom, validation that all dependencies are available, and validation that deployment to production still works, as Resolved.
Gehel closed T377150: Config: enable CirrusSearchEnableEventBusWeightedTags , a subtask of T372904: Use page_weighted_tags_changed stream, as Resolved.
Gehel moved T380622: Migrate the airflow-wmde scheduler to Kubernetes from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T380613: Migrate the airflow-wmde database to Kubernetes from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T381163: Export the postgresql-airflow-wmde backups to Bacula from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T380729: 2024-11-20 dump run appears stuck from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T381407: Decommission an-presto100[1-5] from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T380765: Integrate Kerberos login with RestExternalTaskSensor from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel moved T375595: Check home/HDFS leftovers of dumisani from Done to Reported on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Wed, Dec 4
Wed, Dec 4
Gehel added a comment to T377875: Migrate dse-k8s cluster from docker to containerd.
Let's not touch this during December, it seems too risky. But we can already come up with a plan and probably a number of steps as subtasks.
Tue, Dec 3
Tue, Dec 3
Gehel moved T380729: 2024-11-20 dump run appears stuck from Backlog - project to Backlog - operations on the Data-Platform-SRE (2024.11.30 - 2024.12.20) board.
Gehel added a project to T381389: Add QoS markings to profile Hadoop/HDFS analytics traffic: Data-Platform-SRE.
Mon, Dec 2
Mon, Dec 2
Gehel edited projects for T376427: Update cirrus for mwscript-on-k8s, added: Discovery-Search; removed Discovery-Search (Current work).
Moving back to our backlog. Some specific cases are addressed in T378382, the rest will be revisited once we have made more progress.
Gehel moved T381283: wdqs1025 fails to PXE boot, NIC shows "no link" in DRAC web UI from needs triage to Ops / SRE on the Discovery-Search board.
Gehel edited projects for T381195: Inconsistent Hydra Links Cause Interruption in Data Fetching from Wikidata LDF Endpoint, added: Wikidata-Query-Service; removed Discovery-Search.
Gehel triaged T381091: When using sort=create_timestamp_desc or _asc, dates should be page creation dates as Medium priority.
Gehel moved T380553: Search page uses Codex markup without appropriate style pack from needs triage to UI tickets on the Discovery-Search board.
Gehel moved T380984: whitelist kg.kunsten.be from Incoming to Operations/SRE on the Wikidata-Query-Service board.
Fri, Nov 29
Fri, Nov 29
Gehel edited projects for T378030: Q2:rack/setup/install wdqs102[567], added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T364363: [Epic] Productionize federated wdqs graph-split endpoints, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T367409: Measure the time to recovery of WDQS after the graph split, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T374021: Make WikibaseQualityConstraints use split-graph query service, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T376151: Cutover wdqs-internal to new split endpoints, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T380555: Enable LVS for wdqs-internal-[main,scholarly], added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T380608: Address categories migration for internal graph split endpoints, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T193473: Add HTTPS support to wdqs-internal service, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T380556: Provision VIPs for wdqs-internal-[main,scholarly], added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T380752: Migrate Relforge to Opensearch, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T349481: Create SLI/SLO for search index inconsistencies, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T374922: Bring an-conf100[4-6] into service to replace an-conf100[1-3], added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T377134: Create and distribute a flink base image with flink 1.20.0, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T374916: Port Categories lag / ping checks to Prometheus/Alertmanager, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T371080: Port disk space check for hadoop worker to Alertmanager, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T380258: Create an Airflow instance for ML, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T377266: DSE kubernetes namespace for llm-inference, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).
Gehel edited projects for T375252: Document which stat server to use for which workflow, added: Data-Platform-SRE (2024.11.30 - 2024.12.20); removed Data-Platform-SRE (2024.11.09 - 2024.11.29).