SRE/Service Operations/Ownership
Service/Procedure | Description | Phabricator tag | Notes |
---|---|---|---|
WikiKube Kubernetes Cluster | Kubernetes (often abbreviated k8s) is an open-source system for automating the deployment and management of applications running in containers. | ||
MediaWiki servers | The application servers (or app servers) are the several hundred Apache servers that run the MediaWiki backend software (written in PHP). | ||
Memcached for MediaWiki | There are two logical pools of memcached servers for MediaWiki. They are critical for the performance of all sites and are used extensively. | ||
Redis Misc | Redis is used in Wikimedia production for: changeprop (role::redis::misc); as a cache and queue backend in ORES; and as the receiver of sampled profile data from PHP, as part of the sampling/profiling pipeline (Arc Lamp). | ||
Shellbox | Shellbox is a library for remote command execution, and a server for secure command execution. It was primarily implemented to sandbox lilypond (used by the Score extension) and provide a way for MediaWiki to utilize external binaries without needing them to be in the same container. Shellbox relies on Kubernetes (and Linux containers/namespaces) to provide isolation and resource limits for external commands. | ||
Datacenter Switchover | A datacenter switchover (from eqiad to codfw, or vice versa) comprises switching over multiple components, some of which can happen independently and many of which need to happen in lockstep. The switchover documentation covers all the steps needed to switch over from a master datacenter to another one, broken up by component. SRE Service Operations maintains the process and software necessary to run the switchover. | ||
Service Level Objectives | Service Level Objectives (SLOs) and Service Level Indicators (SLIs) | ||
Kafka-main | kafka-main is the low-volume, critical production services cluster. Talk to us before starting to send events there. kafka-main is currently used directly by Event_Platform/EventGate and change-propagation. |