Browse articles that include the production tag
How we improved on-call life by reducing pager noise
Too many pages? Here's how we tackled on-call SRE quality of life by grouping alerts by service and only paging on downstream services.
How we share SLIs across engineering departments
The Scalability team collaborates with the Development department on SLIs. The first post in this series explains how we made the available information accessible to development groups.
Ruby 2.7: Understand and debug problems with heap compaction
An overview of Ruby 2.7 heap compaction and the risks it adds to production Rails applications.
This SRE attempted to roll out an HAProxy config change. You won't believe what happened next...
This post is about a wild discovery made while investigating strange behavior from HAProxy. We dive into the pathology, describe how we found it, and share some investigative techniques used along the way.
Automation check-in and rate limit changes on GitLab.com
GitLab is making some changes to our rate limits on GitLab.com starting in January 2021.
How to make Docker Hub rate limit monitoring a breeze
Docker Hub rate limits are now enforced, so we need ways to monitor the remaining pull requests. Explore some ways to create a monitoring plugin for Nagios/Icinga/Sensu/Zabbix and test-drive a new Prometheus exporter in combination with Grafana.