Site Reliability

7 deep dives

SRE advanced

Mastering Self-Healing Systems in Distributed Architectures

In today's complex distributed systems, failures aren't just possible—they're inevitable. Self-healing systems represent...

advanced cloud

SRE beginner

Slack’s Outage, SRE’s Secret Sauce, and the Journey to Reliability

It was May 12, 2020—the day Slack faced its first major outage in years. A rollout of a database configuration change sp...

sre reliability

SRE beginner

Toil, Triumph, and the 80% MTTR Turnaround: A Lowe's SRE Journey

In the middle of a raging ecommerce surge, Lowe’s faced a looming reliability crisis. Google’s SRE playbook helped them ...

sre reliability

SRE intermediate

When Bursts Hit the Pipeline: How to Rein in Backlog Without Sacrificing Delivery

Picture this: Airbnb’s Mussel store slams into a traffic spike, reads and writes explode, and a backlog starts piling up...

sre

SRE intermediate

From Wayfair to Your Stack: A Real-World Journey Through Per-Region Traffic Shaping

Many developers discover that global systems aren’t just about capacity; they’re about isolation. When one region stumbl...

sre

SRE intermediate

The 500ms Crisis: When Your Error Budget Runs Out and Your CEO Wants a New Feature

It was 3 AM when the pager went off. Your SLO for API response time is 99.9% with a 500ms threshold, but you're sitting a...

slo sli error-budget

SRE beginner

When Your API Is Up But Unusable: The 3AM Pager Story Every Developer Fears

It was 3 AM when the pager went off. Your API dashboard showed green lights everywhere, but customers were screaming abo...

prometheus grafana opentelemetry
