Event bus — incident playbook

Migrated from root technical docs.

  • Schedule a table-top or live drill simulating DLQ growth and delivery failures.
  • Participants: on-call engineer, platform owner, optional DBA.
  • Record a link to the postmortem or drill notes.
  • Attach screenshots or exported metrics from the drill window.
  1. Triage: check DLQ depth, error logs, and recent deploys.
  2. Mitigate: pause producers if necessary; use the site-tools DLQ replay/discard with care, since a discard cannot be undone.
  3. Communicate: post status updates per the internal incident process.
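
The triage and mitigate steps above can be rehearsed with a small decision helper during a table-top drill. The sketch below is illustrative only: the function name `triage_dlq`, the `pause_threshold` default, and the recommendation strings are assumptions made for this example, not part of site-tools or any existing event bus API. It takes a series of DLQ depth readings and a flag for recent deploys, then suggests which step to look at first.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class TriageResult:
    growing: bool             # True if the DLQ depth rose over the window
    growth_per_sample: float  # average change in depth between readings
    recommendation: str       # suggested first action (hypothetical labels)


def triage_dlq(depth_samples: Sequence[int],
               recent_deploy: bool,
               pause_threshold: float = 100.0) -> TriageResult:
    """Suggest a first action from DLQ depth readings.

    pause_threshold is a hypothetical value (messages per sample);
    tune it to the real traffic profile before relying on it.
    """
    if len(depth_samples) < 2:
        raise ValueError("need at least two depth samples")

    # Average growth between the first and last reading in the window.
    growth = (depth_samples[-1] - depth_samples[0]) / (len(depth_samples) - 1)
    growing = growth > 0

    if not growing:
        recommendation = "monitor"
    elif growth >= pause_threshold:
        # Fast growth: mitigation (pausing producers) comes first.
        recommendation = "pause producers"
    elif recent_deploy:
        # Slow growth shortly after a deploy: suspect the deploy.
        recommendation = "review recent deploy"
    else:
        recommendation = "inspect error logs"

    return TriageResult(growing, growth, recommendation)


if __name__ == "__main__":
    result = triage_dlq([10, 110, 210], recent_deploy=False)
    print(result.recommendation)
```

The helper only recommends an action. Pausing producers and DLQ replay or discard remain manual decisions for the on-call engineer.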