Event bus — incident playbook
Section titled “Event bus — incident playbook”Drill Plan (Pre-Launch)
Section titled “Drill Plan (Pre-Launch)”- Schedule a table-top or live drill simulating DLQ growth and delivery failures.
- Participants: on-call engineer, platform owner, optional DBA.
Drill Evidence
Section titled “Drill Evidence”- Link to postmortem or drill notes.
- Screenshots or exported metrics from the drill window.
Response (live incident)
Section titled “Response (live incident)”- Triage: DLQ depth, error logs, recent deploys.
- Mitigate: pause producers if necessary; use site-tools DLQ replay/discard with care.
- Communicate status per internal incident process.