On-call triage: my AI feature is acting up
A fast triage walkthrough for when an LLM feature starts misbehaving while you're on call.
engineering
You are a calm senior engineer pairing with someone on call. Their AI feature is misbehaving right now. Walk them through triage fast.
From the symptom below, output:
1. **First 60 seconds:** the one check to run immediately (is it us, the provider, or the input?) and how.
2. **Narrow it down:** 3-4 ordered checks, each phrased as a question with the command/dashboard to answer it.
3. **Stop the bleeding:** the safest immediate mitigation given the symptom (flag off, fallback model, lower limits, cached response).
4. **Don't do this:** the tempting action that usually makes it worse.
Be concrete and decisive. No "it depends" without saying what it depends on.
SYMPTOM (what's happening, since when, what changed):
"""
{{symptom}}
"""
Where this leads
This is the free, self-serve side of the Build & Run offer.
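To use the prompt from a script rather than pasting it by hand, the only step needed is filling the `{{symptom}}` placeholder. The sketch below is one way to do that in Python, under a few assumptions: `TRIAGE_PROMPT` is an abbreviated copy of the prompt (paste the full text in practice), `render_triage_prompt` is a hypothetical helper name, and the sample symptom is made up for illustration. It does not call any model API; it only builds the string to send.

```python
# Abbreviated copy of the triage prompt. In practice, paste the full text,
# including the numbered output sections.
TRIAGE_PROMPT = '''You are a calm senior engineer pairing with someone on call. Their AI feature is misbehaving right now. Walk them through triage fast.
SYMPTOM (what's happening, since when, what changed):
"""
{{symptom}}
"""'''


def render_triage_prompt(symptom: str, template: str = TRIAGE_PROMPT) -> str:
    """Fill the {{symptom}} placeholder (hypothetical helper, not from the original page)."""
    cleaned = symptom.strip()
    if not cleaned:
        raise ValueError("symptom must not be empty")
    # The prompt delimits the symptom with triple double quotes, so swap out
    # any that appear inside the symptom to keep the delimiters unambiguous.
    cleaned = cleaned.replace('"""', "'''")
    return template.replace("{{symptom}}", cleaned)


if __name__ == "__main__":
    # Made-up symptom covering the three things the prompt asks for:
    # what is happening, since when, and what changed.
    print(render_triage_prompt(
        "Summaries come back empty for about 1 in 5 requests, "
        "since 14:00 UTC, shortly after we shipped a system prompt change."
    ))
```

A symptom that states what is happening, since when, and what changed gives the model the same three facts the prompt's SYMPTOM line asks for, which is what lets it choose between "us, the provider, or the input" in the first check.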