
On-call triage: my AI feature is acting up

A fast triage walkthrough for when an LLM feature starts misbehaving while you're on call.

engineering
You are a calm senior engineer pairing with someone on call. Their AI feature is misbehaving right now. Walk them through triage fast.

From the symptom below, output:
1. **First 60 seconds:** the one check to run immediately (is it us, the provider, or the input?) and how to run it.
2. **Narrow it down:** 3-4 ordered checks, each phrased as a question with the command/dashboard to answer it.
3. **Stop the bleeding:** the safest immediate mitigation given the symptom (flag off, fallback model, lower limits, cached response).
4. **Don't do this:** the tempting action that usually makes it worse.

Be concrete and decisive. No "it depends" without saying what it depends on.

SYMPTOM (what's happening, since when, what changed):
"""
{{symptom}}
"""

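The prompt above takes a single input, the `{{symptom}}` placeholder between the triple-quote delimiters. As a minimal sketch of filling it in before sending the prompt to a model, the Python below does the substitution with plain string replacement. The names `render_prompt` and `TEMPLATE_TAIL` are hypothetical, only the tail of the prompt is reproduced to keep the example short, and the handling of empty input and embedded triple quotes is an assumption about what is useful, not part of the original prompt.

```python
PLACEHOLDER = "{{symptom}}"
FENCE = '"""'

# Only the last lines of the prompt are reproduced here (assumption:
# in practice you would paste the full prompt text instead).
TEMPLATE_TAIL = (
    "SYMPTOM (what's happening, since when, what changed):\n"
    + FENCE + "\n" + PLACEHOLDER + "\n" + FENCE + "\n"
)


def render_prompt(template: str, symptom: str) -> str:
    """Return the template with the symptom description substituted in."""
    if PLACEHOLDER not in template:
        raise ValueError("template has no {{symptom}} placeholder")

    cleaned = symptom.strip()
    if not cleaned:
        raise ValueError(
            "symptom is empty; say what is happening, since when, and what changed"
        )

    # A literal triple double-quote inside the symptom would close the
    # delimiter early, so swap it for triple single quotes.
    cleaned = cleaned.replace(FENCE, "'''")
    return template.replace(PLACEHOLDER, cleaned)


if __name__ == "__main__":
    print(render_prompt(
        TEMPLATE_TAIL,
        "p95 latency tripled since 14:05 UTC; new prompt template deployed at 14:00",
    ))
```

Rejecting an empty symptom up front keeps the model from triaging nothing, and the three parts the prompt asks for (what's happening, since when, what changed) are worth filling in even when time is short.
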
Where this leads

This is the free, self-serve side of the Build & Run offer.
