Skip to content
Chapter 07 · 11 min

Leading an AI initiative

You understand the value, the build-vs-buy call, the costs, the risks, and the governance. This final chapter is how to actually lead an AI initiative to results — the team, the sequencing, the metrics, and the questions that separate a leader who gets value from one who funds expensive theatre.

Don't bet the company on one moonshot. Plant a row of seeds, water the ones that sprout.

Start small, prove it, then scale

The pattern that works is unglamorous: pick one high-value, feasible task; ship a real, measured solution to production; learn what it actually took; then use that capability and credibility to take on the next one. The pattern that fails is the company-wide AI transformation announced before anyone has shipped a single working feature.

Small first isn't timidity — it's how you build the muscles you'll need for the big bets: the eval discipline, the data plumbing, the realistic cost intuition, the trust of skeptics. A team that has shipped one AI feature to production knows things no amount of strategy decks can teach, and they earn the right to attempt the hard problems.

Measure the outcome, not the activity

The trap is measuring AI activity — "we ran 40 experiments," "adoption is up," "we use AI in five departments" — none of which is value. Tie every initiative to a business outcome you'd care about even if AI didn't exist: faster resolution, lower cost per task, higher conversion, fewer errors. If you can't name the outcome metric, you don't have a project, you have a science fair.

The team you need

You need less specialised AI talent than the hype implies. The load-bearing skills for most AI products are strong software engineering, good data work, and product judgment — not deep machine-learning research. A capable engineering team can build excellent AI products on top of existing models; you rarely need to hire a research scientist to use an API well.

What you do need is someone who genuinely understands the failure modes — who treats evals as non-negotiable and the model as a flaky component to be engineered around, not a magic box. That mindset matters more than a particular credential. And for capabilities you decide to buy or partner on, you need the judgment to evaluate vendors well, which is the rest of this section.

Questions to ask your engineers

You don't need to read the code, but a few questions reliably separate a project on solid ground from one on sand. Asking them signals you understand what matters, and the quality of the answers tells you a lot:

  • "How do we know it works — what's our eval set, and what does it score?" (No eval set is a red flag.)
  • "What happens when the model is wrong, slow, or down?" (There should be a real answer.)
  • "What's the fully-loaded cost, including data, evals, and ongoing operations?" (Not just the API bill.)
  • "Where does our data go, and what's our exposure?" (Especially for customer data.)
  • "What can a malicious user make this do?" (The security question.)
  • "What's the plan if our model provider changes prices or deprecates the model?" (The lock-in question.)

Where to go from here

You now have the decision-maker's complete picture: cut through the hype, find the value, decide build-vs-buy, budget for reality, manage the risk, govern it, and lead it to measured results. If you want to go deeper on any layer, the fundamentals course explains how the technology works, Building with AI covers the engineering, and AI Security covers the threats. And when you want a partner who builds exactly this way — small, measured, secure, and honest about cost — that's what SDEN does.

In one line each

  • Start small: ship one high-value, feasible task to production, learn what it took, then scale on real capability and credibility.
  • Measure business outcomes, not AI activity — if you can't name the outcome metric, it's a science fair, not a project.
  • You need strong engineering, data, and product judgment more than ML research talent — plus someone who respects the failure modes.
  • Lead by asking the right questions: evals, failure handling, fully-loaded cost, data exposure, security, and provider lock-in.