The graveyard of internal AI tools is large and growing. A developer builds a Slack bot that summarizes meeting notes. It works. For two weeks, everyone uses it. Then it starts summarizing incorrectly on longer meetings, nobody fixes it, and the team quietly goes back to reading the notes themselves.

This pattern repeats because building an internal AI tool and building one people actually adopt are different problems. The technical part is usually the smaller one.

Start With a Problem That Has a Named Victim

The internal tools with the highest adoption rates solve a problem that a specific person complains about out loud. Not "we could use AI to improve X" but "Maria spends four hours every Friday compiling the weekly status report and hates it." Maria is the named victim. Build for Maria.

Unnamed victims produce unused tools. "The team could save time with AI-assisted code review" fails because nobody owns the problem. When the tool has rough edges, nobody advocates for fixing them. It quietly dies.

The Adoption Trap: Building What You Think People Want

The fastest way to build an internal tool nobody uses is to design it in isolation and launch it as a completed product. The team didn't ask for it, the workflow it assumes doesn't match how people actually work, and by the time feedback comes in, the builder has already moved on to the next project.

Build the first version in a week. It should be rough. Give it to the named victim before it's polished. Watch them use it. The things they do that you didn't expect are more valuable than any feature you planned.

The Evaluation Problem Nobody Plans For

Internal AI tools fail silently. A summarization tool that produces plausible-sounding but inaccurate summaries is worse than no summarization tool — the team makes decisions based on wrong information and doesn't realize it. An AI-assisted code review that misses real bugs while flagging style issues trains engineers to ignore the alerts.

Every internal AI tool needs an evaluation layer before it touches real work. This doesn't have to be complex. For a summarization tool, take 20 real documents, have a human produce the correct summaries, and measure accuracy against them. For a code review tool, collect false positive and false negative rates from the first month of production use. Without this, you're flying blind.

Maintenance Is Not Optional

LLM behavior changes when models update. The prompt that worked on model version N may produce different outputs on model version N+1. Without a testing suite and an owner who monitors the tool, model updates break internal tools silently.

Name an owner before launch. Not just the person who built it — the person who is responsible for it working correctly in three months. If you can't name that person, the tool will be unmaintained within a quarter.

The Rollout That Works

The internal tools with the best adoption follow the same rollout pattern: one user for two weeks, five users for two weeks, whole team. Each phase surfaces a different class of problem. The one-user phase surfaces fundamental workflow mismatches. The five-user phase surfaces edge cases. The full team launch surfaces scale and reliability issues.

Skipping to full team launch because the demo looked good is the most common mistake. The demo was built against controlled inputs. Production users have surprising inputs.

What Success Looks Like

A successful internal AI tool is used daily, without reminders, by the people it was built for. The usage is visible in logs or analytics. When the tool has a problem, someone reports it and it gets fixed. If none of those three things are true after 90 days, the tool isn't successful regardless of how good the technical implementation is.

Build for adoption, not for completion.