Issue 09 · Jun 04, 2026/ Tactics

Automated QA for 400+ ad variations, without hiring a single contractor

The 'middle manager' prompt pattern that catches off-brand creative before it ever ships. Now running across three Fortune 500 brand teams.

By Kira Tan · Tactics Editor
Read · 9 min
Automated QA for 400+ ad variations, without hiring a single contractor

Brand teams shipping 400+ creative variations a quarter have a quiet problem: human QA doesn't scale, but a single off-brand asset in market is worse than three approved-and-mediocre ones. Most teams either over-staff QA or under-ship volume. There's a third option.

The 'middle manager' pattern uses a single Claude or GPT-4 evaluator with three things wired into its system prompt: the brand voice charter (verbatim), the do-not-say list (verbatim), and three reference assets that are explicitly on-brand and three that are explicitly off-brand. The evaluator returns a pass/flag/fail with a one-sentence rationale.

Pass rate in our pilots: 71% of variations approved without human review. Of the 29% flagged, roughly 60% were genuine catches and 40% were over-cautious, both an acceptable failure mode for a system that costs roughly $0.04 per asset to run.

The deployment lift is small. Three Fortune 500 brand teams now run this in production. Two of them retired a contractor row from the budget. The third re-routed the contractor to higher-leverage strategic work.

The prompt template, the do-not-say list format, and the reference-asset selection criteria are all in the appendix.

"The model isn't the creative director. It's the middle manager who reads the brief twice and says 'this isn't quite right' before the work hits the CD's desk."

// Forward this brief
Next issue · Filing Thursday 6am ET

Don't read this one second-hand.
Get the next one first.

One brief a week. Intel, tactics, and career defense, straight to your inbox before the rest of LinkedIn even knows it happened.