Rev Vaughn
Articles

How AI-Generated Content Is Destroying Trust

And The Mathematical Solution No One’s Talking About

Apr 2, 2026 · 8 min read

Make this sound more human

Three numbers. That is what I am asking of you before I explain everything wrong with AI writing.

Let’s lead with proof, not context. Proof is the only order that earns your attention on a subject this consequential.

Number one: 4.7

That is the average density of antithesis structures per 1,000 words in AI-generated marketing copy. The “not X, but Y” construction you have seen in every AI-produced email, post, or proposal that crossed your desk this week. The human baseline, measured across 50,000 words of good professional copywriting, is 1.2 per thousand.

AI writes this pattern nearly four times more than the rate of a trained human writer. Readers feel the excess at 2.0 per thousand. They consciously register “something is robotic here.” At 5.0, they might not be able to name what they are detecting, but they stop reading.

Number two: 18 to 22

That is the density of copula constructions per 1,000 words in AI-generated copy. “It is.” “This is.” “There is.” “Here is.” Passive structures that strip agency and leave writing feeling hedged, uncertain, bureaucratic. The human baseline: 3.5 per thousand words.

AI writes copula patterns at five to six times the rate of a professional human writer.

Number three: 8 to 12

That is how many consecutive sentence fragments AI clusters before registering that the technique has exhausted itself. Human writers use 2 to 3, then back off. One 500-word AI post I analyzed had 14 fragments in the first 100 words: a density of 140 per thousand. That’s a telegram, not prose.

Three patterns. Each one operating below conscious detection. Each one experienced by readers as “something is off” without the ability to name exactly what. Each one measurable, thresholdable, and fixable with the right architecture.

These three numbers share a single root cause. And understanding that root cause is why the obvious fix does not work.

The 1948 Theorem That Explains Why Your AI Copy Sounds Wrong

In 1948, Claude Shannon published A Mathematical Theory of Communication and founded the discipline of information theory. His central insight was not about content but about redundancy: every communication channel has a finite capacity for meaningful signal, and when patterns repeat beyond a threshold, signal-to-noise ratio collapses. The receiver’s ability to extract meaning degrades proportionally.

Shannon was describing telegraph systems. The principle holds for AI-generated marketing copy with equal mathematical precision.

AI writing has a redundancy problem.

It produces patterns so densely repeated that a reader’s pattern-recognition system flags them before the conscious mind can evaluate the argument. The channel is saturated.

I was managing marketing for two brands simultaneously. One promotion for a new offer required 172 separate emails and 58,000 words of copy. That’s the length of The Great Gatsby, and I needed to produce it in weeks, not months.

No one writes 58,000+ words by hand at that pace without compromising quality. So I used AI. It delivered the volume, the consistency, and the speed that made the project possible.

It also delivered copy that, across every email, sounded like the same machine had written it all.

The copy was polished, competent, and completely invisible.

When you’re supporting brands doing big launches, “invisible” quickly becomes a revenue problem. Email open rates crater. Click rates slip. A proposal gets skimmed in thirty seconds and forgotten. The pipeline doesn’t move, and you never see the moment it stopped.

I started hunting for the mechanism.

I collected samples: AI-generated marketing emails, blog posts, and LinkedIn content. I compared them against human-written copy — mine and other professional copywriters’ content before AI took hold.

I found over 20 distinct patterns that AI overuses. A few of them account for 80% of the effect readers describe as “robotic” or “off,” but they can’t say exactly why.

The problem is never the presence of a pattern. It is the density.

You already have the three numbers. What the numbers do not show is how they interact. Antithesis dilutes your emphasis. Copula saturation makes your prose feel passive and hedged. Fragment clustering exhausts the reader before the argument lands. Together, they create a compounding effect: the writing feels simultaneously over-structured and uncertain — which is precisely the so-called “uncanny valley” readers cannot articulate but cannot ignore.

Solutions That Don’t Work

The obvious response to this diagnosis is better prompting.

“Don’t sound robotic. Vary your sentence structure. Write like Malcolm Gladwell on crack cocaine.”

I spent weeks testing this. It worked for a paragraph. Occasionally two. Then the patterns returned, with the reliability of the tide coming in.

Here’s why. It requires understanding what AI models actually do.

A language model doesn’t write. It predicts. At every position in a sentence, it samples from a probability distribution over possible next words — a distribution shaped by hundreds of billions of training examples. Every word is a weighted selection from that distribution, resolved toward the most statistically probable outcome given everything that came before it.

“Not only X” triggers “but also Y” with 87% probability. Not because the model evaluated that structure as effective rhetoric and chose it. Because that sequence appeared hundreds of thousands of times in training data, and high-frequency patterns produce high-confidence predictions.

Statistical confidence is what AI optimizes for. Average is the result.

When you prompt the model to “sound less robotic,” you stack three layers of probabilistic guessing. The underlying probability distributions don’t shift. Even sophisticated prompt engineering moves the needle perhaps 15 to 20%. Enough for one strong paragraph. Not enough to overcome patterns the model encountered 847,000 times.

You cannot fix probabilistic generation with more probabilistic generation.

The solution came from a direction I didn’t expect: manufacturing quality assurance.

A developer I worked with, who builds AI tools professionally, said something that reframed the entire problem:

“In software development, we learned this decades ago… Never let the compiler be its own debugger. Generation and evaluation are fundamentally different operations. When you make one system do both, you get mediocrity.”

W. Edwards Deming built the postwar Japanese manufacturing transformation on exactly this principle. Quality cannot be inspected into a product by the machine producing it. The production line and the inspection system are separate by design. Each optimizes for something different. Generation optimizes for completion. Evaluation optimizes for correctness.

The Hemingway Editor doesn’t write. It measures. Grammarly doesn’t create. It flags. The architecture is the point.

What AI-assisted writing requires is a quality gate that doesn’t guess. It counts.

Not “does this feel robotic?” — which is a probabilistic judgment subject to the same biases as the content it’s evaluating. But: what is the copula density? What is the antithesis rate per thousand words? How do these compare against the measured human baseline?

Math doesn’t tire at 2 AM, miss a pattern after six hours of editing, or rationalize keeping a phrase you worked hard on.

How the measurement works in practice

Copula patterns carry a threshold of 5.0 instances per thousand words. A chapter I considered clean (1,120 words) had 16 instances. That’s 14.3 per thousand. Excess over threshold: 9.3.

Antithesis structures carry a threshold of 2.0 per thousand.

All thresholds normalize to document length, so a 500-word piece and a 2,000-word piece are held to the same density standard. Short pieces don’t get false passes. Long pieces don’t get buried in flags.

Critically, the scanner doesn’t automatically fix violations. It presents numbered instances with line references and surrounding context. You decide which to keep as intentional rhetoric and which to remove as pattern noise. The two antithesis structures that are your emphasis — you keep them. The eight diluting their impact — those get rewritten.

Intentional craft is preserved. Accidental repetition is eliminated.

This is the distinction that matters. The goal is not zero antithesis. The goal is 2.0 per thousand or less instead of 4.7. Which means the instances you keep land with the force they were designed to carry.

Results from the 172-email campaign

Before the scanner: Grammarly caught 35% of density violations. Manual review took over 14 hours. Every decision about what sounded robotic was made on feel — which is to say on the same probabilistic judgment that created the problem in the first place.

After: the scanner caught 94% of violations. The review took less than 3 hours. Every decision was grounded in a number.

  • Click rates up 23%.
  • Replies up 41%.
  • The offers stayed the same. The writing stopped pattern-matching to every other AI campaign in the reader’s inbox.

The speed was retained. The signatures were removed.

New models won’t solve this — and here’s why

But what if I did it again

New models are coming. They will be faster, more capable, more sophisticated in every measurable dimension. Will the redundancy problem improve? No.

AI language models are fundamentally probabilistic. That is not a bug. It is how they work. They train on billions of documents, learn the statistical distributions of language in that corpus, and generate text by sampling from those distributions. The mean of the corpus, pulled toward good enough rather than distinctive.

Shannon’s insight about redundancy applies here too, in a form he didn’t anticipate: a communication channel that trains on its own output degrades signal over time. Current estimates place 30 to 40% of new web content as AI-generated. That figure is rising. The next generation of models will train on increasingly AI-generated text. That means they will learn AI patterns as the baseline for human writing. The homogenization accelerates.

Unless the architecture changes fundamentally, we will always need an external evaluation layer. The external evaluation layer is permanent infrastructure in the stack.

The new workflow is not complicated:

  • Probabilistic generation: AI creates the draft, handles the volume, does the heavy lifting.
  • Deterministic evaluation: The scanner measures density against human baselines.
  • Human judgment: You decide which patterns serve the argument and which dilute it.
  • Iterative refinement: Re-scan until it passes the quality gate.
  • Automate only after confidence is high.

Separate creation from inspection. Don’t ask the generator to evaluate itself. Build the quality gate into the process — not as an afterthought, but as a permanent layer in the stack.

In manufacturing, removing the quality scanner from the assembly line means shipping defective products. Defective in writing means invisible. The content lands in the same forgettable pile as every other AI-generated piece in the reader’s inbox. Trust erodes. You never see the moment it happened.

Build it as an afterthought, and you ship defective product.

Warm regards — and always, better words and bigger returns,
Rev

Work with me

Find your hidden bottleneck.

A 60-minute Quick Win Consult to pinpoint what is blocking growth and remove it fast.

Book a Quick Win Consult →