AI in the UGC loop, part 3 — moderation: the layer most teams skip
Brand-safe is not the same as profanity-filtered. The difference is catching the competitor’s logo in the background, not just the swear word. Here is the three-tier moderation queue every UGC programme should run.
Most UGC moderation programmes look like this: a profanity filter set up once and never tuned, a "report this" button on the gallery clicked twice a year, and a Slack channel where someone occasionally posts "did anyone approve this clip?" That is not a moderation programme. That is a hope.
Real brand safety is the difference between catching the swear word and catching the competitor’s logo in the background of the unboxing video that just went live on the homepage. The first is table stakes. The second is what costs you a conversation with the CMO. This is the layer most teams skip — and the day job taught me that the unsexy layer is usually the one that matters.
What "brand-safe" actually means now
The list of things a moderation layer has to catch has grown long:
- Profanity, hate speech and slurs — the classic filter, still necessary, no longer sufficient.
- Faces of minors — almost every brand’s policy says no, almost no brand actively detects it.
- Competing brand logos and packaging in the background of a clip.
- Unsafe product claims — "cleared my acne in three days" on a beauty PDP is a regulatory issue.
- Copyrighted music, especially in clips repurposed across platforms.
- Sensitive context where the brand association is simply wrong.
- Quality signals — vertical-only, low-light, unstable handheld footage that looks bad on a PDP regardless of legality.
A profanity filter catches one of those. The rest need vision, language understanding and brand context. That is the AI part. But the AI alone is not enough — what makes a moderation programme work is the queue model around it.
The three-tier moderation queue
The single biggest mistake brands make is treating moderation as binary: approved or rejected. Real moderation is three-tier, because the cost of a wrong decision differs wildly across categories.
Tier 1 — hard auto-reject
The unambiguous stuff: clear profanity, faces of minors, high-confidence competitor logos, copyrighted music with a strike. The model rejects, the asset never reaches a human, and the creator gets a polite templated rejection with the reason. No human time spent.
Tier 2 — soft flag, human review
The grey area: "probably a minor, low confidence", "possible competitor product, partial occlusion", "claim language that might be regulated". This is where human judgement matters, and where the AI’s job is to surface the asset with the specific concern timestamped, so the reviewer is not re-watching the whole clip hunting for what is wrong. A good Tier 2 review is 30 to 60 seconds.
Tier 3 — auto-approve, sample audit
The clean stuff goes live. But — and this is the tier most brands forget — you still sample 5 to 10% of Tier 3 for a weekly human audit. Not to catch escapes, but to catch model drift before it becomes a problem.
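The three tiers can be sketched as a routing function over per-category confidence scores. The thresholds below are illustrative placeholders, not benchmarks; in practice they come from tuning against the Tier 3 audit results.

```python
# Route an asset into a tier from its per-category model confidences.
# Thresholds are illustrative; real values come from audit-driven tuning.
HARD_REJECT = {"minor_face": 0.90, "profanity": 0.95, "competitor_logo": 0.90}
SOFT_FLAG   = {"minor_face": 0.40, "profanity": 0.60, "competitor_logo": 0.50,
               "claim_language": 0.50}

def route(findings: dict[str, float]) -> str:
    """findings: category -> model confidence for one asset."""
    if any(findings.get(c, 0.0) >= t for c, t in HARD_REJECT.items()):
        return "tier1_auto_reject"    # unambiguous: templated rejection
    if any(findings.get(c, 0.0) >= t for c, t in SOFT_FLAG.items()):
        return "tier2_human_review"   # grey area: queue with timestamps
    return "tier3_auto_approve"       # clean: goes live, sampled weekly

print(route({"competitor_logo": 0.62}))   # → tier2_human_review
print(route({"minor_face": 0.97}))        # → tier1_auto_reject
print(route({"profanity": 0.10}))         # → tier3_auto_approve
```

Note the asymmetry: a category like "faces of minors" gets a low soft-flag threshold precisely because the cost of a miss is so much higher than the cost of a human review.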
In a healthy programme, the mix across tiers looks roughly like this:

| Share of assets | Tier | Typical content |
|---|---|---|
| 50–70% | Tier 3 · auto-approve | Clean, high confidence |
| 15–30% | Tier 2 · human review | Grey area, judgement needed |
| 5–15% | Tier 1 · auto-reject | Unambiguous violations |
The SLAs that make it work
A moderation queue without SLAs is a queue that grows. Set them, post them in the channel, report on them weekly.
| Tier | Target SLA | Why |
|---|---|---|
| Tier 1 reject | Under 1 minute | Creator gets feedback while the upload is still fresh |
| Tier 2 review | Under 4 working hours | Asset is still timely when it goes live |
| Tier 3 audit | Weekly batch | Trend monitoring, not per-asset latency |
“Sub-4-hour human review is the threshold that makes UGC feel live rather than delayed. Slower than that, and you lose the trend value of the creator’s original post.”
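A minimal breach check over those SLA targets might look like the following. It is a sketch: it uses naive wall-clock deltas, whereas the Tier 2 target above is in working hours, which would need a business-calendar library in practice.

```python
from datetime import datetime, timedelta

# SLA targets from the table above. Tier 3 is a weekly batch audit,
# so it is tracked on cadence, not per-asset latency.
SLA = {"tier1": timedelta(minutes=1), "tier2": timedelta(hours=4)}

def breached(tier: str, enqueued: datetime, resolved: datetime) -> bool:
    """True if the asset's time-in-queue exceeded the tier's SLA.
    Naive wall-clock; a real check would count working hours only."""
    return resolved - enqueued > SLA[tier]

t0 = datetime(2025, 6, 2, 9, 0)
print(breached("tier2", t0, t0 + timedelta(hours=5)))     # → True
print(breached("tier1", t0, t0 + timedelta(seconds=40)))  # → False
```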
The numbers to track
You need three, not one. Tier 2 P75 review latency in hours — the health-of-queue metric. Escape rate — of assets that went live, how many were later flagged as a miss. And reject-overturn rate — of the assets the model auto-rejected, how many a human would have approved. A high overturn rate means the model is too aggressive and you are throwing away good content.
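All three metrics are a few lines to compute. The numbers below are hypothetical weekly data, included only to show the arithmetic:

```python
import statistics

# Hypothetical week of data; the formulas are what matters.
review_hours = [0.5, 1.2, 2.0, 3.1, 3.8, 5.0, 6.5]  # Tier 2 review latencies
p75 = statistics.quantiles(review_hours, n=4)[2]    # P75 review latency (hours)

live, later_flagged = 1200, 6
escape_rate = later_flagged / live                  # misses that went live

auto_rejected, human_would_approve = 150, 30
overturn_rate = human_would_approve / auto_rejected  # over-aggression signal

print(p75)                                  # → 5.0
print(escape_rate, overturn_rate)           # → 0.005 0.2
```

An overturn rate like the 20% above would be a red flag: one in five auto-rejections was good content the brand never saw.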
Three things to do this quarter
1. Write a one-page moderation policy. Not a 40-page legal doc. One page listing what is auto-reject, what is soft-flag, what is auto-approve. If it does not fit on a page, your reviewers cannot apply it consistently.
2. Set the three SLAs above and post them where the moderation team can see them. Report weekly.
3. Run a Tier 2 review queue, even manually for the first month. The queue model itself is the unlock. AI just makes it scale.
Last in the series: part 4, personalisation — why "newest first" leaves conversion on the table, and the maturity ladder to 1:1 matching. The product view of this stage is the Creator Review page.
Sources + note on numbers
- Bazaarvoice — content moderation and authenticity research — UGC moderation and fraud-signal benchmarks.
- TINT — State of User-Generated Content — moderation practice survey across marketers.
- Note on numbers — the three-tier mix percentages are representative healthy ranges consolidated from the sources above and Idukki's product experience. They are not verbatim customer-measured averages.