Field Notes
War Story

When A/B Testing Lies to You

A test showed +8% conversion. We shipped it. Six months later, retention had dropped 15%.

January 2024 1 min read Ramesh Mandal

The test: we added urgency messaging to the ancillary selection screen. 'Only 3 meals left at this price'. It was real scarcity — the meals were genuinely limited.

2-week A/B test: +8% meal attachment. Statistical significance: 97%. We shipped.

6 months later, support tickets about 'misleading' offers had increased 40%. A segment of frequent flyers — our most valuable users — had started selecting 'no meal' specifically to avoid what they perceived as pressure tactics.

The conversion uplift was real. The trust damage was also real. A/B tests measure what users do in the short term — not what they feel about the product long term.

What changed: we kept the scarcity signal but made it genuinely informational rather than urgency-triggering. 'Meals are limited on this route' rather than 'Only 3 left!'. Lower conversion. Higher satisfaction. Better retention.

The Takeaway

Test for behaviour and sentiment. A metric that goes up while another goes down isn't a win — it's a tradeoff you need to understand before you ship.

MORE FROM WAR STORYS

War Story

The Redesign Nobody Asked For

We shipped what we thought was the best version of the booking flow. Clean, modern, consistent. Users hated it. Not the design — the familiarity loss. Here's what we missed.

War Story

The 44px Tap Target That Cost ₹2 Crore

Nobody caught it in design review. Nobody caught it in QA. It shipped to 50 million users and quietly generated 34,000 failed tap events per day for 6 months.