From stalled pilot to $43M annual ROI and 95% accuracy

Impact

$43M annual ROI

95% accuracy (up from 54%)

90% of complex cases resolved in <24 hrs

The challenge

This top-five global telco aimed to evolve its internal billing co-pilot into a customer-facing chatbot capable of serving its global customer base. However, the project stalled at 54% accuracy because data blind spots and reasoning errors kept blocking the launch.

The solution

Snorkel used cutting-edge data development frameworks to embed the telco’s subject-matter expertise directly into the GenAI application. This included building a rigorous evaluation workflow and applying data development acceleration techniques that scaled human-in-the-loop expertise, enabling the application to accurately resolve even the most complex edge cases.

The outcome

In just a few months, the team drove model accuracy from 54% to 95%, enabling a successful global rollout to 80,000 daily customers. This high-performance system now resolves 90% of the most problematic billing cases in under 24 hours, delivering $43 million in annual ROI.
