Image

From stalled pilot to $43M annual ROI and 95% accuracy

Impact
$43M

Annual ROI

95%

Accuracy (up from 54%)

90%

of complex cases resolved in <24 hrs

The challenge

This Top 5 Global Telco aimed to evolve its internal billing co-pilot into a customer-facing chatbot capable of serving its global customer base. However, the project stalled at 54% accuracy due to data blind spots and reasoning errors that frustrated efforts to launch.

The solution

Snorkel used cutting-edge data development frameworks to embed the telco’s subject-matter expertise directly into the GenAI application. This included creating a rigorous evaluation workflow and data development acceleration techniques that scaled human-in-the-loop expertise, enabling the application to accurately resolve even the most complex edge case.

The outcome

In just a few months, the team drove model accuracy from 54% to 95%, enabling a successful global rollout to 80,000 daily customers. This high-performance system now resolves 90% of the most problematic billing cases in under 24 hours, delivering $43 million in annual ROI.

Image
Share this customer story

More customer stories

View all stories
Image
From hours to seconds on CLO contract review with 94% end user acceptance
A top 10 US bank manages CLO portfolios totaling billions in assets, each governed by contracts up to 500 pages.
global media company
Conversational, decision-grade
responses in 15 seconds
A global media intelligence firm analyzes hundreds of millions of sources daily – from public news, social, and broadcast to proprietary analyst-curated databases – to help large enterprise clients manage communications, reputation, and strategic decision-making. Their competitive advantage is the layer on top of publicly available data: in-house human editorial teams, proprietary scoring and analytics frameworks, and years of analyst judgment refined into decision-grade intelligence. When a crisis signal is building or a competitor’s narrative is gaining traction, speed and accuracy matter enormously. Historically, getting an answer meant waiting for a human analyst to manually aggregate across those sources: a process measured in hours, not seconds.
Image
Deploying production AI in <60 days to accelerate claims review 67%
A leading global firm transforming insurance subrogation operations with AI found that manual review processes capped their throughput to ~30% of available claims.
Image

For models that need to be right. Not just good enough.