Machine LearningNLPStartups

Building ML Models for Sales Performance: Lessons from RillaVoice

October 12, 2024·10 min

Building ML Models for Sales Performance: Lessons from RillaVoice

During my time at RillaVoice (a Bessemer-backed startup), I built an unsupervised topic algorithm for analyzing sales conversations. Here's how I achieved a 72% improvement in accuracy.

The Challenge

Sales conversations are messy:

Multiple languages and accents

Small, imbalanced datasets

Domain-specific terminology

Real-time processing requirements

Traditional topic modeling (LDA, LSA) failed because they rely on large, clean datasets.

The Solution Stack

I combined multiple NLP techniques:

1. BERT Embeddings

Contextual word representations

Better handling of sales jargon

Transfer learning from large corpora

2. Dialogue Act Tagging

Distinguish questions from statements

Identify objection handling

Track conversation flow

3. Part-of-Speech Filtering

Focus on nouns and verbs

Remove filler words dynamically

Preserve domain terms

4. Topic Coherence Measures

C_v score for evaluation

Automated hyperparameter tuning

A/B testing different approaches

Results & Impact

The improved model:

Processed Pepsi and Verizon sales calls

Generated custom performance suggestions

Ranked salesperson performance automatically

Scaled to multiple languages

Key Takeaways

**Domain expertise matters**: Understanding sales conversations was more valuable than ML theory

**Start simple**: Baseline models help you understand the problem

**Iterate quickly**: Small improvements compound

**Business impact > Academic metrics**: Focus on what drives value

This project taught me that the best technical solutions come from deep understanding of the business problem.

← View All Articles