Video by Rasa via YouTube

🎙️ Most teams think the secret to better AI is better prompts.
Rebecca Evanhoe says that’s the wrong question.
At Slang AI, handling millions of restaurant phone calls every month means success isn’t measured by clever prompts—it’s measured by whether the system actually works.
The real differentiator?
✅ Rigorous evaluation
✅ Transcript reviews
✅ Human scoring
✅ Automated testing
✅ Continuous improvement
As Rebecca tells Lauren Goerz on The Dialogue Architects:
"The future belongs to teams that can evaluate AI, not just build it."
Watch the full episode to learn how enterprise teams are creating AI experiences users can trust.
#ConversationDesign #VoiceAI #EnterpriseAI #LLMEvaluation #SlangAI
(00:00) Introduction to The Dialogue Architects
(00:39) Meet Rebecca Evanhoe of Slang AI
(01:30) A Day in Product Management & Voice AI
(02:25) Why Conversation Design Still Matters
(03:29) Why LLM Evaluation Is Critical
(05:38) Is Conversation Design Dead?
(06:19) Prompt Engineering vs Conversation Design
(07:07) What Conversation Designers Actually Do
(10:01) User Advocacy, Research & Design Workshops
(12:22) Hybrid AI vs Fully Generative Systems
(16:31) When Deterministic Systems Beat LLMs
(19:31) AI Costs, Latency & Model Selection
(21:07) How Slang AI Evaluates LLM Performance
(25:41) Transcript Reviews That Improve AI
(26:34) Finding Root Causes Through Metrics
(27:54) CSAT vs Sentiment: What Matters More?
(31:07) Choosing the Right North Star Metrics
(31:59) The Biggest Challenges in Voice AI Design
(36:23) Multilingual AI & The Future Product Roadmap
(40:31) Turning Customer Calls Into Business Insights
(43:12) How AI Is Changing Human Communication
(46:46) Rebecca Evanhoe’s Advice for Conversation Designers
(47:51) Closing Thoughts & Where to Connect