AI Evals & Discovery – All Things Product Podcast with Teresa Torres & Petra Wille

Listen to this episode on: Spotify | Apple Podcasts

Building AI products isn’t just about clever prompts and orchestration; it’s about knowing whether what you’ve built actually works. In this episode, Teresa Torres and Petra Wille dive deep into AI evals: how they’re defined, why they’re essential, and how teams can implement them to ensure product quality.

Teresa shares her journey building her Interview Coach tool and the hard lessons she learned about evals along the way. From golden datasets and synthetic data to error analysis, code-based checks, and LLM-as-judge methods, you’ll walk away with a clearer picture of how to measure and improve AI products over time.

What you’ll learn in this episode:

  • What “evals” actually mean in the AI/ML world
  • Why evals are more than just quality assurance
  • The difference between golden datasets, synthetic data, and real-world traces
  • How to identify error modes and turn them into evals
  • When to use code-based evals vs. LLM-as-judge evals
  • How discovery practices inform every step of AI product evaluation
  • Why evals require continuous maintenance (and what “criteria drift” means for your product)
  • The relationship between evals, guardrails, and ongoing human oversight
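
To make the idea of a code-based eval concrete, here is a minimal sketch, not taken from the episode. It assumes a hypothetical `Trace` record, two made-up checks (`within_word_limit`, `has_suggestion`), and a tiny invented golden dataset; none of these names or examples come from Teresa's Interview Coach. Each check stands in for one error mode found during error analysis, and the pass rate shows how often the product avoids that error.

```python
from dataclasses import dataclass


@dataclass
class Trace:
    """One recorded input/output pair from an AI product (hypothetical shape)."""
    user_input: str
    model_output: str


def within_word_limit(trace: Trace, max_words: int = 50) -> bool:
    """Code-based check: the response stays under an assumed word limit."""
    return len(trace.model_output.split()) <= max_words


def has_suggestion(trace: Trace) -> bool:
    """Code-based check: the response offers a concrete alternative question.

    The phrase "try asking" is an illustrative stand-in for a real criterion.
    """
    return "try asking" in trace.model_output.lower()


# Each eval maps an error mode to a deterministic pass/fail function.
EVALS = {
    "within_word_limit": within_word_limit,
    "has_suggestion": has_suggestion,
}


def pass_rates(traces: list[Trace]) -> dict[str, float]:
    """Return the fraction of traces that pass each eval (traces must be non-empty)."""
    return {
        name: sum(check(t) for t in traces) / len(traces)
        for name, check in EVALS.items()
    }


# A tiny invented golden dataset, for illustration only.
GOLDEN = [
    Trace(
        "Would you use this feature?",
        "That question is speculative. Try asking about the last time they did this.",
    ),
    Trace("Tell me about your last purchase.", "Good story-based question."),
]

rates = pass_rates(GOLDEN)
print(rates)
```

Checks like these suit criteria that can be verified mechanically, such as length, format, or required phrases. Criteria that need judgment, such as tone or the quality of the feedback, are where an LLM-as-judge eval would typically be considered instead.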

Resources & Links:

Coming soon from Teresa:

  • Weekly Monday posts sharing lessons learned while building AI products
  • A new podcast interviewing cross-functional teams about real-world AI product development stories
