
Zack Reneau-Wedeen is the Head of Product at Sierra, the conversational AI platform behind customer-facing agents for most of the Fortune 20. Before Sierra, he spent seven years at Google as the founding PM for Google Lens and Google Podcasts, then led product at Robinhood and CoinTracker. Sierra is mostly known for customer support, but Zack reveals how and why the company is building agents that span the entire customer lifecycle, from browsing and booking to sales and loyalty.
In this conversation with Harrison Chase, he argues agentic commerce will be bigger than e-commerce, explains why he's a "monolith loyalist", and unpacks why, when a model looks dumb, the problem is usually you.
🎧 Watch the full conversation on YouTube, or listen & subscribe on Apple Podcasts or Spotify.
What we learned
Choose the best models for the job
Most teams pick a model and commit. Sierra runs multiple models in parallel and trusts each one where it's actually stronger. Zack described one model that was the best at transcribing thick northern UK accents but also hallucinated during silence more than any other. You don't discover that tradeoff if you are committed to just one provider.
Beyond transcription models, Sierra runs Claude, Gemini, and GPT-class models the same way, with different providers for reasoning, synthesis, and speech-to-speech.
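To make the idea concrete, here is a minimal sketch of running several transcription models in parallel and trusting each one only where it is stronger. It is not Sierra's implementation: the `Segment` type, the `accent_hint` field, the energy-based `has_speech` check (a crude stand-in for a real voice activity detector), and the two stub models are all assumptions made up for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Segment:
    samples: List[float]  # mono audio samples in [-1.0, 1.0]
    accent_hint: str      # hypothetical metadata, e.g. "northern_uk"


Transcriber = Callable[[Segment], str]


def has_speech(segment: Segment, threshold: float = 0.01) -> bool:
    """Crude energy check; a real system would use a proper VAD model."""
    if not segment.samples:
        return False
    energy = sum(s * s for s in segment.samples) / len(segment.samples)
    return energy > threshold


def transcribe(
    segment: Segment,
    models: Dict[str, Transcriber],
    preferred_by_accent: Dict[str, str],
    default: str,
) -> str:
    # Skip silent audio entirely so a model that hallucinates
    # during silence never gets the chance to.
    if not has_speech(segment):
        return ""
    # Run every model concurrently on the same segment.
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        futures = {name: pool.submit(fn, segment) for name, fn in models.items()}
        results = {name: fut.result() for name, fut in futures.items()}
    # Trust the model known to be stronger for this accent, else the default.
    chosen = preferred_by_accent.get(segment.accent_hint, default)
    return results[chosen]


# Hypothetical stand-ins for real speech-to-text providers.
def accent_model(segment: Segment) -> str:
    return "accent-model transcript"


def general_model(segment: Segment) -> str:
    return "general-model transcript"


MODELS = {"accent": accent_model, "general": general_model}
ROUTING = {"northern_uk": "accent"}
```

Calling `transcribe(segment, MODELS, ROUTING, default="general")` returns the accent model's output for segments tagged `northern_uk` and the general model's output otherwise. Keeping every result in `results` before choosing is what lets you compare providers side by side and notice tradeoffs like the one Zack describes.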
Why Sierra runs on outcome-based pricing
Sierra's pricing model is built on a simple premise: if you don't see the value in sharing an outcome, the outcome probably wasn't that valuable.
Zack’s rule of thumb: outcome-based for high-value work (closing a sale, selling a car), and usage or seat-based for commodity tasks (balance checks, knowledge lookups). He expects outcome-based pricing to become the default for any AI product doing differentiated work.
Don't ship your org chart
"If you want a multi-agent system so that one team can work on one agent and one team can work on another agent, then you're shipping your org chart."
Sierra’s default is one agent per brand. The agent is the brand's voice. It knows the full customer history, the full context of the conversation, and the full set of things it can do. The moment you split that into multiple agents, you're asking customers to interact with a system that's only ever working with part of the picture.
The cost is concrete. Split triage and task into two agents and each one is working blind: the task agent never learns what triage uncovered. Sierra's bet is that the best customer experiences come from an agent that holds everything, not one that hands off.
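The context loss in a handoff can be shown with a toy sketch. Everything here is hypothetical (the `Conversation` type, the `seat_preference` fact, and the three functions); it only illustrates the argument, not how Sierra or any framework structures agents.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Conversation:
    history: List[str] = field(default_factory=list)
    facts: Dict[str, str] = field(default_factory=dict)


def single_agent(convo: Conversation) -> str:
    # One agent sees everything the conversation has gathered so far.
    seat = convo.facts.get("seat_preference", "unknown")
    return f"Rebooking with seat preference: {seat}"


def triage_agent(convo: Conversation) -> str:
    # Hands off only an intent label; whatever else it learned stays behind.
    return "rebook_flight"


def task_agent(intent: str) -> str:
    # Receives just the label, so it starts with no customer facts at all.
    facts: Dict[str, str] = {}
    seat = facts.get("seat_preference", "unknown")
    return f"Rebooking with seat preference: {seat}"
```

With a conversation where the customer has already said they want an aisle seat, `single_agent` can use that fact, while `task_agent(triage_agent(convo))` falls back to `unknown`. A richer handoff payload narrows the gap, but someone has to decide up front what to include, which is the part of the picture the second agent never sees.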
Other Topics Discussed
- How Sierra's no-code layer compiles down to agent code, and back again
- Why most multi-agent systems just ship your org chart
- Inside Sierra's modular voice architecture: thinking, listening, and talking in parallel
- Why Sierra built a PCI-certified stack for voice payments
- How outcome-based pricing aligns incentives
- Why there's no breakout memory company
Timestamps
- 00:00 Introduction
- 03:39 Analyze, build, release: how you build on Sierra
- 07:54 Inside Ghostwriter
- 11:04 Meeting models on their turf "80% of the time"
- 17:47 The one constraint Claude Code doesn't have
- 19:35 Agent-to-agent: when an API call still beats MCP
- 21:02 Why agentic commerce will be bigger than e-commerce
- 27:31 Running models in parallel and ensembling transcription
- 32:22 Inside the Agent Data Platform
- 40:00 Context engineering: everything it needs, nothing more
- 41:38 "Whenever you think the model's too dumb, the model's actually too smart"
- 46:13 Why multi-agent systems are a trap
- 48:44 Voice 101: latency, naturalism, and 60 languages
- 56:11 When voice-to-voice passes 50%: the over/under
- 57:03 Making memory a first-class primitive
- 1:02:47 Why there's no breakout memory company
- 1:08:02 Why the solution to all AI problems "is more AI"
- 1:09:20 Why Sierra open-sources the tau-bench universe
- 1:14:42 How outcome-based pricing aligns incentives
- 1:20:26 Who thrives as a forward-deployed agent builder
- 1:22:16 The Formula One analogy: why product is the bottleneck
- 1:25:47 How Sierra interviews for agency
People & Tools Mentioned During This Episode
- Agent2Agent (A2A) Protocol
- Anthropic
- ChatGPT
- Claude
- Claude Code
- Claude Mythos
- Claude Opus 4.5
- Codex
- Deep Agents
- Gemini
- Hawaiian Airlines
- LangGraph
- Model Context Protocol (MCP)
- Not Another Workflow Builder
- Redfin
- Sentry
- Shopify
- Silero
- SiriusXM
- Stripe
- Tau-bench
- Thinking Machines Lab
Get More Max Agency

Hosted by Harrison Chase, CEO of LangChain, each episode goes deep with the builders designing, deploying, and learning from real agent systems in the wild. From architecture decisions to evals, tooling, and failure modes, Max Agency is for people who want to understand what it really takes to build useful agents.

