Scalable voice agent design with Amazon Nova Sonic: multi-agent, tools, and session segmentation | Amazon Web Services

https://d2908q01vomqb2.cloudfront.net/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59/2026/05/19/ml-20826.png

Design patterns for scalable voice agents matter for organizations that need to deliver fast, natural, and reliable voice experiences. Many teams face challenges like high latency, managing real-time audio, and coordinating multiple agents in complex workflows.

In this post, you’ll learn how to use Amazon Nova Sonic, Amazon Bedrock AgentCore, and Strands BidiAgent to build scalable, maintainable voice agents that handle these challenges efficiently, resulting in more responsive and intelligent customer interactions.

We’ll explore three popular architectural patterns for voice agents, highlighting their trade-offs and best practices for minimizing latency.

The building blocks

Before diving deeper into the architecture patterns, here’s a quick overview of the three key components used as the sample solution in this post.

Amazon Nova Sonic is a foundation model that creates natural, human-like speech-to-speech conversations for generative AI applications. Users can interact with AI through voice in real time, with capabilities for understanding...

Copyright of this story solely belongs to amazon.com. To see the full text click HERE

Read more

https://static01.nyt.com/images/2026/05/18/multimedia/Biz-China-AI-01-pwzt/Biz-China-AI-01-pwzt-facebookJumbo.jpg

Three precedent-setting court rulings in China have said that employers replacing workers with AI is voluntary cost-cutting that does not justify mass layoffs

Sponsor Posts Niantic Spatial: World models need real-world data — Scaniverse is the gateway to spatial services — self-serve and built for AI and robotics. Large-area 3D reconstruction from 360° cameras and precise localization, anywhere machines operate. Protecting your Cloud Applications Data — Backing up Office 365, Google Workspace, Dropbox & Salesforce data