TECH NEWS

Amazon SageMaker AI Async Inference now supports inline request payloads | Amazon Web Services

Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload input data to Amazon Simple Storage Service (Amazon S3) before each invocation.

For payloads up to 128,000 bytes, this removes an entire network round-trip, simplifies client-side code, and reduces the operational surface area of asynchronous inference workloads.
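To make the size limit concrete, here is a minimal Python sketch of a client-side check against the 128,000-byte figure quoted above. The helper name `fits_inline` and the constant are illustrative and not part of any AWS SDK. The sketch assumes the limit is inclusive and applies to the encoded request body.

```python
# Illustrative constant: the 128,000-byte figure comes from the announcement;
# treating it as an inclusive limit is an assumption.
INLINE_LIMIT_BYTES = 128_000


def fits_inline(payload) -> bool:
    """Return True if the payload is small enough to be sent inline.

    Strings are measured after UTF-8 encoding, because the limit is stated
    in bytes rather than characters.
    """
    if isinstance(payload, str):
        payload = payload.encode("utf-8")
    return len(payload) <= INLINE_LIMIT_BYTES
```

A caller could use a check like this to choose between the inline path and the existing S3-based path for larger inputs.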

In this post, we explain the motivation behind this feature, walk through the customer experience before and after, and show you how to start using inline payloads today.

Background: How async inference worked before

You can use Amazon SageMaker AI Async Inference to queue inference requests and process them asynchronously. It’s a good fit for workloads with large payloads, variable traffic, or tolerance for seconds-to-minutes latency. It supports automatic scaling to zero, making it cost-efficient for bursty or batch-style workloads.
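For reference, the S3-based flow that the announcement says inline payloads replace can be sketched roughly as follows. This is an illustration under stated assumptions, not code from the original post. It relies on the boto3 `put_object` and `invoke_endpoint_async` calls (the latter with its `InputLocation` parameter). The function name, the key prefix, and the bucket and endpoint arguments are hypothetical. The inline-payload request field is not named in this excerpt, so it is deliberately not shown.

```python
import uuid


def invoke_via_s3(s3_client, runtime_client, endpoint_name, bucket, payload,
                  content_type="application/json", prefix="async-inputs"):
    """Upload the payload to S3, then queue an async inference request.

    s3_client and runtime_client are expected to behave like
    boto3.client("s3") and boto3.client("sagemaker-runtime").
    The bucket, prefix, and endpoint name are placeholders.
    """
    # Step 1: the extra round-trip, writing the input object to S3.
    key = f"{prefix}/{uuid.uuid4()}"
    s3_client.put_object(Bucket=bucket, Key=key, Body=payload,
                         ContentType=content_type)

    # Step 2: point the async endpoint at the uploaded object.
    response = runtime_client.invoke_endpoint_async(
        EndpointName=endpoint_name,
        InputLocation=f"s3://{bucket}/{key}",
        ContentType=content_type,
    )
    # The result is written to S3 later; the response only says where.
    return response["OutputLocation"]
```

Passing the clients in as arguments keeps the sketch free of credentials and makes the two network calls easy to see. In real use they would typically be created once with `boto3.client("s3")` and `boto3.client("sagemaker-runtime")`.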

Until...

Copyright of this story belongs solely to amazon.com.

Ubisoft CEO suggests there are pros and cons to Sony's plan to end the production of discs, 'but I think it…

* Ubisoft CEO Yves Guillemot says Sony's plan to end physical game disc production won't have a major impact on the industry
* Guillemot says there are pros and cons to the decision
* He says, "I think it will not disturb the industry too much"

Ubisoft

'Science fiction that happened': experts explain why OpenAI's 'mind-blowing' cyberattack should…

Hugging Face is probably not a name you'd heard before this week, but you likely caught the big news about OpenAI's model escaping its sandbox to launch a cyberattack on the company. In short, Hugging Face, which is kind of like the equivalent of GitHub

Netflix and Prime Video plot AI-driven homepages, but users say they don't want 'software to think' for…

* Prime Video and Netflix are planning new AI-generated homepages
* Netflix revealed its 'GenPage' tests ahead of a new Prime Video report
* Viewers aren't convinced that homepages are moving in the right direction

You're about to see a lot more generative AI in your streaming

Pentagon awards Oracle up to $7B in software consolidation deal

By Edward Graham, Managing Editor, Nextgov/FCW
July 24, 2026 11:25 AM ET

The agreement merges Oracle's product and licensing services into one contract, which the Defense Department estimates will result in at least $441 million in savings.

The Pentagon
