Perplexity built an “air-traffic controller” that decides in real time whether your AI query runs on your PC or in the cloud
TL;DR
Perplexity AI announced a platform at Computex that dynamically routes AI inference between PCs and cloud servers in real time, acting as an “air-traffic controller” for AI tasks. The chip-agnostic system targets the cost crisis of centralised inference as Perplexity’s revenue hits $500 million.
Perplexity AI has developed a platform that dynamically splits AI workloads between personal computers and cloud servers, deciding in real time which tasks can run locally on a PC’s processor and which need the power of data centre hardware. CEO Aravind Srinivas announced the system at Computex in Taipei on Tuesday, describing it as an “air-traffic controller for AI tasks” designed to reduce the cost of inference, the process of running trained AI models to generate responses.
“You don’t want all your compute centralised in servers and everything running through the largest models,” Srinivas said in a Bloomberg Television interview. “You’re...
Copyright of this story solely belongs to thenextweb.com. To see the full text click HERE