Answers · Published 2 months ago · Last edited last month · 24 sources

Perplexity's New AI Orchestrator Decides What Runs On-Device vs. Cloud in Real Time

Perplexity AI's new hybrid inference orchestrator, announced at Computex 2026, automatically assesses every AI subtask and decides in real time whether it should run locally on your device for privacy or be sent to cl... The system is designed to be chip agnostic despite its launch stage exclusivity on Intel Core Ul...


[Image: An abstract visualization of an AI brain split between a personal computer chip and a glowing data center.] Perplexity's new orchestrator acts as an intelligent bridge between on-device AI and powerful cloud agents.

At Computex 2026, Perplexity CEO Aravind Srinivas and Intel CEO Lip-Bu Tan took the stage to demonstrate a new approach to running AI: a hybrid local/server agentic inference orchestrator. Instead of forcing users to choose between underpowered on-device AI and privacy-invasive cloud processing, this software layer automatically breaks a complex task into subtasks, keeps the sensitive parts on your machine, and sends the computationally expensive parts to the cloud. The result is a single output stitched together from both worlds and, according to early ecosystem analysis, a potential 30–50% reduction in inference costs.

An 'Air-Traffic Controller' for AI Workloads

Perplexity describes the orchestrator as an "air-traffic controller" for AI tasks, and the analogy fits. When you ask the system to analyze a confidential investment deal, for example, it does not blindly send all your data to a frontier model in a faraway data center.

The process follows four distinct steps:

Local Assessment: Small models running directly on your PC's NPU or GPU first scan the task to identify any sensitive, confidential, or compliance-relevant data.
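To make the local-assessment and routing idea concrete, here is a minimal Python sketch. It is not Perplexity's implementation: the regular expressions stand in for the small on-device models, and every name in it (Subtask, is_sensitive, route, orchestrate, the heavy flag) is a hypothetical chosen for illustration. It only shows the general pattern described above: scan each subtask locally, keep anything sensitive on the device, and send only non-sensitive, compute-heavy work to the cloud.

```python
import re
from dataclasses import dataclass

# Assumption: simple patterns stand in for the small on-device models
# that flag sensitive, confidential, or compliance-relevant data.
SENSITIVE_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),        # US SSN-like number
    re.compile(r"\bconfidential\b", re.IGNORECASE),
    re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),  # email address
]


@dataclass
class Subtask:
    name: str
    text: str
    heavy: bool = False  # hypothetical flag: too expensive to run locally


def is_sensitive(text: str) -> bool:
    """Local assessment: True if any sensitive pattern matches."""
    return any(p.search(text) for p in SENSITIVE_PATTERNS)


def route(subtask: Subtask) -> str:
    """Return 'local' or 'cloud' for one subtask."""
    if is_sensitive(subtask.text):
        return "local"  # sensitive data never leaves the device
    return "cloud" if subtask.heavy else "local"


def orchestrate(subtasks: list[Subtask]) -> str:
    """Route every subtask and stitch the placements into one report."""
    return "\n".join(f"[{route(st)}] {st.name}" for st in subtasks)


if __name__ == "__main__":
    plan = [
        Subtask("extract deal terms", "Confidential term sheet for the deal", heavy=True),
        Subtask("market overview", "Summarize public market trends", heavy=True),
        Subtask("format table", "Align the columns"),
    ]
    print(orchestrate(plan))
```

In this sketch sensitivity always overrides cost, so a confidential subtask stays local even when it is flagged as heavy. A real orchestrator would need a policy for sensitive work that the local hardware cannot handle, which the source does not describe.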

