Perplexity announces hybrid AI system that decides what runs locally or in the cloud

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Cybersecurity & Data Privacy · Depth: Intermediate, quick

Summary

Perplexity announced on June 3, 2026, a new hybrid AI system, an orchestrator designed to combine AI models running on a user's local computer with powerful cloud models. This system automatically decides where each task is processed, aiming to optimize accuracy, privacy, and energy efficiency simultaneously. It will be integrated into Perplexity's "Always-on agent" product, Personal Computer, starting in July. The orchestrator ensures sensitive data, such as financial or health information, remains local, while compute-intensive tasks are routed to cloud models. Introduced in collaboration with Intel, the framework is model-agnostic and also supports other hardware like Nvidia's RTX Spark, emphasizing a shift towards local compute to reduce centralized infrastructure needs and simplify data sovereignty.

Key takeaway

For AI Architects designing agent systems or MLOps Engineers managing distributed inference, Perplexity's hybrid AI orchestrator signals a critical shift towards balancing local and cloud processing. You should evaluate how such model-agnostic frameworks can enhance data privacy by keeping sensitive information on-device while offloading compute-intensive tasks. This approach offers a blueprint for reducing centralized infrastructure costs and simplifying data sovereignty challenges in your deployments.

Key insights

Perplexity's hybrid AI orchestrator intelligently routes tasks between local and cloud models for optimal privacy and efficiency.

Principles

Method

Automatically route tasks: sensitive data stays local, compute-intensive tasks go to cloud models.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, MLOps Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.