Synapse: Federated Tool Routing via Typed Compendium Artifacts
Summary
Synapse is a novel federated framework designed for LLM-based agents to collaboratively learn tool-usage patterns while preserving data privacy and reducing communication costs. Instead of sharing raw data, model weights, or discrete prompts, Synapse trains a shared global knowledge model using "compendiums"—structured, hierarchical representations encapsulating modular knowledge and contextual information. This tiered architecture involves clients generating local compendiums, edge aggregators refining them, and a central server aggregating and redistributing a global snapshot. Evaluated on reasoning-intensive benchmarks like BBH and GSM8k, Synapse achieves 92–93% accuracy, demonstrating improved tool-usage effectiveness and significantly lower communication overhead compared to weight or prompt-sharing baselines, even with non-IID data distributions.
Key takeaway
For MLOps engineers deploying LLM-based multi-agent systems in privacy-sensitive or bandwidth-constrained environments, Synapse offers a compelling alternative to traditional federated learning. You should consider adopting a compendium-based knowledge exchange approach to achieve high tool-routing accuracy (92-93% on GSM8k) and robust performance with significantly reduced communication overhead. This method allows for collaborative learning of tool-usage patterns across heterogeneous clients while maintaining data privacy, enabling scalable and adaptive AI deployments in regulated domains like healthcare or finance.
Key insights
Synapse enables privacy-preserving federated LLM tool routing by exchanging structured knowledge compendiums instead of model parameters.
Principles
- Compendiums offer richer expressivity than prompt-sharing.
- Low communication overhead can match centralized performance.
- Fixed noise ensures routing convergence despite privacy.
Method
Clients curate local compendiums, edge aggregators deduplicate/summarize, a central server aggregates into a global snapshot. This snapshot is redistributed, enabling tool selection via vector search, LLM reranking (llama-3.1-8b-instruct), and LLM planning.
In practice
- Generate usage scenarios locally with domain-specific data.
- Apply adaptive text masking and Laplace noise for privacy.
- Use LLM rerankers (e.g., llama-3.1-8b-instruct) for tool selection.
Topics
- Federated Learning
- LLM Tool Routing
- Retrieval-Augmented Generation
- Differential Privacy
- Multi-Agent Systems
- Knowledge Representation
Best for: AI Architect, Research Scientist, AI Scientist, MLOps Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.