NeFut Logo NeFut
Admin Login

[CS.AI] Direct Latent-Space Synthesis for Efficient LLM-Agent Workflows

Published at: 2026-06-15 22:00 Last updated: 2026-06-16 12:13
#algorithm #AI #Open Source

Abstract

Large language models (LLMs) increasingly serve as execution engines for agentic systems, yet they still consume context through a sequential text interface. This creates a mismatch with modern structured agent workflows, where independent branches explore subtasks, retrieve evidence, or generate candidate solutions before a final synthesis step.

Existing systems typically merge these branches by concatenating their textual outputs, which discards the parallel structure and incurs redundant prefill computation.

In this work, we introduce Parallel-Synthesis, a plug-and-play framework that enables a synthesizer to directly consume the KV caches produced by parallel worker agents. Parallel-Synthesis combines a cache mapper that calibrates independently generated branch caches with a fine-tuned synthesizer adapter that enables generation from this non-sequential cache interface.

We train Parallel-Synthesis using data that exposes the synthesizer to parallel cache contexts, teaches aggregation across cached branches, and distills reasoning behavior from standard text-concatenation-based synthesis.

Across nine downstream datasets spanning math, science QA, code generation, GAIA, and multi-agent database diagnosis, Parallel-Synthesis matches or outperforms text-based synthesis on seven datasets and remains close on the other two. It also reduces time-to-first-token by 2.5x-11x, suggesting that direct cache-based synthesis is a promising interface for more native and efficient synthesis over parallel agent branches.

Blogger's Review: This research provides an innovative approach to the synthesis process in parallel agent systems. By directly utilizing KV caches, it not only enhances efficiency but also preserves the structural integrity of branches, which could have profound implications for future multi-task processing and intelligent agent applications.

Original Source: https://arxiv.org/abs/2606.14672

[h] Back to Home