Research2026-06-30

Query-Aware Spreading Activation for Multi-Hop Retrieval over Knowledge Graphs

Originally published byArxiv CS.AI

arXiv:2606.30133v1 Announce Type: cross Abstract: Retrieval-augmented generation built on knowledge graphs (Graph RAG) outperforms flat passage retrieval on multi-hop question answering by leveraging graph structure. In most existing systems, however, the question only sets the seed nodes; the...

What Happened

A new arXiv preprint (2606.30133) introduces a technique called Query-Aware Spreading Activation for multi-hop retrieval over knowledge graphs. The core problem it addresses is a limitation in current Graph RAG systems: while these systems use graph structure to improve over flat passage retrieval, the query typically only influences the selection of initial seed nodes. After that, the spreading activation—the process of traversing the graph to find relevant information—proceeds in a largely query-agnostic manner, treating all edges and nodes equally. The proposed method makes the spreading activation itself sensitive to the query, dynamically weighting edges and nodes based on their relevance to the specific question. This allows the retrieval process to prioritize paths that are semantically aligned with the query, rather than blindly expanding outward.

Why It Matters

This work addresses a fundamental inefficiency in knowledge graph retrieval. In standard Graph RAG, a multi-hop question like "Which company founded by a Nobel laureate acquired a startup in 2023?" might start with seed nodes for "Nobel laureate" and "2023 acquisition," but then spread activation equally across all connected entities—including irrelevant ones like a laureate's unrelated publications or a company's routine press releases. Query-aware spreading activation prunes this noise by scoring each hop against the query's intent, effectively creating a focused subgraph for the retriever.

The significance is threefold. First, it improves retrieval precision for complex, multi-step questions without requiring additional model training or fine-tuning—the query-awareness is injected as a lightweight scoring mechanism on the graph traversal. Second, it reduces computational waste: by avoiding expansion into irrelevant graph regions, the retriever processes fewer nodes and edges per query, lowering latency and cost. Third, it addresses a known weakness of Graph RAG where the graph structure can actually mislead retrieval—if the graph is dense or noisy, blind spreading can drown out the signal.

Implications for AI Practitioners

For engineers building RAG pipelines over knowledge graphs (e.g., enterprise knowledge bases, biomedical ontologies, or product catalogs), this technique offers a practical upgrade. It can be implemented as a middleware layer between the graph database and the LLM, requiring no changes to the underlying graph or the generation model. The key engineering consideration is the scoring function for query-edge relevance: the authors likely use a lightweight embedding similarity or a small learned model, which must be fast enough to not become a bottleneck.

Practitioners should also note that this approach is complementary to other Graph RAG improvements like hierarchical graph summarization or entity disambiguation. It is most beneficial when the knowledge graph is large, heterogeneous, or contains many weak connections—scenarios where blind spreading leads to low precision. For small or highly curated graphs, the gains may be marginal.

However, there is a trade-off: query-aware scoring adds a per-hop computation cost. Teams should benchmark this against their specific graph density and query complexity. The technique also assumes the graph is well-structured with meaningful edge types—if edges are poorly labeled, the scoring function may struggle.

Key Takeaways

Query-aware spreading activation improves Graph RAG by making the graph traversal process responsive to the specific question, not just the initial seed nodes.
This approach boosts retrieval precision for multi-hop queries without retraining the LLM or modifying the knowledge graph structure.
Practitioners should evaluate the scoring function's latency and accuracy against their graph's density and edge quality before deployment.
The technique is most impactful for large, noisy knowledge graphs where blind spreading leads to irrelevant expansions and wasted computation.

Read Original Article on Arxiv CS.AI

arxivpapers