Paper H / IX: Geodesic Trajectory Caching as a RAG Alternative

NagusameCS · April 2026 · Part of HyperTensor Papers I-X

30.9 usper-query latency

5.96 KBper record

97xbatched Jacobi gain

Abstract

Retrieval-Augmented Generation (RAG) pipelines add latency, index maintenance, and external dependency costs to language model inference. Paper IX proposes Geodesic Trajectory Caching (GTC) as a manifold-native alternative: instead of retrieving documents from an external vector database, the model retrieves previously-computed geodesic paths through its own intrinsic k-manifold. Each cached trajectory stores the input embedding, the sequence of geodesic segments traversed during decoding, and the Jacobi correction field---requiring only 5.96 KB per record. At inference time, a cosine-similarity lookup (30.9 microseconds) finds the closest cached trajectory, and the Jacobi field is parallel-transported to the current query position via Magnus-3 expansion, providing a manifold-consistent prior for the next token distribution. On SmolLM2-135M-Instruct, GTC achieves comparable factuality to a 50-document DPR retriever with 340x lower latency and zero external infrastructure.

Key Findings

340x latency advantage: GTC lookup at 30.9 us vs typical vector DB retrieval at ~10 ms.
Zero external infrastructure: No embedding database, no index refresh, no separate retriever model.
97x batched Jacobi gain: At batch size B=10, Magnus-3 parallel transport across cached trajectories is 97x faster than sequential Jacobi integration.
Cross-model transfer: Caches trained on SmolLM2-135M transfer partially to Phi-3.5-mini (34% hit rate) and Gemma-4-E2B (28%), suggesting manifold universality.
Storage efficiency: 5.96 KB/record means 1M trajectories fit in 5.8 GB, comparable to a mid-size vector index.

Reproduction: The OTT runtime is exercised via geod.ps1 benchmark --mode ott --batch 10. The 97x batched-Jacobi measurement requires the full GTC cache (included in the Models tab).

Download PDF LaTeX source