Paper J / X: Chimeric Model Vector Bridging (CECI)
Abstract
Paper X introduces the Cross-Embedding Compatibility Index (CECI) and Chimeric Model Vector Bridging (CMVB): a protocol for measuring whether independently-trained transformer models have compatible internal representations that would permit surgical component transfer (e.g., grafting one model's attention heads onto another's body). The core metric is the Grassmann distance between the subspaces spanned by corresponding weight matrices. A systematic sweep of SmolLM2-135M layers reveals that no pair of non-adjacent layers achieves GD below the 0.94 threshold required for gauge-aligned splicing without post-hoc alignment. However, a gauge transformation (the Axiom Gauge from Paper VI) applied to both donor and recipient reduces GD from 0.96 to 0.15 in cross-band splicing tests, achieving a splice residual below 0.15 and CECI above 0.85. This demonstrates that component transfer is possible but requires a shared coordinate system---motivating the Universal Geodesic Taxonomy proposed in follow-on work (Papers XI+).
Key Findings
- No natural compatibility: Without gauge alignment, Grassmann distance between corresponding layers of independently-trained models exceeds 0.86, making naive component transfer infeasible.
- Gauge alignment works: The Axiom Gauge from Paper VI reduces inter-model GD to 0.15, enabling viable splicing.
- Permutation invariance is the bottleneck: Neural networks develop arbitrary coordinate systems; the GL(d) symmetry of weight space means two models with identical initialization but different training data will diverge in their internal representations.
- CECI as a diagnostic: The Cross-Embedding Compatibility Index provides a single scalar metric (0-1) for assessing whether two models can be mechanically combined.
- Motivates UGT: The findings directly motivate the Universal Geodesic Taxonomy (Paper XI), which proposes to eliminate permutation invariance by design through enforced taxonomic coordinates during training.
python scripts/ceci_splice_test.py --model-a ... --model-b .... Gauge alignment requires the Axiom Gauge module from scripts/axiom_gauge.py. Full sweep protocol in the PDF appendix.