Abstract
Can skills be surgically extracted from one transformer and grafted into another? We introduce CECI, a gauge-aligned manifold projection that measures geometric compatibility between layers, and apply it to 120 layer pairs of SmolLM2-135M. Within-band grafts (adjacent layers, $\Delta L \le 4$) are viable: Grassmann distance (GD) $<0.92$, subspace overlap $\ge 15\%$, gauge alignment $+74\%$. Cross-band grafts (Mix→Refine) are infeasible: GD $>0.96$, gauge $\Delta \approx 0$, residual $>100\%$. We publish seven Danish-named chimeric models on HuggingFace, five of which improve MMLU over the baseline. Cross-model grafting is also confirmed: a Qwen2.5-0.5B FFN in a SmolLM2-135M body gains 6 pp on MMLU.
1. CECI Feasibility Map
| Band Pair | ΔL | Overlap | GD | Viable? |
|---|---|---|---|---|
| Mix→Mix | 0–2 | 24.9% | 0.89 | Yes |
| Compress→Compress | 0–2 | 20.1% | 0.91 | Yes |
| Mix→Compress | 2–4 | 15.4% | 0.92 | |
| Mix→Refine | 8–12 | 7.6% | 0.96 | No |
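
The Overlap and GD columns can be illustrated with principal angles between two layer subspaces. The sketch below is not the CECI implementation. It assumes each layer is summarized by a matrix whose columns span its subspace, takes overlap as the mean squared cosine of the principal angles, and normalizes the geodesic Grassmann distance by its maximum $\sqrt{k}\,\pi/2$ so that it lies in $[0, 1]$. The function names and both normalizations are assumptions made for illustration.

```python
import numpy as np


def principal_angles(A, B):
    """Principal angles (radians) between the column spaces of A and B."""
    Qa, _ = np.linalg.qr(A)  # orthonormal basis for span(A)
    Qb, _ = np.linalg.qr(B)  # orthonormal basis for span(B)
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    return np.arccos(np.clip(cosines, 0.0, 1.0))


def grassmann_distance(A, B):
    """Geodesic Grassmann distance, scaled to [0, 1] (assumed normalization)."""
    theta = principal_angles(A, B)
    return float(np.linalg.norm(theta) / (np.sqrt(len(theta)) * np.pi / 2))


def subspace_overlap(A, B):
    """Mean squared cosine of the principal angles (assumed overlap measure)."""
    theta = principal_angles(A, B)
    return float(np.mean(np.cos(theta) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two random 8-dimensional subspaces of a 64-dimensional space.
    A = rng.standard_normal((64, 8))
    B = rng.standard_normal((64, 8))
    print("GD:", grassmann_distance(A, B))
    print("overlap:", subspace_overlap(A, B))
```

Under these assumptions, identical subspaces give a distance of 0 and an overlap of 1, while mutually orthogonal subspaces give a distance of 1 and an overlap of 0. A low overlap and a high distance therefore point the same way, as in the Mix→Refine row.
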
2. Seven Danish Chimeras
| Model | MMLU | BoolQ |
|---|---|---|
| SmolLM2-135M (baseline) | 62% | 40% |
| minElskede | 68% | 53% |
| minFjollede (cross-model) | 68% | 47% |
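Cross-model grafting: minFjollede places a Qwen2.5-0.5B FFN in a SmolLM2-135M body and gains 6 pp on MMLU (62% to 68%) and 7 pp on BoolQ (40% to 47%) over the baseline. We take this as direct evidence that the GRC basis transfers functional knowledge.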
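
The percentage-point gains can be recomputed from the table above. The snippet is a small arithmetic check on the reported scores only; the dictionary layout and the helper name `gains_pp` are illustrative choices.

```python
# Scores (in %) copied from the table above.
scores = {
    "SmolLM2-135M (baseline)": {"MMLU": 62, "BoolQ": 40},
    "minElskede": {"MMLU": 68, "BoolQ": 53},
    "minFjollede (cross-model)": {"MMLU": 68, "BoolQ": 47},
}

baseline = scores["SmolLM2-135M (baseline)"]


def gains_pp(model_name):
    """Percentage-point gain of a model over the baseline, per benchmark."""
    return {task: scores[model_name][task] - baseline[task] for task in baseline}


if __name__ == "__main__":
    for name in scores:
        print(name, gains_pp(name))
```
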
References
- Stewart, W.K.O. Universal Geodesic Taxonomy. HyperTensor Paper XI, 2026.
- Stewart, W.K.O. Geodesic Projection Pipeline. HyperTensor Paper II, 2026.