Our paper on disk-aware KV cache offloading for long-context on-device inference has been conditionally accepted at MobiSys 2026.
Feb 28, 2026