Chunwei Xia
Open Menu
Close Menu
Bio
Papers
Experience
Projects
Talks
News
Teaching
Large Language Models
KVSwap: Disk-aware KV Cache Offloading for Long-Context On-device Inference
Mar 1, 2026
The new compiler stack: a survey on the synergy of LLMs and compilers
Jan 9, 2026