Publications

(2025). Leveraging Compilation Statistics for Compiler Phase Ordering. IPDPS'25.
(2025). Accelerating Tensor-train Decomposition on Graph Neural Networks. IPDPS'25.
(2024). Optimizing Deep Learning Inference via Global Analysis and Tensor Expression. In ASPLOS ‘24.
(2022). HOPE: a heterogeneity-oriented parallel execution engine for inference on mobiles. In HTL ‘22.
(2021). Optimizing Deep Learning Inference via Global Analysis and Tensor Expression. In ATS ‘21.
(2019). DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing. In TACO 19.
(2018). Optimizing Deep Learning Inference via Global Analysis and Tensor Expression. In NPC ‘18.
(2018). Characterizing DNN Models for Edge-Cloud Computing. In IISWC ‘18.