Characterizing DNN Models for Edge-Cloud Computing

Oct 13, 2018 · Chunwei Xia, Jiacheng Zhao, Huimin Cui, Xiaobing Feng
Abstract
Traditionally, Deep Neural Network (DNN) services are deployed in the cloud because DNN models are computation-intensive. In recent years, emerging edge computing has opened new possibilities for DNN applications: we now have opportunities to process DNN models in the cloud and on the device collaboratively, i.e., edge-cloud computing. Since cloud and edge devices differ significantly in inference latency, network transmission overhead, memory capacity, and power consumption, determining how to deploy a DNN model across the cloud and edge devices is a big challenge. In this paper, we characterize the behaviours of three types of DNN models, i.e., CNN, LSTM, and MLP, on four types of platforms, i.e., server-class CPU, server-class GPU, embedded device with GPU, and smartphones. Our experimental results demonstrate that a deployment strategy for DNN models can be carefully tuned across the cloud and the big and/or little cores of the edge device to balance performance and power consumption.
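The kind of deployment decision described above can be sketched as a small selection procedure: given per-platform latency and power profiles, pick the placement that minimizes energy per inference while meeting a latency budget. The platform names, profile numbers, and the `pick_deployment` helper below are illustrative assumptions for exposition, not measurements or code from the paper.

```python
# Hypothetical sketch of an edge-cloud deployment choice balancing
# performance (latency) against power, in the spirit of the paper's
# characterization. All numbers are made-up placeholders.

PROFILES = {
    # platform: (inference_latency_ms, power_w) -- illustrative values only
    "cloud_gpu":   (5.0,   250.0),
    "cloud_cpu":   (40.0,  120.0),
    "edge_big":    (60.0,    4.0),   # big cores of the edge device
    "edge_little": (180.0,   1.0),   # little cores of the edge device
}

# Network transmission overhead applies only when offloading to the cloud.
NETWORK_MS = {"cloud_gpu": 50.0, "cloud_cpu": 50.0,
              "edge_big": 0.0, "edge_little": 0.0}

def pick_deployment(latency_budget_ms):
    """Return the platform with the lowest energy per inference
    whose end-to-end latency fits the budget, or None if none fits."""
    best = None
    for platform, (lat_ms, power_w) in PROFILES.items():
        total_ms = lat_ms + NETWORK_MS[platform]
        if total_ms > latency_budget_ms:
            continue
        energy_mj = power_w * lat_ms  # W * ms = mJ spent computing
        if best is None or energy_mj < best[1]:
            best = (platform, energy_mj)
    return best[0] if best else None

print(pick_deployment(100.0))  # tight budget -> "edge_big"
print(pick_deployment(300.0))  # relaxed budget -> "edge_little"
```

With a tight latency budget the big cores win on energy among feasible options; relaxing the budget lets the frugal little cores take over, mirroring the performance/power trade-off the abstract describes.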
Type
Publication
In 2018 IEEE International Symposium on Workload Characterization