Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Cai, Han; Lin, Ji; Lin, Yujun; Liu, Zhijian; Tang, Haotian; Wang, Hanrui; Zhu, Ligeng; Han, Song

doi:10.1145/3486618

Computer Science > Machine Learning

arXiv:2204.11786 (cs)

[Submitted on 25 Apr 2022]

Title:Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Authors:Han Cai, Ji Lin, Yujun Lin, Zhijian Liu, Haotian Tang, Hanrui Wang, Ligeng Zhu, Song Han

View PDF

Abstract:Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI), including computer vision, natural language processing and speech recognition. However, their superior performance comes at the considerable cost of computational complexity, which greatly hinders their applications in many resource-constrained devices, such as mobile phones and Internet of Things (IoT) devices. Therefore, methods and techniques that are able to lift the efficiency bottleneck while preserving the high accuracy of DNNs are in great demand in order to enable numerous edge AI applications. This paper provides an overview of efficient deep learning methods, systems and applications. We start from introducing popular model compression methods, including pruning, factorization, quantization as well as compact model design. To reduce the large design cost of these manual solutions, we discuss the AutoML framework for each of them, such as neural architecture search (NAS) and automated pruning and quantization. We then cover efficient on-device training to enable user customization based on the local data on mobile devices. Apart from general acceleration techniques, we also showcase several task-specific accelerations for point cloud, video and natural language processing by exploiting their spatial sparsity and temporal/token redundancy. Finally, to support all these algorithmic advancements, we introduce the efficient deep learning system design from both software and hardware perspectives.

Comments:	Journal preprint (ACM TODAES, 2021). The first seven authors contributed equally to this work and are listed in the alphabetical order
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.11786 [cs.LG]
	(or arXiv:2204.11786v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.11786
Journal reference:	ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 27, Issue 3, Article 20, Page 1-50, 2021
Related DOI:	https://doi.org/10.1145/3486618

Submission history

From: Zhijian Liu [view email]
[v1] Mon, 25 Apr 2022 16:52:48 UTC (29,216 KB)

Computer Science > Machine Learning

Title:Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators