Efficient Network Construction through Structural Plasticity

Xiaocong Du, Zheng Li, Yufei Ma, Yu Cao

Research output: Contribution to journalArticle

1 Scopus citations

Abstract

Deep Neural Networks (DNNs) on hardware is facing excessive computation cost due to the massive number of parameters. A typical training pipeline to mitigate over-parameterization is to pre-define a DNN structure with redundant learning units (filters and neurons) with the goal of high accuracy, then to prune redundant learning units after training with the purpose of efficient inference. We argue that it is sub-optimal to introduce redundancy into training in order to reduce redundancy later in inference. Moreover, the fixed network structure further results in poor adaption to dynamic tasks, such as lifelong learning. In contrast, structural plasticity plays an indispensable role in mammalian brains to achieve compact and accurate learning. Throughout the lifetime, active connections are continuously created while those that are no longer important are degenerated. Inspired by such observation, we propose a training scheme, namely Continuous Growth and Pruning (CGaP), where we start the training from a small network seed, then literally execute continuous growth by adding important learning units and finally prune secondary ones for efficient inference. The inference model generated from CGaP is sparse in the structure, largely decreasing the inference power and latency when deployed on hardware platforms. With popular DNN structures on representative datasets, the efficacy of CGaP is benchmarked by both algorithmic simulation and architectural modeling on Field-programmable Gate Arrays (FPGA). For example, CGaP decreases the FLOPs, model size, DRAM access energy and inference latency by 63.3%, 64.0%, 11.8% and 40.2%, respectively, for ResNet-110 on CIFAR-10.

Original languageEnglish (US)
Article number8788560
Pages (from-to)453-464
Number of pages12
JournalIEEE Journal on Emerging and Selected Topics in Circuits and Systems
Volume9
Issue number3
DOIs
StatePublished - Sep 2019

    Fingerprint

Keywords

  • Deep learning
  • algorithm-hardware co-design
  • hardware acceleration
  • model pruning
  • structural plasticity

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this