Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHuai, Shuoen_US
dc.contributor.authorZhang, Leien_US
dc.contributor.authorLiu, Dien_US
dc.contributor.authorLiu, Weichenen_US
dc.contributor.authorSubramaniam, Ravien_US
dc.identifier.citationHuai, S., Zhang, L., Liu, D., Liu, W. & Subramaniam, R. (2021). ZeroBN : learning compact neural networks for latency-critical edge systems. 2021 58th ACM/IEEE Design Automation Conference (DAC), 151-156.
dc.description.abstractEdge devices have been widely adopted to bring deep learning applications onto low power embedded systems, mitigating the privacy and latency issues of accessing cloud servers. The increasingly computational demand of complex neural network models leads to large latency on edge devices with limited resources. Many application scenarios are real-time and have a strict latency constraint, while conventional neural network compression methods are not latency-oriented. In this work, we propose a novel compact neural networks training method to reduce the model latency on latency-critical edge systems. A latency predictor is also introduced to guide and optimize this procedure. Coupled with the latency predictor, our method can guarantee the latency for a compact model by only one training process. The experiment results show that, compared to state-of-the-art model compression methods, our approach can well-fit the 'hard' latency constraint by significantly reducing the latency with a mild accuracy drop. To satisfy a 34ms latency constraint, we compact ResNet-50 with 0.82% of accuracy drop. And for GoogLeNet, we can even increase the accuracy by 0.3%en_US
dc.description.sponsorshipNational Research Foundation (NRF)en_US
dc.rights©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at:
dc.subjectEngineering::Computer science and engineering::Computing methodologies::Artificial intelligenceen_US
dc.titleZeroBN : learning compact neural networks for latency-critical edge systemsen_US
dc.typeConference Paperen
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.contributor.conference2021 58th ACM/IEEE Design Automation Conference (DAC)en_US
dc.contributor.researchHP-NTU Digital Manufacturing Corporate Laben_US
dc.description.versionSubmitted/Accepted versionen_US
dc.subject.keywordsCompact Learningen_US
dc.citation.conferencelocationSan Francisco, CA, USAen_US
dc.description.acknowledgementThis research was conducted in collaboration with HP Inc. and supported by National Research Foundation (NRF) Singapore and the Singapore Government through the Industry Alignment Fund - Industry Collaboration Projects Grant (I1801E0028).en_US
item.fulltextWith Fulltext-
Appears in Collections:SCSE Conference Papers
Files in This Item:
File Description SizeFormat 
ZeroBN_Accept_Version.pdf1.15 MBAdobe PDFView/Open

Citations 50

Updated on Jul 9, 2022

Page view(s)

Updated on Sep 28, 2022


Updated on Sep 28, 2022

Google ScholarTM




Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.