Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/164833
Title: Smart scissor: coupling spatial redundancy reduction and CNN compression for embedded hardware
Authors: Kong, Hao
Liu, Di
Huai, Shuo
Luo, Xiangzhong
Liu, Weichen
Subramaniam, Ravi
Makaya, Christian
Lin, Qian
Keywords: Engineering::Computer science and engineering
Issue Date: 2022
Source: Kong, H., Liu, D., Huai, S., Luo, X., Liu, W., Subramaniam, R., Makaya, C. & Lin, Q. (2022). Smart scissor: coupling spatial redundancy reduction and CNN compression for embedded hardware. 41st IEEE/ACM International Conference on Computer-Aided Design, 1-9. https://dx.doi.org/10.1145/3508352.3549397
Project: I1801E0028 
M4082282 
Conference: 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD)
Abstract: Scaling down the resolution of input images can greatly reduce the computational overhead of convolutional neural networks (CNNs), which is promising for edge AI. However, as an image usually contains much spatial redundancy, e.g., background pixels, directly shrinking the whole image will lose important features of the foreground object and lead to severe accuracy degradation. In this paper, we propose a dynamic image cropping framework to reduce the spatial redundancy by accurately cropping the foreground object from images. To achieve the instance-aware fine cropping, we introduce a lightweight foreground predictor to efficiently localize and crop the foreground of an image. The finely cropped images can be correctly recognized even at a small resolution. Meanwhile, computational redundancy also exists in CNN architectures. To pursue higher execution efficiency on resource-constrained embedded devices, we also propose a compound shrinking strategy to coordinately compress the three dimensions (depth, width, resolution) of CNNs. Eventually, we seamlessly combine the proposed dynamic image cropping and compound shrinking into a unified compression framework, Smart Scissor, which is expected to significantly reduce the computational overhead of CNNs while still maintaining high accuracy. Experiments on ImageNet-1K demonstrate that our method reduces the computational cost of ResNet50 by 41.5% while improving the top-1 accuracy by 0.3%. Moreover, compared to HRank, the state-of-theart CNN compression framework, our method achieves 4.1% higher top-1 accuracy at the same computational cost. The codes and data are available at https://github.com/ntuliuteam/smart-scissor.
URI: https://hdl.handle.net/10356/164833
ISBN: 9781450392174
DOI: 10.1145/3508352.3549397
DOI (Related Dataset): 10.21979/N9/NB6FU2
Schools: School of Computer Science and Engineering 
Research Centres: HP-NTU Digital Manufacturing Corporate Lab
Rights: © 2022 Association for Computing Machinery. All rights reserved. This paper was published in the Proceedings of IEEE/ACM International Conference On Computer Aided Design (ICCAD) and is made available with permission of Association for Computing Machinery.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Conference Papers

Files in This Item:
File Description SizeFormat 
Smart scissor.pdf1.38 MBAdobe PDFThumbnail
View/Open

SCOPUSTM   
Citations 50

4
Updated on Mar 13, 2025

Web of ScienceTM
Citations 50

1
Updated on Oct 28, 2023

Page view(s)

212
Updated on Mar 13, 2025

Download(s) 50

195
Updated on Mar 13, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.