摘要: Quantization is an effective technique for Deep Neural Network (DNN) inference acceleration. However, conventional quantization techniques are either applied at network or layer level that may fail to exploit fine-grained quantiza... 展开
作者 | Zhuoran Song Bangqi Fu Feiyang Wu Zhaoming Jiang Li Jiang Naifeng Jing Xiaoyao Liang | ||
---|---|---|---|
作者单位 | |||
文集名称 | 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture | ||
出版年 | 2020 | ||
出版社 | Institute of Electrical and Electronics Engineers | ||
会议名称 | International Symposium on Computer Architecture | ||
开始页/总页数 | 1010 / 12 | ||
会议日期/会议地点 | 20200530-0603 / Valencia | 会议年/会议届次 | 2020 / 47th |
中图分类号 | TP3 | ||
馆藏号 | N2020080300164120 |