In this paper, we explore the acceleration of DNNs using BCMs on a state-of-the-art GPU. First, we identify the challenges posed by using BCMs.
Topics: Deep Neural Networks · General Matrix Multiplication · Graphics Processing Unit · Convolutional Layers · Block-circulant Matrices · Computational Complexity ...
One attractive approach is to leverage Block Circulant Matrices (BCM), compressing the linear transformation layers, e.g., convolutional and fully-connected ...
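For illustration, the core of the BCM idea can be sketched in a few lines of NumPy: each circulant block is represented by a single defining vector, and the block's matrix-vector product reduces to an FFT, an element-wise product, and an inverse FFT. The names below (block_circulant_matvec, first_cols) are illustrative only and are not taken from the paper.

import numpy as np

def block_circulant_matvec(first_cols, x, block_size):
    # Compute y = W x for a block-circulant W.
    # first_cols has shape (p, q, block_size): the defining (first-column)
    # vector of each of the p*q circulant blocks.  x has length q*block_size.
    p, q, b = first_cols.shape
    assert b == block_size and x.size == q * b
    x_blocks = x.reshape(q, b)
    W_f = np.fft.fft(first_cols, axis=-1)        # (p, q, b)
    x_f = np.fft.fft(x_blocks, axis=-1)          # (q, b)
    # Circulant matvec = circular convolution: multiply in the frequency
    # domain, sum over the column-block index, then transform back.
    y_f = (W_f * x_f[None, :, :]).sum(axis=1)    # (p, b)
    return np.fft.ifft(y_f, axis=-1).real.reshape(p * b)

# Sanity check against an explicitly materialized block-circulant matrix.
p, q, b = 2, 3, 4
rng = np.random.default_rng(0)
c = rng.standard_normal((p, q, b))
x = rng.standard_normal(q * b)
W = np.block([[np.stack([np.roll(c[i, j], s) for s in range(b)], axis=1)
               for j in range(q)] for i in range(p)])
assert np.allclose(W @ x, block_circulant_matvec(c, x, b))

This reduces the per-block cost from O(b^2) to O(b log b) and the storage from b^2 to b values, which is the source of both the compression and the potential acceleration the snippets here refer to.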
Shi Dong, Pu Zhao, Xue Lin, and David Kaeli. "Exploring GPU acceleration of Deep Neural Networks using Block Circulant Matrices." Parallel Computing 100(C), 2021. ISSN 0167-8191.
The fixed-point quantization and the proposed block-circulant matrix-based inference scheme enable the network to achieve as high as 3.5 TOPS computation ...
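The snippet above names fixed-point quantization as one ingredient of that result. As a minimal, hedged sketch (the bit-width, fraction split, and rounding mode here are assumptions, not values taken from that work), quantizing a weight tensor to a signed fixed-point grid can be written as:

import numpy as np

def quantize_fixed_point(w, total_bits=16, frac_bits=12):
    # Map w onto a signed fixed-point grid with `frac_bits` fractional bits,
    # saturating at the representable range.  Dequantize with q / scale.
    # total_bits and frac_bits are illustrative choices, not the paper's.
    scale = 2 ** frac_bits
    qmin = -(2 ** (total_bits - 1))
    qmax = 2 ** (total_bits - 1) - 1
    q = np.clip(np.round(w * scale), qmin, qmax).astype(np.int32)
    return q, scale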
In contrast, CirCNN (b) uses the block-circulant matrix to avoid storage waste and achieve a fine-grained tradeoff of accuracy and compression/acceleration.
To overcome these limitations, this paper proposes CirCNN, a principled approach to represent weights and process neural networks using block-circulant matrices ...
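The "fine-grained tradeoff" mentioned above comes from the block size: a block-circulant weight matrix stores only one length-b vector per b-by-b block, so the block size directly sets the compression factor. A small illustrative helper (not from either paper) makes the arithmetic explicit:

def bcm_param_count(rows, cols, block_size):
    # A (rows x cols) block-circulant matrix with b x b circulant blocks keeps
    # one length-b vector per block: (rows/b) * (cols/b) * b parameters,
    # versus rows * cols for a dense matrix -- a factor-of-b reduction.
    assert rows % block_size == 0 and cols % block_size == 0
    return (rows // block_size) * (cols // block_size) * block_size

# Example: a 4096 x 4096 fully-connected layer with block size 16 stores
# bcm_param_count(4096, 4096, 16) == 1048576 weights instead of 16777216,
# a 16x reduction; growing or shrinking b trades accuracy against compression.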
Exploring GPU acceleration of DNNs using Block Circulant Matrices. DNN training on a GPU can be highly inefficient, even causing a significant slowdown. In ...
Exploring GPU acceleration of Deep Neural Networks using Block Circulant Matrices (Parallel Computing) · DNNMark: configurable benchmark suite of Deep Neural ...