Multiplexed gradient descent: Fast online training of modern datasets on hardware neural networks without backpropagation

McCaughan, Adam N.; Oripov, Bakhrom G.; Ganesh, Natesh; Nam, Sae Woo; Dienstfrey, Andrew; Buckley, Sonia M.

doi:10.1063/5.0157645

Computer Science > Machine Learning

arXiv:2303.03986 (cs)

[Submitted on 5 Mar 2023]

Title:Multiplexed gradient descent: Fast online training of modern datasets on hardware neural networks without backpropagation

Authors:Adam N. McCaughan, Bakhrom G. Oripov, Natesh Ganesh, Sae Woo Nam, Andrew Dienstfrey, Sonia M. Buckley

View PDF

Abstract:We present multiplexed gradient descent (MGD), a gradient descent framework designed to easily train analog or digital neural networks in hardware. MGD utilizes zero-order optimization techniques for online training of hardware neural networks. We demonstrate its ability to train neural networks on modern machine learning datasets, including CIFAR-10 and Fashion-MNIST, and compare its performance to backpropagation. Assuming realistic timescales and hardware parameters, our results indicate that these optimization techniques can train a network on emerging hardware platforms orders of magnitude faster than the wall-clock time of training via backpropagation on a standard GPU, even in the presence of imperfect weight updates or device-to-device variations in the hardware. We additionally describe how it can be applied to existing hardware as part of chip-in-the-loop training, or integrated directly at the hardware level. Crucially, the MGD framework is highly flexible, and its gradient descent process can be optimized to compensate for specific hardware limitations such as slow parameter-update speeds or limited input bandwidth.

Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2303.03986 [cs.LG]
	(or arXiv:2303.03986v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.03986
Journal reference:	APL Machine Learning 1, 026118 (2023)
Related DOI:	https://doi.org/10.1063/5.0157645

Submission history

From: Adam McCaughan [view email]
[v1] Sun, 5 Mar 2023 19:45:09 UTC (839 KB)

Computer Science > Machine Learning

Title:Multiplexed gradient descent: Fast online training of modern datasets on hardware neural networks without backpropagation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Multiplexed gradient descent: Fast online training of modern datasets on hardware neural networks without backpropagation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators