Implicit Bias of Gradient Descent on Linear Convolutional Networks

Gunasekar, Suriya; Lee, Jason; Soudry, Daniel; Srebro, Nathan

Computer Science > Machine Learning

arXiv:1806.00468 (cs)

[Submitted on 1 Jun 2018 (v1), last revised 11 Jan 2019 (this version, v2)]

Title:Implicit Bias of Gradient Descent on Linear Convolutional Networks

Authors:Suriya Gunasekar, Jason Lee, Daniel Soudry, Nathan Srebro

View PDF

Abstract:We show that gradient descent on full-width linear convolutional networks of depth $L$ converges to a linear predictor related to the $\ell_{2/L}$ bridge penalty in the frequency domain. This is in contrast to linearly fully connected networks, where gradient descent converges to the hard margin linear support vector machine solution, regardless of depth.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1806.00468 [cs.LG]
	(or arXiv:1806.00468v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1806.00468

Submission history

From: Suriya Gunasekar [view email]
[v1] Fri, 1 Jun 2018 17:58:58 UTC (58 KB)
[v2] Fri, 11 Jan 2019 02:51:38 UTC (72 KB)

Computer Science > Machine Learning

Title:Implicit Bias of Gradient Descent on Linear Convolutional Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Implicit Bias of Gradient Descent on Linear Convolutional Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators