Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Gulcehre, Caglar; Cho, Kyunghyun; Pascanu, Razvan; Bengio, Yoshua

Computer Science > Neural and Evolutionary Computing

arXiv:1311.1780 (cs)

[Submitted on 7 Nov 2013 (v1), last revised 2 Sep 2014 (this version, v7)]

Title:Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Authors:Caglar Gulcehre, Kyunghyun Cho, Razvan Pascanu, Yoshua Bengio

View PDF

Abstract:In this paper we propose and investigate a novel nonlinear unit, called $L_p$ unit, for deep neural networks. The proposed $L_p$ unit receives signals from several projections of a subset of units in the layer below and computes a normalized $L_p$ norm. We notice two interesting interpretations of the $L_p$ unit. First, the proposed unit can be understood as a generalization of a number of conventional pooling operators such as average, root-mean-square and max pooling widely used in, for instance, convolutional neural networks (CNN), HMAX models and neocognitrons. Furthermore, the $L_p$ unit is, to a certain degree, similar to the recently proposed maxout unit (Goodfellow et al., 2013) which achieved the state-of-the-art object recognition results on a number of benchmark datasets. Secondly, we provide a geometrical interpretation of the activation function based on which we argue that the $L_p$ unit is more efficient at representing complex, nonlinear separating boundaries. Each $L_p$ unit defines a superelliptic boundary, with its exact shape defined by the order $p$. We claim that this makes it possible to model arbitrarily shaped, curved boundaries more efficiently by combining a few $L_p$ units of different orders. This insight justifies the need for learning different orders for each unit in the model. We empirically evaluate the proposed $L_p$ units on a number of datasets and show that multilayer perceptrons (MLP) consisting of the $L_p$ units achieve the state-of-the-art results on a number of benchmark datasets. Furthermore, we evaluate the proposed $L_p$ unit on the recently proposed deep recurrent neural networks (RNN).

Comments:	ECML/PKDD 2014
Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1311.1780 [cs.NE]
	(or arXiv:1311.1780v7 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1311.1780

Submission history

From: KyungHyun Cho [view email]
[v1] Thu, 7 Nov 2013 18:30:37 UTC (704 KB)
[v2] Mon, 11 Nov 2013 03:32:43 UTC (719 KB)
[v3] Tue, 12 Nov 2013 18:32:42 UTC (719 KB)
[v4] Wed, 29 Jan 2014 22:55:24 UTC (628 KB)
[v5] Sat, 1 Feb 2014 18:17:38 UTC (832 KB)
[v6] Fri, 7 Feb 2014 18:55:42 UTC (832 KB)
[v7] Tue, 2 Sep 2014 00:53:40 UTC (8,103 KB)

Computer Science > Neural and Evolutionary Computing

Title:Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators