Quaternion Recurrent Neural Networks

Parcollet, Titouan; Ravanelli, Mirco; Morchid, Mohamed; Linarès, Georges; Trabelsi, Chiheb; De Mori, Renato; Bengio, Yoshua

arXiv:1806.04418 (stat)

[Submitted on 12 Jun 2018 (v1), last revised 7 Jan 2019 (this version, v3)]

Abstract: Recurrent neural networks (RNNs) are powerful architectures for modeling sequential data, owing to their ability to learn short- and long-term dependencies between the basic elements of a sequence. Nonetheless, popular tasks such as speech or image recognition involve multi-dimensional input features that are characterized by strong internal dependencies between the dimensions of the input vector. We propose a novel quaternion recurrent neural network (QRNN), alongside a quaternion long short-term memory neural network (QLSTM), that take into account both the external relations and these internal structural dependencies with the quaternion algebra. Similarly to capsules, quaternions allow the QRNN to code internal dependencies by composing and processing multidimensional features as single entities, while the recurrent operation reveals correlations between the elements composing the sequence. We show that both QRNN and QLSTM achieve better performance than RNN and LSTM in a realistic application of automatic speech recognition. Finally, we show that QRNN and QLSTM reduce the number of free parameters needed by a factor of up to 3.3x compared to real-valued RNNs and LSTMs, while reaching better results, leading to a more compact representation of the relevant information.
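To illustrate what "processing multidimensional features as single entities" with the quaternion algebra can look like, here is a minimal Python sketch of the Hamilton product, the standard multiplication of two quaternions, together with a rough parameter count for a single dense weight matrix. The function names (`hamilton_product`, `real_dense_params`, `quaternion_dense_params`) and the layer sizes are hypothetical and chosen for illustration only; this is not the authors' implementation. The count assumes each quaternion weight stores 4 real numbers and connects one 4-dimensional input group to one 4-dimensional output group. It covers one weight matrix only and does not reproduce the 3.3x figure reported in the abstract, which concerns full models.

```python
from typing import Tuple

# A quaternion r + xi + yj + zk stored as (r, x, y, z).
Quaternion = Tuple[float, float, float, float]


def hamilton_product(p: Quaternion, q: Quaternion) -> Quaternion:
    """Return the Hamilton product p * q (note: not commutative)."""
    r1, x1, y1, z1 = p
    r2, x2, y2, z2 = q
    return (
        r1 * r2 - x1 * x2 - y1 * y2 - z1 * z2,  # real part
        r1 * x2 + x1 * r2 + y1 * z2 - z1 * y2,  # i component
        r1 * y2 - x1 * z2 + y1 * r2 + z1 * x2,  # j component
        r1 * z2 + x1 * y2 - y1 * x2 + z1 * r2,  # k component
    )


def real_dense_params(n_in: int, n_out: int) -> int:
    """Weights in a real-valued n_in x n_out matrix (bias ignored)."""
    return n_in * n_out


def quaternion_dense_params(n_in: int, n_out: int) -> int:
    """Real numbers stored by a quaternion weight matrix of the same
    real-valued size. Assumption: features are grouped in blocks of 4,
    and each quaternion weight holds 4 real values."""
    if n_in % 4 or n_out % 4:
        raise ValueError("dimensions must be multiples of 4")
    return (n_in // 4) * (n_out // 4) * 4


if __name__ == "__main__":
    i = (0, 1, 0, 0)
    j = (0, 0, 1, 0)
    print(hamilton_product(i, j))  # i * j
    print(hamilton_product(j, i))  # j * i, opposite sign
    print(real_dense_params(256, 256), quaternion_dense_params(256, 256))
```

Because one quaternion weight mixes all four components of its input through the Hamilton product, the four real values it stores play the role that sixteen independent values would play in an unconstrained 4x4 real block, which is where the per-matrix saving in this sketch comes from.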

Comments:	ICLR Update - Full rework
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1806.04418 [stat.ML]
	(or arXiv:1806.04418v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1806.04418

Submission history

From: Titouan Parcollet
[v1] Tue, 12 Jun 2018 09:49:40 UTC (33 KB)
[v2] Thu, 5 Jul 2018 09:58:00 UTC (37 KB)
[v3] Mon, 7 Jan 2019 10:24:11 UTC (795 KB)
