Dual Swap Disentangling

Feng, Zunlei; Wang, Xinchao; Ke, Chenglong; Zeng, Anxiang; Tao, Dacheng; Song, Mingli

Computer Science > Computer Vision and Pattern Recognition

arXiv:1805.10583 (cs)

[Submitted on 27 May 2018 (v1), last revised 1 Jan 2020 (this version, v3)]

Title:Dual Swap Disentangling

Authors:Zunlei Feng, Xinchao Wang, Chenglong Ke, Anxiang Zeng, Dacheng Tao, Mingli Song

View PDF

Abstract:Learning interpretable disentangled representations is a crucial yet challenging task. In this paper, we propose a weakly semi-supervised method, termed as Dual Swap Disentangling (DSD), for disentangling using both labeled and unlabeled data. Unlike conventional weakly supervised methods that rely on full annotations on the group of samples, we require only limited annotations on paired samples that indicate their shared attribute like the color. Our model takes the form of a dual autoencoder structure. To achieve disentangling using the labeled pairs, we follow a "encoding-swap-decoding" process, where we first swap the parts of their encodings corresponding to the shared attribute and then decode the obtained hybrid codes to reconstruct the original input pairs. For unlabeled pairs, we follow the "encoding-swap-decoding" process twice on designated encoding parts and enforce the final outputs to approximate the input pairs. By isolating parts of the encoding and swapping them back and forth, we impose the dimension-wise modularity and portability of the encodings of the unlabeled samples, which implicitly encourages disentangling under the guidance of labeled pairs. This dual swap mechanism, tailored for semi-supervised setting, turns out to be very effective. Experiments on image datasets from a wide domain show that our model yields state-of-the-art disentangling performances.

Comments:	Accepted by NeurIPS 2018; Adding the theoretical proof for the disentanglement of labeled pairs
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1805.10583 [cs.CV]
	(or arXiv:1805.10583v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1805.10583

Submission history

From: Zunlei Feng [view email]
[v1] Sun, 27 May 2018 06:14:21 UTC (3,271 KB)
[v2] Sat, 23 Jun 2018 08:48:21 UTC (3,404 KB)
[v3] Wed, 1 Jan 2020 07:33:44 UTC (3,404 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dual Swap Disentangling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dual Swap Disentangling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators