Improving Face Recognition by Clustering Unlabeled Faces in the Wild

RoyChowdhury, Aruni; Yu, Xiang; Sohn, Kihyuk; Learned-Miller, Erik; Chandraker, Manmohan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.06995 (cs)

[Submitted on 14 Jul 2020 (v1), last revised 15 Jul 2020 (this version, v2)]

Title:Improving Face Recognition by Clustering Unlabeled Faces in the Wild

Authors:Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker

View PDF

Abstract:While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotation. Prior work has mostly been in controlled settings, where the labeled and unlabeled data sets have no overlapping identities by construction. This is not realistic in large-scale face recognition, where one must contend with such overlaps, the frequency of which increases with the volume of data. Ignoring identity overlap leads to significant labeling noise, as data from the same identity is split into multiple clusters. To address this, we propose a novel identity separation method based on extreme value theory. It is formulated as an out-of-distribution detection algorithm, and greatly reduces the problems caused by overlapping-identity label noise. Considering cluster assignments as pseudo-labels, we must also overcome the labeling noise from clustering errors. We propose a modulation of the cosine loss, where the modulation weights correspond to an estimate of clustering uncertainty. Extensive experiments on both controlled and real settings demonstrate our method's consistent improvements over supervised baselines, e.g., 11.6% improvement on IJB-A verification.

Comments:	ECCV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.06995 [cs.CV]
	(or arXiv:2007.06995v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.06995

Submission history

From: Aruni RoyChowdhury [view email]
[v1] Tue, 14 Jul 2020 12:26:50 UTC (625 KB)
[v2] Wed, 15 Jul 2020 17:30:17 UTC (625 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Face Recognition by Clustering Unlabeled Faces in the Wild

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Face Recognition by Clustering Unlabeled Faces in the Wild

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators