On InstaHide, Phase Retrieval, and Sparse Matrix Factorization

Chen, Sitan; Song, Zhao; Zhuo, Danyang

arXiv:2011.11181v1 (cs)

[Submitted on 23 Nov 2020 (this version), latest version 25 Mar 2021 (v2)]

Abstract: In this work, we examine the security of InstaHide, a scheme recently proposed by [Huang, Song, Li and Arora, ICML'20] for preserving the security of private datasets in the context of distributed learning. To generate a synthetic training example to be shared among the distributed learners, InstaHide takes a convex combination of private feature vectors and randomly flips the sign of each entry of the resulting vector with probability 1/2. A salient question is whether this scheme is secure in any provable sense, perhaps under a plausible hardness assumption and assuming the distributions generating the public and private data satisfy certain properties.
We show that the answer to this appears to be quite subtle and closely related to the average-case complexity of a new multi-task, missing-data version of the classic problem of phase retrieval. Motivated by this connection, we design a provable algorithm that can recover private vectors using only the public vectors and synthetic vectors generated by InstaHide, under the assumption that the private and public vectors are isotropic Gaussian.
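
The generation step described in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `instahide_sketch`, the choice of `k`, the way the mixing weights are drawn, and the restriction to private vectors only are all hypothetical. It mixes `k` Gaussian vectors with weights that sum to 1, flips each coordinate's sign with probability 1/2, and checks that only the sign information is lost, which is the link to phase retrieval.

```python
import random


def instahide_sketch(private_vectors, k, rng):
    """Hypothetical sketch: mix k private vectors, then flip each sign w.p. 1/2."""
    # Pick k distinct private vectors to combine.
    chosen = rng.sample(private_vectors, k)
    # Assumed weight scheme: random nonnegative weights normalized to sum to 1,
    # which makes the mixture a convex combination.
    raw = [rng.random() for _ in range(k)]
    total = sum(raw)
    weights = [r / total for r in raw]
    dim = len(chosen[0])
    mixed = [sum(w * v[j] for w, v in zip(weights, chosen)) for j in range(dim)]
    # Independent sign flip for every coordinate, each with probability 1/2.
    synthetic = [m if rng.random() < 0.5 else -m for m in mixed]
    return mixed, synthetic


rng = random.Random(0)
dim, n = 8, 5
# Isotropic Gaussian private vectors, matching the abstract's assumption.
private = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]
mixed, synthetic = instahide_sketch(private, k=2, rng=rng)

# The sign flips hide the signs but leave every magnitude unchanged.
magnitudes_match = all(abs(s) == abs(m) for s, m in zip(synthetic, mixed))
print(magnitudes_match)
```

Because the flips are independent and unbiased, a synthetic vector carries the same information as the entrywise absolute values of the mixture, so an attacker effectively observes magnitudes of unknown linear combinations rather than the combinations themselves.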

Comments:	29 pages
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
Cite as:	arXiv:2011.11181 [cs.LG]
	(or arXiv:2011.11181v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.11181

Submission history

From: Sitan Chen
[v1] Mon, 23 Nov 2020 02:47:08 UTC (177 KB)
[v2] Thu, 25 Mar 2021 00:08:38 UTC (183 KB)
