SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shifts

Kimura, Masanari; Nakamura, Takuma; Saito, Yuki


arXiv:2108.12992 (cs)

[Submitted on 30 Aug 2021 (v1), last revised 8 Mar 2023 (this version, v2)]

Abstract: This paper addresses the problem of set-to-set matching, which involves matching two different sets of items based on some criteria, especially in the case of high-dimensional items like images. Although neural networks have been applied to this problem, most machine learning-based approaches assume that the training and test data follow the same distribution, which is not always true in real-world scenarios. To address this limitation, we introduce SHIFT15M, a dataset that can be used to evaluate set-to-set matching models when the distribution of data changes between training and testing. We conduct benchmark experiments that demonstrate the performance drop of naive methods due to distribution shift. Additionally, we provide software to handle the SHIFT15M dataset in a simple manner, with the URL for the software to be made available after publication of this manuscript. We believe the proposed SHIFT15M dataset provides a valuable resource for evaluating set-to-set matching models under distribution shift.
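
To make the task concrete, the following is a minimal sketch of set-to-set matching: given a query set of item feature vectors, pick the candidate set that scores highest against it. The scoring rule (cosine similarity between mean-pooled features), the function names, and the toy vectors are all illustrative assumptions, not the method or data used in the paper.

```python
import math

# Hypothetical baseline for illustration only; the paper does not
# prescribe this scoring rule.

def mean_pool(item_set):
    """Average a non-empty set of equal-length feature vectors."""
    dim = len(item_set[0])
    return [sum(vec[d] for vec in item_set) / len(item_set) for d in range(dim)]

def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def match_score(set_x, set_y):
    """Score how well two sets match via their pooled representations."""
    return cosine(mean_pool(set_x), mean_pool(set_y))

def best_match(query_set, candidate_sets):
    """Return the index of the candidate set with the highest score."""
    scores = [match_score(query_set, c) for c in candidate_sets]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy 2-D features standing in for image embeddings (made-up values).
query = [[1.0, 0.0], [0.8, 0.2]]
candidates = [
    [[0.9, 0.1], [1.0, 0.0]],  # similar to the query
    [[0.0, 1.0], [0.1, 0.9]],  # dissimilar to the query
]
chosen = best_match(query, candidates)
```

Under a distribution shift, a scorer of this kind would be fitted or tuned on data from one condition and evaluated on data from another, which is the setting the dataset is intended to support.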

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.12992 [cs.LG]
	(or arXiv:2108.12992v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.12992

Submission history

From: Masanari Kimura
[v1] Mon, 30 Aug 2021 05:07:59 UTC (1,637 KB)
[v2] Wed, 8 Mar 2023 15:25:18 UTC (20,728 KB)
