Efficient Compactions Between Storage Tiers with PrismDB

Raina, Ashwini; Lu, Jianan; Cidon, Asaf; Freedman, Michael J.

arXiv:2008.02352 (cs)

[Submitted on 5 Aug 2020 (v1), last revised 25 May 2022 (this version, v6)]

Abstract: In recent years, emerging storage hardware technologies have focused on divergent goals: better performance or lower cost-per-bit. Correspondingly, data systems that employ these technologies are typically optimized either to be fast (but expensive) or cheap (but slow). We take a different approach: by architecting a storage engine to natively utilize two tiers of fast and low-cost storage technologies, we can achieve a Pareto-efficient balance between performance and cost-per-bit. This paper presents the design and implementation of PrismDB, a novel key-value store that exploits two extreme ends of the spectrum of modern NVMe storage technologies (3D XPoint and QLC NAND) simultaneously. Our key contribution is how to efficiently migrate and compact data between two different storage tiers. Inspired by the classic cost-benefit analysis of log cleaning, we develop a new algorithm for multi-tiered storage compaction that balances the benefit of reclaiming space for hot objects in fast storage with the cost of compaction I/O in slow storage. Compared to the standard use of RocksDB on flash in datacenters today, PrismDB's average throughput on tiered storage is 3.3$\times$ faster and its read tail latency is 2$\times$ better, using equivalently priced hardware.

Subjects:	Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2008.02352 [cs.DB]
	(or arXiv:2008.02352v6 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2008.02352

Submission history

From: Ashwini Raina
[v1] Wed, 5 Aug 2020 20:34:47 UTC (5,058 KB)
[v2] Thu, 24 Sep 2020 04:05:18 UTC (5,491 KB)
[v3] Thu, 23 Sep 2021 15:53:47 UTC (7,027 KB)
[v4] Wed, 19 Jan 2022 19:54:48 UTC (8,355 KB)
[v5] Fri, 21 Jan 2022 05:20:18 UTC (8,356 KB)
[v6] Wed, 25 May 2022 18:28:47 UTC (8,673 KB)
