【ERC Coffee House Tech Talk Series】Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine [Subarna Chatterjee]

Date

Thursday 14th July @ 14:00 – 15:00 (UK time)

Presenter

Subarna Chatterjee

Affiliation

Harvard University

Location

[Online] Meeting link: https://welink.zhumu.com/j/159295680

Abstract

We present a self-designing key-value storage engine, Cosine, which can take the shape of the close to “perfect” engine architecture given an input workload, a cloud budget, a target performance, and required cloud SLAs. By identifying and formalizing the first principles of storage engine layouts and core key-value algorithms, Cosine constructs a massive design space comprising of sextillion (10^36) possible storage engine designs over a diverse space of hardware and cloud pricing policies for three cloud providers – AWS, GCP, and Azure. Cosine spans across diverse designs such as Log-Structured Merge-trees, B-trees, Log-Structured Hash-tables, in-memory accelerators for filters and indexes as well as trillions of hybrid designs that do not appear in the literature or industry but emerge as valid combinations of the above. Cosine includes a unified distribution-aware I/O model and a learned concurrency-aware CPU model that with high accuracy can calculate the performance and cloud cost of any possible design on any workload and virtual machines. Cosine can then search through that space in interactive times to find the best design and materializes the actual code of the resulting storage engine design using a templated Rust implementation. We demonstrate that on average Cosine outperforms state-of-the-art storage engines such as write-optimized RocksDB, read-optimized WiredTiger, and very write-optimized FASTER by 23x, 25x, and 20x, respectively, for diverse workloads, data sizes, and cloud budgets across all YCSB core workloads and many variants.

Short Bio

Subarna Chatterjee is a post-doc at Harvard University advised by Stratos Idreos. Her research is about improving the performance of modern data systems by reasoning about the read-write tradeoff of the underlying data structures and algorithms. Prior to joining Harvard, she did her Ph.D. from Indian Institute of Technology Kharagpur and her first post-doc at Inria, Rennes, France. In 2016, she was selected as one of the “10 Women in Networking/Communications That You Should Watch” and is one of the young scientists to attend the Heidelberg Laureate Forum.

【ERC Coffee House Tech Talk Series】Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine [Subarna Chatterjee] / Huawei-Edinburgh Joint Lab by blogadmin is licensed under a Creative Commons Attribution CC BY 3.0

Posted by Antonios Katsarakis

30th June 2022

Categories

tech talk

Tags

No tags have been added to this post.

Previous post

【ERC Coffee House Tech Talk Series】In-Network Support for microsecond-scale Remote Procedure Calls and Policy Enforcement [Marios Kogias]

Next post

Upcoming paper in OOPSLA: Higher Level Effect Handlers in C++

Comments are closed

Comments to this thread have been closed by the post author or by an administrator.

Huawei-Edinburgh Joint Lab