State of Multi-cloud Storage and Compute

Posted by Pradeep Madhavarapu and Mayur Kulkarni

Too Long, Must-read

The [Live] Multi-cloud-native architecture can deliver a 10X better TCO than the other variants.  Some other unique insights we found – 

  • Azure’s NVMe drives beat AWS and GCP latencies by a significant margin.
  • AWS delivered the most respectable IO latencies when using the best network-attached storage.
  • bi(OS) flips compute utilization for OLAP use cases – processing consumes most, followed by onboarding and then consumption.

Use Case

Imagine an e-tailer planning for Black Friday 2022. Their target audience is the 50M+ consumers on the eastern seaboard of the United States. Even a minute’s downtime isn’t acceptable. At the same time, costs need to be on par with (or lower than) a single-cloud deployment. And the architecture should scale out with no single point of failure. This e-tailer also has to deliver in-session personalization for anonymous users. Based on our experience working with Global Enterprises and Unicorns, we modeled the following read-write pattern – 

  • Product Views, Add-to-cart, and Orders are written in real-time to bi(OS) by the customer’s micro-services.
  • These raw events are read by a microservice, going back over the past few days. In other words, we modeled an “unbounded table” where data is appended to and read from in a sliding-window manner. Further, for these reads, we did not assume any clever tricks such as indexing or aggregations.
  • Business users are reading micro-aggregates in real-time and making instantaneous decisions.
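The “unbounded table” access pattern above can be sketched as follows. This is a hypothetical illustration of the workload model, not bi(OS) code: events are appended in real time and read back over a sliding window of the past few days, with a full scan and no indexes or pre-aggregation.

```python
from collections import deque
from datetime import datetime, timedelta

class UnboundedTable:
    """Append-only event log read over a sliding time window."""

    def __init__(self, window_days=3):
        self.window = timedelta(days=window_days)
        self.rows = deque()  # (timestamp, event) pairs, append-only

    def append(self, ts, event):
        self.rows.append((ts, event))

    def read_window(self, now):
        # Full scan over the retained rows -- deliberately no
        # "clever tricks" of indexing or aggregation.
        cutoff = now - self.window
        return [event for ts, event in self.rows if ts >= cutoff]

now = datetime(2022, 11, 25)
t = UnboundedTable(window_days=3)
t.append(now - timedelta(days=5), {"type": "product_view"})  # outside window
t.append(now - timedelta(days=1), {"type": "add_to_cart"})
t.append(now, {"type": "order"})
print(len(t.read_window(now)))  # 2 events fall inside the 3-day window
```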


bi(OS) was deployed in a multi-cloud manner across the Big 3’s regions in Virginia, US. This was the most common region with availability for the resources our test required.

We attempted to keep the configuration consistent across the Big 3 with the following rules – 

  • Aim for 8GB of RAM per vCPU and use 8 vCPU machines
  • Use local SSDs for data; bi(OS) provides the redundancy across host, rack, DC, and Cloud
  • Use a single high-performance network attached disk of 512GB for logs
  • Up to 10Gbps of network IO bandwidth
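The sizing rules above can be captured as a per-node configuration sketch. The values below are illustrative assumptions derived from the rules, not the actual machine types used in the test.

```python
# Hypothetical per-node sizing for the rules above.
NODE = {
    "vcpus": 8,
    "ram_gb": 64,               # aim for 8 GB of RAM per vCPU
    "data_storage": "local-ssd",  # bi(OS) handles redundancy
    "log_disk_gb": 512,         # single high-performance network-attached disk
    "network_gbps": 10,         # up to 10 Gbps of network IO bandwidth
}

# Sanity-check the RAM-per-vCPU rule.
assert NODE["ram_gb"] / NODE["vcpus"] == 8
```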

We ran a sustained load of ~1000 (inserts + upserts + selects) per second. Inserts and upserts were done one row at a time, while selects read ~500 rows per call. The load ran for 9+ hours, and during this time no aspect of the system was saturated.
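A minimal sketch of one second of that load mix, assuming an even random split across the three operation types (the actual mix used in the test is not stated):

```python
import random

OPS_PER_SECOND = 1000   # ~1000 (inserts + upserts + selects) / second
ROWS_PER_SELECT = 500   # selects read ~500 rows per call

def one_second_of_load(rng):
    """Simulate one second of the benchmark's operation mix."""
    ops = {"insert": 0, "upsert": 0, "select": 0}
    rows_written = rows_read = 0
    for _ in range(OPS_PER_SECOND):
        op = rng.choice(["insert", "upsert", "select"])
        ops[op] += 1
        if op == "select":
            rows_read += ROWS_PER_SELECT
        else:
            rows_written += 1  # inserts and upserts write one row at a time
    return ops, rows_written, rows_read

rng = random.Random(42)
ops, written, read = one_second_of_load(rng)
print(sum(ops.values()))  # 1000 operations issued
```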


  1. Read and Write Latencies of NVMe Drives – Azure wins.
  2. Read and Write Latencies of network-attached high-performance storage – AWS wins.
  3. bi(OS) flips compute utilization on its head for OLAP use cases.

Conclusion

We tested the Big 3 using the [live] multi-cloud deployment mode of bi(OS). The test generated a ton of price-performance data. We plan to analyze all of it and produce a thorough report in the coming days. In summary, the Cloud confusion is real, and Isima is here to help. This system will be showcased at the premier multi-cloud conference in a few weeks. Join us to experience it for yourself and be amazed. If you can’t wait, sign up and go live in days!