Colossus for Rapid Storage (cloud.google.com)
243 points by alobrah 3 days ago | 115 comments





(This was posted last night with https://cloud.google.com/blog/products/compute/whats-new-wit... above. We've changed the URL to the product-specific article.)


Very cool! This makes Google the only major cloud that has low-latency single-zone object storage, standard regional object storage, and transparently-replicated dual-region object storage - all with the same API.

For infra systems, this is great: code against the GCS API, and let the user choose the cost/latency/durability tradeoffs that make sense for their use case.


> This makes Google the only major cloud that has low-latency single-zone object storage, standard regional object storage,

Absurd claim. S3 Express launched last year.


Sure, but AFAIK S3’s multi-region capabilities are quite far behind GCS’s.

S3 offers some multi-region replication facilities, but as far as I’ve seen they all come at the cost of inconsistent reads - which greatly complicates application code. GCS dual-region buckets offer strongly consistent metadata reads across multiple regions, transparently fetch data from the source region where necessary, and offer clear SLAs for replication. I don’t think the S3 offerings are comparable. But maybe I’m wrong - I’d love more competition here!

https://cloud.google.com/blog/products/storage-data-transfer...


> Sure, but AFAIK S3’s multi-region capabilities are quite far behind GCS’s.

Entirely different claim.


I claimed that Google is the only major cloud provider with all three of:

- single-zone object storage buckets

- regional object storage buckets

- transparently replicated, dual region object storage buckets

I agree that AWS has two of the three. AFAIK AWS does not have multi-region buckets - the closest they have is canned replication between single-region buckets.


Not quite the same, but S3 does have https://aws.amazon.com/s3/features/multi-region-access-point..., which lets you treat multiple buckets in different regions as a single bucket (mostly). But you still need to set up canned replication.
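
Roughly, once the access point and replication rules exist, the client side is just normal S3 calls with the MRAP ARN in place of the bucket name. A minimal sketch (the account ID and MRAP alias are placeholders, and SigV4A signing via the AWS CRT - pip install "boto3[crt]" - may be required):

  import boto3

  s3 = boto3.client("s3")

  # The Multi-Region Access Point ARN goes wherever a bucket name would.
  # The account ID and alias below are made up for illustration.
  mrap_arn = "arn:aws:s3::123456789012:accesspoint/mfzwi23gnjvgw.mrap"

  resp = s3.get_object(Bucket=mrap_arn, Key="path/to/object")
  data = resp["Body"].read()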

S3 doesn't have "transparently-replicated dual-region object storage", which was part of the claim.

S3 does have replication, but it is far from transparent and fraught with gotchas.

And it certainly doesn't have all of that with a single API.


Isn't S3 Express not the same API? You have to use a "directory bucket" which isn't an object store anymore, as it has actual directories.

To be honest I'm not actually sure how different the API is. I've never used it. I just frequently trip over the existence of parallel APIs for directory buckets (when I'm doing something niche, mostly; I think GetObject/PutObject are the same.)



The cross-region replication I’ve seen for S3 (including the link you’ve provided) is fundamentally different from a dual-region GCS bucket. AWS is providing a way to automatically copy objects between distinct buckets, while GCS is providing a single bucket that spans multiple regions.

It’s much, much easier to code against a dual-region GCS bucket because the bucket namespace and object metadata are strongly consistent across regions.
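
To illustrate: from the client's point of view, a dual-region bucket is just a bucket. A minimal sketch with the google-cloud-storage Python client (the bucket and object names here are made up):

  from google.cloud import storage

  # Reading from a dual-region bucket looks identical to reading from a
  # regional one; the replication is handled by GCS, not by client code.
  client = storage.Client()
  bucket = client.bucket("my-dual-region-bucket")   # placeholder name
  blob = bucket.blob("datasets/part-0001.parquet")  # placeholder object
  data = blob.download_as_bytes()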


The semantics they are offering are very different from S3. In Colossus a writer can make a durable 1-byte append and other observers are able to reason about the commit point. S3 does not offer this property.

Sure, but that's not what the parent said.

FYI, this was unveiled at the 2025 Google Cloud Next conference, and they're apparently also releasing a gRPC client for Rapid Storage, which appears to be a very thin wrapper over Colossus itself, since this is just zonal storage.

Struggling to find a definition, but seemingly zonal just means there's a massive instance per cluster.

Did find some interesting recent (March 28th, 2025) reads though!

Colossus under the hood: How we deliver SSD performance at HDD prices https://cloud.google.com/blog/products/storage-data-transfer...

I kind of thought you meant ZNS / https://zonedstorage.io/ at first, or its more recent, better, awesomer counterpart Host Directed Placement (HDP). I wish someone would please please advertise support for HDP; it sounds like such a free win, tackling so many write amplification issues for so little extra complexity: just say which stream you want to write to, and writes to that stream will go onto the same superblock. Duh, simple, great.


Delivering "HDD prices" is a bold claim there.

They charge $20/TB/month for basic cloud storage. You can build storage servers for $20/TB flat. If you add 10% for local parity, 15% free space, 5% in spare drives, and $2000/rack/month overhead, then triple everything for redundancy purposes, then over a 3-year period the price of using your own hard drives is about $115/TB while Google's price is $720. Over 5 years it's $145 versus $1200. And that's before they charge you massive bandwidth fees.
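
A back-of-the-envelope version of that math (my own sketch; the ~6,000 TB of raw capacity per rack is an assumption I added to reproduce the figures above):

  # Rough reproduction of the self-hosted vs. GCS math above.
  build_per_tb = 20 * 1.10 * 1.15 * 1.05  # $20/TB + parity, free space, spares
  build_per_tb *= 3                       # triple everything for redundancy
  rack_tb = 6000                          # assumed raw TB per rack
  overhead_per_tb_month = (2000 / rack_tb) * 3  # $2000/rack/month, tripled

  for months in (36, 60):
      self_hosted = build_per_tb + overhead_per_tb_month * months
      gcs = 20 * months
      print(f"{months} months: self-hosted ~${self_hosted:.0f}/TB vs GCS ${gcs}/TB")
  # -> roughly $116/TB vs $720 at 3 years, $140/TB vs $1200 at 5 years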


I like your comparison with self-built storage, but comparing $20/TB/month with other CLOUD offerings, we see:

* Hetzner Storage Box starts from $4/month for 1TB, and goes down to $2.40/TB/month if you rent a 10TB box.

* Mega starts from €10/month for 2TB, and goes down to €2/TB/month if you get a 16TB plan.

* Backblaze costs (starts from?) $6/TB/month.

I was looking for cheap cloud storage recently, so I have a list of these numbers :)

Moreover, these are not even the cheapest ones. The cheapest one I found (called Uloz, but I haven't tested them yet) has prices starting from $6.50 for 5TB, going down to $0.64/TB/month for plans of 25TB and up.

Also, looking at LowEndBox you can find a VPS in Canada with 2TB of storage for $5/month and run whatever you want there.

How does all that compare to $20/TB/month?!

Please feel free to correct me if I'm comparing apples to oranges, though. But I can't believe all of these offers are scams or so-called "promotional" offers that cost the companies more than you pay for them.


Thank you. So Backblaze for $6/TB a month. I could have a TB of data backed up safely against file corruption? I wonder how I missed that.

Now you could use it with a Synology NAS, and it's a lot cheaper than doing RAID 5 or ZFS/Btrfs with multi-drive redundancy.

I wonder if there are any NAS that do that automatically? Any drawbacks? Also wondering if the price could go down to $5/TB in a few years' time.


The price of Backblaze WAS $5 a few years ago and they increased it to $6 (and added some free bandwidth).

I'm still annoyed they increased the price for B2. Maybe "free" bandwidth gets people to use it more? But as far as their costs go, between the time they launched at $5 and the time they upped it to $6, hard drives (and servers full of hard drives) cost half as much per TB, with 1/4 as many servers needed for the same number of TB.

I get the impression that business has always been about being the best schmoozer more than about having the best product.

BTW at Hetzner you can rent servers with very large (hundreds of TB) non-redundant storage for an effective price of about $1.50/TB/month. If you want to build a cloud storage product, that seems like a good starting point - of course, once you take into account redundancy, spare capacity, and paying yourself, the prices you charge to your customers will end up closer to the price of Backblaze at a minimum.


>I get the impression that business has always been about being the best schmoozer more than about having the best product

and thus market efficiency feels like a myth. This feels most true when it comes to cloud services: they're way overpriced in multiple common cases at the big providers.


Yes, this is pretty much what Hetzner must have built with their object storage - and they get to 5 EUR/month, so really close to Backblaze pricing.

Of what you mentioned, only backblaze is similar (object storage with S3-like API), all others are apples to oranges.

You don't need very many terabytes to cover the labor cost of installing and maintaining an S3-compatible server program.

You need a very big cluster for it to be worth it though for non-backup use-cases when using HDDs.

You forgot paying yourself to set that up.

That's covered by the build and overhead numbers. But if you want more on the build side, an extra $10k of labor per rack of 9 servers only increases the cost per TB by about $4.

You're paying the same for "cloud engineers".

Also, don't forget the hidden cost/risk of giving a third party full access to your data.


Clicking yourself a bucket takes 5 minutes.

Building a server, keeping it secure and up to date, and fixing hardware issues takes real time.


Not to mention that I can:

- Create a bucket and store 1MB in it without any overhead

- Create 50 buckets with strong perimeters around them such that someone deleting the entire account doesn’t bring down the other 49

- Create a bucket and fill it with terabytes of data within seconds, without needing to wait for hardware to be racked and stacked

- Create a bucket, fill it with 2TB of data, and delete it tomorrow

Cloud is more than bare metal, but plenty of folks discount the cost benefits of elasticity.


I suspect the problem is that we're engineers in domains that have very different needs.

For example, I agree that elasticity is great. But at the same time, to me, it sounds like bad engineering. Why do you need to store terabytes of data and then delete it - couldn't it be processed continuously, streamed, compressed, restricted to changes only, and so on? A lot of engineering today is incredibly wasteful. Maybe your data source doesn't care and just provides you with terabyte CSV files, and you have no choice, but for engineers who care about efficiency, it reeks.

It might make a lot of sense in a highly corporate context where everything is hard, nobody cares, and the cost of inefficiency is just passed on to the customer (i.e. often governments and taxpayers). But the real problem here is that customers aren't demanding more efficiency.


The audit requirement alone gives you a lot of reasons to keep data, even if it gets downsampled one way or the other.

And plenty of use cases have natural growth. I do not throw away my pictures, for example.

Data also grows with users. More users, more 'live' data.

We have such a huge advantage with digital; we need to stop thinking it's wasteful. Everything we do digitally (pictures, finance data, etc.) is so much more energy and space efficient than what we had 20 years ago. We should not delete data just because we feel it's wasteful.


Leverage erasure coding for durability and avoid both the tripling and the local parity. You'll get better durability than 3x replication while taking up significantly less than 2x the space. Backblaze open sourced their library and talk about it here: https://www.backblaze.com/blog/Reed-Solomon. They use a 17-data/3-parity (20 shards total) layout that gets them 3-drive failure resistance for just a 1.17x stretch (i.e. a 100MB file gets that resilient while taking up only about 117MB of space).
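
A quick sanity check on those numbers (my own sketch, using the 17-data/3-parity split described in that post):

  # Storage stretch and fault tolerance for a Reed-Solomon (k data + m parity) layout.
  def rs_overhead(k: int, m: int) -> tuple[float, int]:
      """Return (stretch factor, number of shard/drive failures tolerated)."""
      return (k + m) / k, m

  stretch, failures = rs_overhead(k=17, m=3)
  print(f"17+3 shards: {stretch:.2f}x stretch, survives {failures} drive failures")
  # -> ~1.18x stretch (a 100MB file occupies ~118MB), vs 3x for plain triplication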

"Zonal" relates to the concept of "availability zones" which are the next-smallest unit below a (physical) "region."

Most instances of a cloud ___ created in a region are allocated and exist at the zonal level (i.e. a specific zone of a region).

A physical "region" usually consists of three or more availability zones, and each zone is physically separated from other zones, limiting the potential for foreseeable disaster events from affecting multiple zones simultaneously. Zones are close enough networking-wise to have high throughput and low latency interconnection, but not as fast as same-rack, same-cluster communications.

Systems requiring high availability (or replication) generally attain this by placing instances (or replicas) in multiple availability zones; systems with even higher availability requirements may use multi-region replication, which comes at greater cost.


It’s GCP’s answer to AWS S3 Express One Zone: https://aws.amazon.com/s3/storage-classes/express-one-zone/

In Google Cloud parlance, "regional" usually means "transparently master-master replicated across the availability zones within a region", while "zonal" means "not replicated, it just is where it is."

Slight nit: "zonal" doesn't necessarily mean "not replicated", it means that the replicas could all be within the same zone. That means they can share more points of failure. (I don't know if there's an official definition of zonal.)

NB: I am on the rapid storage team.


> Struggling to find a definition, but seemingly zonal just means there's a massive instance per cluster.

There are a number of zones in a region. Region usually means city. Zone can mean data center. Rarely, it just means some sort of isolation (separate power / network).


What on this page gives you that impression? Do I have to watch the 2-hour video to learn this?

Of course not. Gemini can summarize it for you.

I mean, sure, it can easily provide quick text summaries of this sort of thing, but I only consume ML summaries in the forms of podcast discussions between two simulated pundits, as God intended.

This could actually speed up some of my scientific computing (in some cases, data localization/delocalization is an important part of overall instance run-time). I will be interested to try it.

Had to go back to the classic microservices video as I was pretty sure they used Colossus but it was actually Galactus & Omega Star.

This is what OP is referring to in case you haven’t been enlightened https://youtu.be/y8OnoxKotPQ?si=JAK5iPMcG1yoAhiT

Glad to see the zonal object store take off. Such massive bandwidth will redefine data analytics, where 99% of all queries are able to run on a single node faster than what distributed compute can offer.

This link makes so much more sense than the previous link did.

SSDs with high random I/O speeds are a significant contributor to the advantage. I think the 20M writes per second are likely distributed over a network of drives to make that kind of speed possible.


Similar to S3 Express One Zone.

Is S3 Express One Zone performance greatly improved over standard S3, like GCP Rapid Storage? My understanding is that S3 Express One Zone is just more cost effective.

> 20x faster random-read data loading than a Cloud Storage regional bucket.


Update: just read this article[1], which clarifies S3 Express One Zone. Yes, performance is greatly improved, but storage actually costs 8x more than a standard S3 bucket. The name "S3 Express One Zone" is terrible and a bit misleading about the pricing changes.

[1] https://www.warpstream.com/blog/s3-express-is-all-you-need


I understand your belief that One Zone implies less expensive, but I’m staunchly in favor of them having it in the name so people know that their data is in a single AZ. The storage class succinctly summarizes faster with lower availability.

Fair. How about instead of S3 Express they call it S3 Max (One Zone)? It doesn’t take a rocket scientist to come up with good product names, just copy Apple. Though I suppose this is what happens when engineers are left to do the marketing. :-)

If Apple's so great at naming things, tell me (without looking) which is bigger/better/faster for their CPUs: Max or Ultra?

ha, ha, fair. Ultra. To be fair, I own a MacBook Pro M1 Max and Mac Mini M4 Pro and follow Apple products closely.

Yep, I love Apple, follow them closely, own a Mac Studio with an M3 Ultra and a MacBook Pro with an M4 Max, and it's still confusing. :)

I mean, surely a Mac Studio with an M4 Max must be the best, right? It's an entire CPU generation ahead and it's maximum! Of course, it's not... the M3 Ultra is the best.

Naming things is hard.


Yes, it’s horribly more expensive… I think you are thinking of S3 One Zone-Infrequent Access.

AWS just reduced prices on S3 Express One Zone today.

AWS claims 10x lower latency but I haven't personally checked.

I want Chubby as a service so I can throw etcd and ZooKeeper in the trash.

For some reason, text highlight didn't work, so here's the text-highlighted link: https://cloud.google.com/blog/products/compute/whats-new-wit...

That link doesn't work for me, so here's the relevant bit:

Rapid Storage: A new Cloud Storage zonal bucket that enables you to colocate your primary storage with your TPUs or GPUs for optimal utilization. It provides up to 20x faster random-read data loading than a Cloud Storage regional bucket.

(Normally we wouldn't allow a post like this which cherry-picks one bit of a larger article, but judging by the community response it's clear that you've put your finger on something important, so thanks! We're always game to suspend the rules when doing so is interesting.)


There's now another blog post about Rapid storage specifically: https://cloud.google.com/blog/products/storage-data-transfer... . (That wasn't up yet when the original post was made.)

Ah excellent—that's what we were waiting for. I've changed the URL to that from https://cloud.google.com/blog/products/compute/whats-new-wit... above. Thanks!

Apologies! First time making a post on Hacker News, and I thought this was really exciting news. FWIW, I talked to the presenter after this was revealed during the NEXT conference today, and he seemed to imply that zonal storage is quite close to what Google has internally with Colossus.

Oh no, don't apologize - this was a case where you did exactly the right thing and I'm glad you posted!

(I was just adding some explanation for more seasoned users who might wonder why we were treating this a bit differently.)

Also, welcome to posting on HN and we hope you'll continue!


The gods strip off interesting bits of URLs when you submit it

if you saw that code you wouldn't deify it

It took me 4-5 attempts to not read:

> If you saw that code, you wouldn't _defy_ it


Moloch was also a god!

Is this related at all to the private, invite-only Anywhere Cache? (Or maybe it's GA now?)

https://cloud.google.com/storage/docs/anywhere-cache


Anywhere Cache and Rapid Storage share some infrastructure inside of GCS and both are good solutions for improving GCS performance, but Anywhere Cache is an SSD cache in front of the normal buckets while Rapid Storage is a new type of bucket.

(I work on Google storage)


Can you expand a bit on when it would make sense to use one versus the other?

Anywhere Cache shines in front of a multi-regional bucket. Once the data is cached, there are no egress charges and there's much better latency. This is great for someone who looks for spot compute capacity to run computations anywhere in the multi-region. It will also improve performance in front of regional buckets, but as a cache, you'll see the difference between hits and misses.

Rapid Storage will have all of your data local and fast, including writes. It also adds the ability to have fast durable appends, which is something you can't get from the standard buckets.


Is it like PureStorage?

There's a detailed blog post about Rapid Storage now available, see https://news.ycombinator.com/item?id=43645309

(I work on Google storage)


Thanks! I've changed the URL of the current thread and re-upped this one. More at https://news.ycombinator.com/item?id=43646209.

Super interesting! Rapid Storage especially, very useful, but that first line:

"Today's innovation isn't born in a lab or at a drafting board; it's built on the bedrock of AI infrastructure. "

Uhh... no. Even as an AI developer I can tell that is some AI comms person tripping over themselves.


Everyone needs to learn to use a single, unique, unambiguous URL for new product announcements like this.

Google aren't the only company that consistently messes this up, but given how they built a $1.95 trillion company on top of crawling URLs on the web, they really should have an internal culture that values giving things unique URLs!

[I had to learn this lesson myself: I used to blog "weeknotes" every week or two where I'd bundle all of my project announcements together and it sucked not being able to link to them as individual posts]


Google's not really at fault here: the OP submitted a link to an article called "Introducing Ironwood TPUs and new innovations in AI Hypercomputer" that happens to mention Rapid Storage way down the page.

In this case, marketing seems to have moved faster than documentation, though, since I can't find any mention of this in the main GCS docs. https://cloud.google.com/search?hl=en&q=rapid%20storage


It's down here too: https://cloud.google.com/products/storage

No link and no details though.


They revealed it's in private preview atm ;)

Hats off to whoever convinced management that selling Colossus via cloud was Artificial Intelligence. Bravo.

They're not saying that it's AI. They're saying it's for customers who do AI. Training means lots and lots of reads from a big data store, and if you're reading from, like, big Parquet files, that probably means lots of random reads. This is for that. Speedier data access, presumably at the cost of durability and availability, which is probably a great trade-off for people doing ML training jobs.

calling everything 'for AI' is the new standard

>if you're reading from, like, big Parquet files, that probably means lots of random reads

and it also usually means that you shouldn't use S3 in the first place for workloads like this, because it is usually very inefficient compared to a distributed FS. Unless you have some prefetch/cache layer, you will get both bad timings and higher costs.


But a distributed FS is far more expensive than cloud blob storage would be, and I can't imagine most workloads would need the features of a POSIX filesystem.

I don't fault them for this at all. AI isn't possible without the full infra stack, which clearly includes storage (and compute, and networking, and data pipelining, and and and...). There's an entire ecosystem of ISVs that only do one of these things, very well: Pure Storage, for example, or Lambda or CoreWeave, or Confluent (Kafka + Flink with LLM integration). While it might be more precisely accurate to say "AI-enabling" tech, I'll give them a pass.

I think the joke here is that somehow management refused to sell Colossus (which is such an obvious nice product just like BigQuery) before and it takes "AI" to convince them.

> which is such an obvious nice product just like BigQuery

I always assumed (from outside Google) that the problem was that Colossus had to make a "no malicious actors" assumption in its design in order to make the performance/scaling guarantees it does; and that therefore just exposing it directly to the public would make it possible for someone to DoS-attack the Colossus cluster.

My logic was that there's actually nothing forcing [the public GCP service of] BigTable to require that a full copy of the dataset be kept hot across the nodes, with pre-reserved storage space — rather than mostly decoupling origin storage from compute† — unless it was to prevent some DoS vector.

As for exactly what that DoS vector is... maybe GC/compaction policy-engine logic? (AFAICT, Colossus has pluggable "send compute to data" GC, which internal-BigTable and GCS both use. But external-BigTable forces the GC to be offloaded to the client [i.e. to the BigTable compute nodes the user has allocated] so that the user can't just load down the system with so many complex GC policies that the DC-scale Colossus cluster itself starts to fall behind its GC time budget.)

---

† Where by "decouple storage from compute", I mean:

• Each compute node gets a fixed-sized DAS diskset, like GCE local NVMe SSDs;

• each disk in that diskset gets partitioned up at some fixed ratio, into two virtual disksets;

• one virtual diskset gets RAID6'ed or ZFS'ed together, and is used as storage for non-Colossus-synced tablet-LDB nursery level SSTs;

• the other virtual diskset gets RAID0'ed or LVM-JBOD-ed together and is used as a bounded-size LFU read-through cache of the Colossus-synced tablets — just like BigQuery compute nodes presumably have.

(AFAIK the LDB nursery levels already get force-compacted into "full" [128MiB] Colossus-synced tablets after some quite-short finality interval, so it's not like this increases data loss likelihood by much. And BigTable doesn't guarantee durability for non-replicated keys anyway.)


> a "no malicious actors" assumption in its design in order to make the performance/scaling guarantees it does

I haven't thought deeply about it, but could this be solved with more nuanced billing design?


That’s an actually impressive level of spin. Flashback to when shipping companies were slapping blockchain on international container shipments.

Flashback to the cabbage on the blockchain, or when they wanted to tag each wild-caught fish as well for traceability!

You gotta feed the GPUs and TPUs with enough data to avoid them sitting idle, which starts to become incredibly challenging with the latest gen GPU/TPU chips.

Maybe they can start selling Capacitor as a file format for storing LLM metadata or something.

I would pay serious money if they sold CFS as a service but on AWS.

Hi, I'm looking for a job. Are you willing to pay me serious money to set up CFS as a service on your AWS?

Obviously not, since you could not deliver it. It seems that you maybe don't realize what CFS is in this context, and are thinking of something else that you could just "set up"?

What jeffbee is talking about is Google's proprietary Colossus File System, and all its transitive dependencies.


I meant it sarcastically, but for "serious money" you can have any software system you can dream of. You have to dream of it, though - that's one of the hard parts.

It looks like every other clustered file system. What's special about Google's Colossus?


There are some semantic differences compared to POSIX filesystems. A couple big ones:

  - You can only append to an object, and each object can only have one writer at a time. This is useful for distributed systems - you could have one process adding records to the end of a log, and readers pulling new records from the end.
  - It's also possible to "finalize" an object, meaning that it can't be appended to any more.
(I work on Rapid storage.)
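
To make the append/finalize model concrete, here's the kind of single-writer log you could build on top of it. This is a hypothetical sketch: the bucket interface and its create_appendable_object/append/finalize methods are invented names for illustration, not the actual Rapid Storage client API.

  import json

  class LogWriter:
      """Hypothetical single-writer, append-only log on an appendable object.

      The bucket interface used here (create_appendable_object, append,
      finalize) is invented for illustration and is not the real API.
      """

      def __init__(self, bucket, name: str):
          self.obj = bucket.create_appendable_object(name)  # hypothetical call

      def append_record(self, record: dict) -> int:
          line = (json.dumps(record) + "\n").encode()
          # A durable append returns the committed offset, so readers can
          # safely consume everything below that point.
          return self.obj.append(line)                      # hypothetical call

      def close(self) -> None:
          # Finalizing makes the object immutable; no further appends allowed.
          self.obj.finalize()                               # hypothetical call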

Why would you wish for a system with constraints like that, which other systems don't have?

Other systems don't offer the performance that Colossus offers, is why. POSIX has all kinds of silly features that aren't really necessary for every use case. Throwing away things like atomic writes by multiple writers allows the whole system to just go faster.

It sounds like you have to find a design that meets your performance target and usage patterns - just like anything else. It also sounds like Google's CFS is a grass-is-greener situation: you heard Google had something that solved the problem you have, so you want it. But the reason it sounds good, compared to the other designs, is that you haven't had to actually use it and run into its quirks yet.

Everything is AI these days. Does it still need convincing?

Ha, right after I read your comment, I looked at the bottom of this Hacker News page and saw their "Join us for AI Startup School" ad.

Reading the press release about the "Hypercomputer" and I can't tell what part of this is real and what part is marketing.

They say it comes in two configurations, 256 chips or 9,216 chips. They also say that the maximal configuration of 9,216 chips delivers 24x the compute power of the world's largest supercomputer (which they say is called El Capitan). They say that this comes to 42.6 exaFLOPs.

This implies that the 9,216 chip configuration doesn't actually exist in any form in reality, or else it would now be the world's largest supercomputer (by flops) by a huge margin.

Am I massively misunderstanding what the claims being made are about the TPU and the 42.6 exaFLOPs? I feel like this would be much bigger news if this was fully legit.

Edit: The flops being benchmarked are not the same as regular supercomputer flops.


Supercomputers are measured based on 64-bit floating point (FP64) operations. Here they (inaptly) compared their 8-bit floating point (FP8) operations, which are only useful for AI workloads, against El Capitan's FP64 number - roughly 1.7 exaFLOPs - which is where the 24x comes from.

Gotcha. That makes a lot more sense. I was led to believe by the wording of the comparison that they were the same operations. Appreciate the explanation.

Why is it inapt?

If all you care about is an 8-bit AI workload (there's definitely a market for that), it's nice to have 24x the speed.


It's an apples to oranges comparison.

It's apples to apples if you care about 8-bit (a lot of people do these days).

AFAIK, there wasn't a faster 8-bit supercomputer to compare to - which is why they made the comparison.


Also, the set of supported/accelerated operations in the fastest path is different depending on whether you use 8-, 16-, or 32-bit floats, hence the common use of "TOPS" as a benchmark number recently.

Terrifyingly complicated and buzzword packed. I really don't know what to make of any of this or what it does, and I work with AI applications in my day job.

I'm guessing the $300 of Google Cloud credit offered in this webpage wouldn't go very far using any of this stuff?


You can try out everything for $300 easily. The most expensive thing you can do is get a server with 8 H200s and spend $90 an hour.

Like with any other new Google product, better wait a few years to see if it sticks before investing in its usage. In most cases, you'd be better off searching for an alternative from the start.

[flagged]


Please don't do this here.

If you want object storage faster than S3 Express One Zone or GCP Rapid Storage, without the zonal limitation, check out ACS: https://acceleratedcloudstorage.com

You can bring data in and out of the GPU quickly and improve utilization.





