> Down the line, $100 billion is not out of the question, Ghodsi says—and even that could be a conservative figure. It’s simple math: Enterprise AI is already a trillion-dollar market, and it’s certain to grow much larger. If the category leader grabs just 10% of the market, Ghodsi says, that’s revenues of “many, many hundred billions.”
Um... isn't that the same fallacy they trotted out before the dot-com bubble of 2001? [1][2]
I mean I know people have a short memory span, I guess I'm just surprised it's this short. I would have thought that most folks with even a cursory knowledge of Silicon Valley history know about the 1% fallacy. I'm not even a founder, and even I know what it refers to.
This quote should have been a giant red flag for Kenrick Cai and Forbes. It should have single-handedly changed the angle of the story from puff piece to investigative. To see them both uncritically regurgitating something so thoroughly disproven is another depressing nail in the coffin of modern-day journalism.
Databricks unique selling point is a scalable data lake. The AI functionality is obviously used as well, however, its real use case is inflated by a lot of hot air.
I know multiple AE level folks who have left big cloud companies (ie Google, Amazon, SFDC, etc) for literal 7 figure pay packages at Databricks and then vaporized sometime there after.
Figma got acquired by an enterprise superpower, simultaneously creating a Kraken of a competitor and closing a path to exit. Canva are also one of the worse companies with shareholder liquidity, which I know in at least a few cases lead to fire-sale dispositions.
Do those events change the fundamentals though? Canva's bottom line is still excellent.
The previous valuation was just PE froth. Canva just raised money at the top of the market at an absurd valuation, but it's the cash flow that matters.
> Canva are also one of the worse companies with shareholder liquidity, which I know in at least a few cases lead to fire-sale dispositions.
"dispositions"? Makes no sense in the context. I worked at Canva and got to sell 20% in 2020 and 2021, the latter at the $40B valuation. So as a shareholder I'm relatively happy with the amount of liquidity in the stock.
You’re describing tenders. They’re fine for small-scale planned liquidity.
If a fund hits a risk limit, on the other hand, and must sell before the end of the year for compliance reasons, and the company is known to block transfers, the solution is a financially-engineered mechanism that throws off buyers and reduces the price. This liquidity discount gives management more control at the expense of shareholders. It’s also commonly priced into valuation expectations, at least by investors who aren’t all froth.
The same that happened to the valuations of the rest of the companies on the list. There's been a massive haircut across private software companies in the last 12-18 months. Stripe has had a similar fall in valuation.
So it is about single private company by founders of Apache Spark. Not sure I'd call it mafia. Further I see that a high performance distributed data processing system implemented in Rust is building up around Apache Arrow. So Databricks lead and hence valuations will be quite difficult to maintain.
Databricks unique selling point is the scalable data lake. The AI functionality is used, however, is hyped with a lot of hot air in this article. I can't see Databricks hitting $100 billion anytime soon.
This reads like a pre-IPO puff piece, nothing more. Without getting into how obnoxious the term is, there's no "mafia" here. And I'm not buying the altruism so vainly on display. Goes to show how modern tech journalism is basically commissioned.
I was hoping the article would dive into some deeper insight as to why they started the business if they weren't interested.
I'm not "interested in starting a business" either. But I seem to keep doing it. I was wondering why the other day.
My core interest isn't business, but I did enjoy my time as a software engineer and I like technology. I also have a background in design, and love that part. I've worked in marketing and PR, and enjoy that. Finance I'm a bit meh on, but I know how to put together a forecast, worked in accounting for a bit, so I can do the job. I've never worked in HR, but I've managed a few teams.
Anybody else in a similar boat?
What got you to start your business? Or why do you stick with it?
I feel similarly. My favorite things about running a business are:
* The variety of problems I get to work on. I write code, I do sales calls, I manage people, I negotiate contracts, I do taxes, you get the idea. I really like learning things and I never get bored.
* I like the intensity. I have to do everything as well as I possibly can, and so I feel like I'm running at close to 100% efficiency at all times. It is very satisfying to feel that you're putting your best work you could.
Both of these things are rare at "normal" jobs. I was lucky enough to have similar experiences when I "worked for the man", but those experiences were never close in terms of quality or intensity.
>Users feed in their data and the AI makes predictions about the future. John Deere, for example, installs sensors in its farm equipment to measure things like engine temperature and hours of use. Databricks uses this raw data to predict when a tractor is likely to break down.
I'm pretty sure this is not what databricks does. Isn't databricks just a way to run queries on unstructured data? Or am I thinking of something else?
>They wanted to replicate what the big tech companies were doing with neural networks, without the complex interface.
This is utterly wrong. What data bricks did has nothing to do with neural network. Probably Forbes wanted to mean Hadoop. I am happy for the team though.
Submitted title from "There was PayPal mafia. Meet the Berkeley mafia". That broke the site guidelines ("Please use the original title, unless it is misleading or linkbait; don't editorialize.")
Since the article's own title is also linkbait, I've replaced it with a representative phrase from the main text.
If 1 of these 7 Berkeley CS Ph.D.'s decides to leave to found his own company, now that they all have exec experience - what VC would turn them down?
Every single one of them will have top tier startup experience, having been part of the founding team of a huge success story, AND the most technical of their first 10 hires at the same time.
But will it be successful? The PayPal Mafia were successful in their other pursuits. And not just successful, but all founded their own companies valued > than databricks (I think, based on rough memory of their valuations).
At least two of them already had that background, though. They weren’t blank slates: “billionaire startup alumni founds second billionaire startup with new colleagues” would be a perfectly accurate title.
> Stoica was an exec at $300 million video streaming startup Conviva, while Shenker had been the first CEO at Nicira, a networking firm sold in 2012 to VMware for about $1.3 billion
Only Stoica, Ghodsi, and Zaharia are listed on https://www.forbes.com/real-time-billionaires and I don't think any of them were billionaires before Databricks? Exec at a $300M company or CEO overseeing a $1.3B exit means rich, yes, but not billionaire rich.
Yes it does and its bleak. I don't understand the valuation based on assumption that even the smallest startup will need to write some fancy map-reduce spark jobs to do analytics and AI. Most companies are best served by a warehouse like snowflake and a realtime layer for analytics. I don't understand the value add of databricks.
But you don't have to write map-reduce jobs at all? You can just write SQL queries or Pandas programs, and they automatically get parallelized by Databricks. Databricks is a data warehouse (just like Snowflake).
In a twist, pandas programs don't get parallelized on Spark. Someone had to go and write a parallel layer that duplicated the pandas API, because otherwise you ended up with the entire pandas program executing on a single executor.
FWIW what we see are whole different categories of workloads. For primarily API-driven microservice workloads, ETL of data stores into Snowflake makes sense. But for primarily batch or stream workloads- implemented literally as batches or as data streams that have varying unit-of-work semantics, and where the target data model isn't read only analytics but read write operational- something like Spark can make a lot of sense.
We use Databricks on a data processing pipeline. I don't know of anyone in love with it. It is by far the number 1 source of problems on that pipeline. Just in the last week:
* It deleted hundreds of log files without warning
* We had a failure starting a cluster; the web UI listed the cluster, but in fact it no longer existed - we had to recreate it.
* Log files that it didn't delete show that it is having problems pulling some internal metadata from an AWS IP address (we are on Azure).
If the directive from on high came down that we are to rip it out and replace it with something else, nobody would be surprised, or care.
As a data scientist, the greatest thing about Databricks is their marketing and sales departments. Not that they have a bad product, but they're not selling anything brilliantly unique.
Most use cases currently served by Apache Spark clusters would run 10x faster on a laptop with fast SSD (Macbook perhaps), and an ad hoc cat/grep/sed pipeline /s
There's a DuckDB and Polars and similar tools now, which can finally outperform the venerable unix tools. Once the data is on the laptop you can get order of magnitude faster execution than Spark.
The unsolved problems are 1) what if the data and what you do with it suddenly doesn't fit on a laptop (giving everyone 64 GB RAM laptops for example seems like a waste) and 2) how do you deliver the relevant subset of the data from the petabyte place where you store it to the laptop.
If someone could solve that, Spark could finally go to hell.
Boy wasn’t it fortunate that some of them had done all this paperwork and exec-ing before in successful tech startups! They might have really struggled without a shit-ton of business background!