In 2011, Google claimed that each search query takes about 0.3Wh [1]. Earlier this year, Sam Altman also claimed about 0.3Wh avg use per query for OpenAI.
I'm honestly surprised that they're so similar. I've thought of LLM queries as being far more energy-intense than "just" a Google search, but maybe the takeaway is that ordinary Google searching is also quite energy-intense.
If I as a user just wanted an answer to a dumb question like, say, the meaning of some Gen Z slang, it seems about an order of magnitude cheaper to ask a small LLM running on my phone than to make a Google search.
(Check my math: assuming the A16 CPU draws 5 watts peak for 20sec running Gemma or whatever on my iPhone, that’s 0.03Wh to answer a simple query, which is 10x cheaper)
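Spelling out that back-of-envelope check (the 5 W and 20 s figures are the comment's assumptions, not measurements):

```python
# Back-of-envelope check of the on-device inference estimate above.
# Assumed: ~5 W peak CPU draw for ~20 s per query (from the comment).
watts = 5.0
seconds = 20.0
wh_local = watts * seconds / 3600   # energy in watt-hours
wh_cloud = 0.3                      # claimed per-query figure for Google/OpenAI
print(f"{wh_local:.3f} Wh local, {wh_cloud / wh_local:.1f}x cheaper than cloud")
# -> 0.028 Wh local, 10.8x cheaper than cloud
```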
Are training costs (esp. from failed runs) amortized in these estimates?
Around 2008 a core step in search was basically a grep over all documents. The grep was distributed over roughly 1000 machines so that the documents could be held in memory rather than on disk.
Inverted indices were not used as they worked poorly for “an ordered list of words” (as opposed to a bag of words).
And this doesn’t even start to address the ranking part.
It seems highly unlikely that they did not use indices. Scanning all documents would be prohibitively slow. I think it is more likely that the indices were really large, and it would take hundreds to thousands of machines to store the indices in RAM. Having a parallel scan through those indices seems likely.
Wikipedia [1] links to "Jeff Dean's keynote at WSDM 2009" [2] which suggests that indices were most certainly used.
Then again, I am no expert in this field, so if you could share more details, I'd love to hear more about it.
I worked on search at Google around that timeframe, and it definitely used an index. As far as I know, it has from the very beginning.
You can solve the ordered list of words problem in ways that are more efficient than grepping over the entire internet (e.g. bigrams, storing position information in the index).
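A minimal sketch of the "store position information in the index" idea: a positional inverted index maps each term to (doc_id, position) pairs, so a phrase query intersects postings rather than scanning documents. (This is an illustrative toy, not Google's actual implementation.)

```python
from collections import defaultdict

def build_index(docs):
    """Map each term to a list of (doc_id, position) pairs."""
    index = defaultdict(list)
    for doc_id, text in enumerate(docs):
        for pos, term in enumerate(text.lower().split()):
            index[term].append((doc_id, pos))
    return index

def phrase_search(index, phrase):
    """Return doc_ids containing the exact phrase, via postings intersection."""
    terms = phrase.lower().split()
    # Candidate start positions come from the first term's postings.
    hits = set(index[terms[0]])
    for offset, term in enumerate(terms[1:], start=1):
        postings = set(index[term])
        hits = {(d, p) for (d, p) in hits if (d, p + offset) in postings}
    return sorted({d for d, _ in hits})

docs = ["an ordered list of words", "a bag of words", "list of ordered words"]
idx = build_index(docs)
print(phrase_search(idx, "ordered list of words"))  # -> [0]
```

Note the third document contains all the query terms but not in order, so a plain bag-of-words index would wrongly match it; the positional check rejects it.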
> the takeaway is that ordinary Google searching is also quite energy-intense.
A related takeaway should be that machine inference is pervasive and has been for years, and that defining "AI" to mean just chatbots is to ignore most of the iceberg.
> I'd still love to see a report that accurately captures training cost. Today's report[1] notably excludes training cost.
From 2022, so possibly out of date: "ML training and inference are only 10%–15% of Google’s total energy use for each of the last three years, each year split ⅗ for inference and ⅖ for training." That's probably close enough to estimate 50/50, or the full energy cost to deliver an AI result is double the inference energy.
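The multiplier implied by that split, for what it's worth: with the quoted 3/5 inference / 2/5 training breakdown, the full cost is about 1.67x the inference energy; rounding the split to 50/50 gives the 2x figure.

```python
# Implied "full cost / inference cost" multiplier from the 2022 split above.
inference_share, training_share = 3 / 5, 2 / 5
multiplier = 1 + training_share / inference_share
print(f"{multiplier:.2f}x")  # -> 1.67x (2x if the split is rounded to 50/50)
```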
My gosh you're right! The paper in question is https://arxiv.org/pdf/2204.05149, "The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink"
0.3 Wh is 1080 joules. A liter of gasoline contains over 30 million joules, so this is like 0.034 milliliters of gasoline. And since grid generation is more efficient than burning gasoline, the effective equivalent is even less than that.
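The arithmetic, spelled out (the ~32 MJ/L energy density is a commonly cited ballpark figure, taken as an assumption here):

```python
# Gasoline-equivalent of one 0.3 Wh query.
joules = 0.3 * 3600            # 1 Wh = 3600 J, so 0.3 Wh = 1080 J
gasoline_j_per_l = 32e6        # ~32 MJ per liter of gasoline (assumption)
ml_gasoline = joules / gasoline_j_per_l * 1000
print(f"{joules:.0f} J ~= {ml_gasoline:.3f} mL of gasoline")
# -> 1080 J ~= 0.034 mL of gasoline
```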
They must be doing something crazy, because any time I query my local LLM the lights in my office dim and the temperature rises a few degrees. Definitely far more energy than running the microwave for 1 second.
And if it does, you should get your wiring checked! If voltage is sagging enough to dim your lights with such a small load, that indicates a lot of resistance somewhere in the wiring, which could lead to fires.
Google was not using deep learning for search in 2011. Deep learning as a concept didn't really take off until AlexNet in 2012 anyway.
Various ML "learn-to-rank" tooling was in use at Google for a while, but incorporating document embedding vectors w/ ANN search into the ranking function probably happened over the course of 2018-2021 [1], I think. Generative AI only started appearing in ordinary search results in 2024.
With Google serving AI Overviews, shouldn't an average search query now cost more? Compute is getting cheaper, but the algorithms are also getting more and more complex, increasing the compute per query.
1: https://googleblog.blogspot.com/2009/01/powering-google-sear...