I checked the fine print on the product website: by “up to 4x faster LLM prompt processing,” they’re specifically referring to time to first token, not token generation rate (tokens per second).
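For concreteness, the two metrics are easy to measure separately in a streaming loop. Here’s a minimal sketch assuming llama-cpp-python and a local GGUF model (the model path is hypothetical):

```python
import time
from llama_cpp import Llama

llm = Llama(model_path="model.gguf")  # hypothetical local model path

prompt = "Summarize the history of the transistor."
start = time.perf_counter()
first_token_at = None
n_tokens = 0

# stream=True yields roughly one chunk per generated token
for chunk in llm(prompt, max_tokens=128, stream=True):
    if first_token_at is None:
        first_token_at = time.perf_counter()  # prompt processing ends here
    n_tokens += 1
end = time.perf_counter()

ttft = first_token_at - start  # time to first token: dominated by prefill
gen_rate = (n_tokens - 1) / max(end - first_token_at, 1e-9)  # steady-state decode speed

print(f"time to first token: {ttft:.2f}s, generation: {gen_rate:.1f} tok/s")
```

Time to first token is dominated by the compute-bound prefill over the whole prompt, while the steady-state rate reflects the memory-bandwidth-bound decode, which is why the two can improve independently.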
Yes, this is known. They added neural accelerators (a Tensor-core equivalent) to the GPU, which should make prompt processing competitive with similar-class GPUs.
It’s a big deal! Prompt processing was previously the Mac’s weak point. Sure, output generation speed matters when a coding model is reciting whole files back, but in general conversation I’d rather have it output a short answer anyway (after extensive processing by a smart model).