
So, assuming you used this with four 4090s, is there a theoretical performance comparison against the A6000 or other professional lines?



It depends on what you do with it and how much bandwidth it needs between the cards. For LLM inference with tensor parallelism (usually limited by VRAM read bandwidth, with little data exchanged between cards), 2x 4090 will massively outperform a single A6000. For training, not so much.
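
As a minimal sketch of what that looks like in practice (assuming the vLLM library; the model name is just a placeholder, not something from this thread):

    # Tensor parallelism shards each layer's weights across the GPUs,
    # so each card only has to read its own half of the weights from
    # VRAM per token -- inter-card traffic stays comparatively small.
    from vllm import LLM, SamplingParams

    llm = LLM(
        model="meta-llama/Llama-2-13b-hf",  # placeholder model
        tensor_parallel_size=2,             # shard across 2x 4090
    )
    params = SamplingParams(max_tokens=128)
    outputs = llm.generate(["Explain tensor parallelism."], params)
    print(outputs[0].outputs[0].text)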


I believe this is mostly about memory capacity. PCIe access between GPUs is much slower than the soldered VRAM on a single GPU.
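
Rough back-of-envelope arithmetic (ballpark published figures, not measurements from this thread):

    # Compare on-card VRAM bandwidth with the PCIe link two GPUs
    # must use to exchange data.
    vram_gbps = 1008  # RTX 4090 GDDR6X: ~1 TB/s per card
    pcie_gbps = 32    # PCIe 4.0 x16: ~32 GB/s per direction

    print(f"VRAM is ~{vram_gbps / pcie_gbps:.0f}x faster than PCIe")
    # -> ~32x, which is why workloads that shuffle a lot of data
    #    between cards scale poorly over PCIe.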



