RPCS3 Inside Look: A Deep-Dive into Hardware and Performance Scaling

powerbook5300CS · on Aug 23, 2020

Intel: still the undisputed king of single thread performance. For some applications that’s all that matters.

ekianjo · on Aug 23, 2020

Relevant extract:

> Unlike other software, RPCS3’s PPU & SPU threads need to communicate constantly which results in a major bottleneck if these threads are split across multiple CCXes / chiplets. That ends up with the CPU hitting this bottleneck constantly with all the data moving around. This is why we do not recommend Ryzen CPUs unless they have a 3 or 4 core CCX design (6-8 core Ryzen CPUs, or a 4 core Ryzen APU). A 4 core CCX design is ideal as RPCS3 can fit all the PPU & SPU threads onto a single CCX, allowing users to bypass inter-CCX latency bottleneck entirely, provided the PPU & SPU threads are being scheduled properly to be placed on a single CCX.

> While later Ryzen generations have greatly improved latency, it’s still a major bottleneck for RPCS3 if all the PPU & SPU threads cannot be placed on a single CCX. Another thing to note is that Ryzen users should definitely update to Windows 10 1903 or later, as Microsoft improved their scheduler which helps to avoid this bottleneck as well.

> The Intel CPUs on the other hand are quite the opposite. They do not suffer from the latency issues explained above due to its monolithic design and when you combine all that with a faster single core, you will notice that they do often perform better than their AMD equivalent.

sb057 · on Aug 23, 2020

Still charging a king's ransom, too. Looking on Amazon right now, the i9-9900K (which reached 32.3 FPS) costs 422.59 USD, compared to R7 3700X (which reached 28.0 FPS), which costs 329.99 USD.

That's a nearly 2:1 ratio for performance/price scaling.

csdreamer7 · on Aug 23, 2020

Current price for the R7 3700X on amazon is $289.99. That is one thing about AMD's prices, they fall over time.

I know this because I bought a 3700X recently. I debated buying online vs getting a 3700X for 269.99 in-store in Microcenter.

Narishma · on Aug 23, 2020

I think in this case it has more to do with the multiple-CCX design and the synchronization bottleneck it creates than single-threaded performance.

loeg · on Aug 23, 2020

> Intel: still the undisputed king of single thread performance.

Certainly by sales volume.

> For some applications that’s all that matters.

Is this one of them?

powerbook5300CS · on Aug 23, 2020

Yes, RTFA. That’s why the comment is relevant.

loeg · on Aug 23, 2020

I read the article, no need to be rude. The article is about maximizing memory throughput. It is not unusual to need multiple threads to actually saturate the memory controller(s). And in fact, RPCS3 uses several threads. So it is not clear that single-thread performance is the only significant factor, or that your unsubstantiated and context-free claim about Intel's single thread performance was relevant.

makomk · on Aug 23, 2020

Not really. RPCS3 is very heavily multi-threaded, and the big thing that makes it perform better on Intel CPUs than AMD is differences in the performance of cross-core synchronization and data transfer. Current AMD desktop processors are split into CCXs containing at most 4 cores, and any communication between CCXs has to go via the seperate IO die which is relatively slow.

jy14898 · on Aug 23, 2020

I wonder what memory they used with the Ryzen builds, as the memory frequency may decide the FCLK which affects some (all?) core to core communication speed

ngcc_hk · on Aug 24, 2020

PciE 4.0?