We've been able to run order matching engines for entire exchanges on a single thread for over a decade by this point.
I think this specific class of computational power - strictly serialized transaction processing - has not grown at the rate that headline metrics like core count would suggest. Adding 31 additional cores doesn't make the order matching engine go any faster (it could only go slower).
If your product is handling fewer than several million transactions per second and you are finding yourself reaching for a cluster of machines, you need to back up like 15 steps and start over.
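For anyone who hasn't seen one: the serial core of a matching engine is not much more than a loop over a price-time priority queue. The sketch below is mine and hugely simplified (no order IDs, cancels, risk checks, or persistence) and is not how any particular exchange does it, but it shows why a single thread goes so far: each incoming order touches a couple of heaps and that's it.

    # Minimal single-threaded price-time priority matcher (illustrative sketch only).
    import heapq, itertools

    class OrderBook:
        def __init__(self):
            self.bids = []                  # heap of (-price, seq, qty); best bid first
            self.asks = []                  # heap of (price, seq, qty); best ask first
            self._seq = itertools.count()   # arrival order breaks price ties

        def submit(self, side, price, qty):
            """Match an incoming order against the book; rest any remainder."""
            trades = []
            if side == "buy":
                # Lift asks while they still cross the incoming bid.
                while qty and self.asks and self.asks[0][0] <= price:
                    ask_px, seq, ask_qty = heapq.heappop(self.asks)
                    fill = min(qty, ask_qty)
                    trades.append((ask_px, fill))
                    qty -= fill
                    if ask_qty > fill:
                        heapq.heappush(self.asks, (ask_px, seq, ask_qty - fill))
                if qty:
                    heapq.heappush(self.bids, (-price, next(self._seq), qty))
            else:
                # Hit bids while they still cross the incoming ask.
                while qty and self.bids and -self.bids[0][0] >= price:
                    neg_px, seq, bid_qty = heapq.heappop(self.bids)
                    fill = min(qty, bid_qty)
                    trades.append((-neg_px, fill))
                    qty -= fill
                    if bid_qty > fill:
                        heapq.heappush(self.bids, (neg_px, seq, bid_qty - fill))
                if qty:
                    heapq.heappush(self.asks, (price, next(self._seq), qty))
            return trades

    book = OrderBook()
    book.submit("sell", 101, 5)
    print(book.submit("buy", 102, 3))   # crosses -> [(101, 3)], 2 remain resting at 101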
> We've been able to run order matching engines for entire exchanges on a single thread for over a decade by this point.
This is the bit that really gets me fired up. People (read: system “architects”) were so desperate to “prove their worth” and leave a mark that many of these systems have been overcomplicated, unleashing a litany of new issues. The original design would still satisfy 99% of use cases, and these days, given local compute capacity, you could run an entire market on a single device.
Why can you not match orders in parallel using logarithmic reduction, the same way you would sort in parallel? Is it that there isn't enough computation being done beyond sorting by time and price?
I think that's allowed, but this is where my meagre expertise runs out. You normally have to process orders serially, or at least with algorithms that yield exactly the same outcome serial execution would give, but only within a single order book.
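To illustrate that point (my own assumption about the usual structure, not a description of any real venue): you keep strict ordering inside each book and take your parallelism across books, e.g. by sharding incoming orders by symbol onto per-book workers. The symbols and structure here are hypothetical.

    # Parallel across order books, strictly serial within each one.
    import queue, threading

    def run_book(symbol, inbox):
        # Stand-in for a serial matching loop: events are applied to this
        # book exactly in the order they arrive on its queue.
        while True:
            order = inbox.get()
            if order is None:
                break
            print(f"{symbol}: matched {order} serially")

    symbols = ["AAPL", "MSFT"]                      # hypothetical instruments
    inboxes = {s: queue.Queue() for s in symbols}
    workers = [threading.Thread(target=run_book, args=(s, inboxes[s]))
               for s in symbols]
    for w in workers:
        w.start()

    # The gateway shards by symbol: books run concurrently, each book is serial.
    for sym, px, qty in [("AAPL", 190, 10), ("MSFT", 410, 5), ("AAPL", 191, 2)]:
        inboxes[sym].put((px, qty))

    for s in symbols:
        inboxes[s].put(None)                        # shut the workers down
    for w in workers:
        w.join()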
You're only able to do that because you're doing simple processing on each transaction. If you had to do more complex processing on each transaction, it wouldn't be possible to handle that many. Though it's hard for me to imagine what that more complex processing would be (I'm not in your domain).
HFT firms would love to do more complex calculations for some of their trades. They often make the compromise of using a faster algorithm that is known to be right only 60% of the time versus a better but slower algorithm that is right 90% of the time.
That is a different problem from yours, though, so it has different considerations. In some areas I/O dominates; in others it does not.
In a perfect world, even end-user software would be tuned to maximize (EV/op) x (ops/sec). How many person-years of productivity are lost each year to people waiting for Windows or Office to start up, finish updating, etc.?
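Toy arithmetic for that (EV/op) x (ops/sec) framing, reusing the 60%/90% accuracy figures above. The payoffs and latencies are made-up numbers purely for illustration; the real HFT edge is mostly about being first, but the same trade-off shows up in a throughput framing.

    # (EV per decision) x (decisions per second), with invented payoffs/latencies.
    payoff_right, payoff_wrong = 1.0, -1.0          # arbitrary units per trade

    fast = {"p_right": 0.60, "latency_us": 1}       # assumed 1 microsecond per decision
    slow = {"p_right": 0.90, "latency_us": 50}      # assumed 50 microseconds per decision

    def ev_per_second(algo):
        ev_per_op = (algo["p_right"] * payoff_right
                     + (1 - algo["p_right"]) * payoff_wrong)
        ops_per_sec = 1e6 / algo["latency_us"]
        return ev_per_op * ops_per_sec

    print(ev_per_second(fast))   # 0.2 EV/op * 1,000,000 ops/sec ~= 200,000
    print(ev_per_second(slow))   # 0.8 EV/op *    20,000 ops/sec ~=  16,000

With these (invented) numbers the 60%-accurate algorithm wins simply because it gets fifty times as many shots per second.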
I work in card payments transaction processing and IO dominates. You need to have big models and lots of data to authorize a transaction. And you need that data as fresh as possible and as close to your compute as possible... but you're always dominated by IO. Computing the authorization is super cheap.
Tends to scale vertically rather than horizontally. Give me massive caches and wide registers and I can keep them full. For now though a lot of stuff is run on commodity cloud hardware so... eh.
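A toy sketch of what "IO dominates" looks like in an authorization path. Everything here is hypothetical (the function names, the 5 ms figure, the scoring rule); the point is just that the request time lives in the data fetch, not the decision.

    # Illustrative authorization flow: IO dwarfs the compute.
    import time

    def fetch_features(card_key):
        # Stand-in for reads against fraud/risk data stores over the network.
        time.sleep(0.005)            # ~5 ms of IO dominates the request
        return {"recent_declines": 0, "avg_ticket": 42.0}

    def score(features, amount):
        # The decision itself is cheap relative to the IO above.
        return amount < 10 * features["avg_ticket"] and features["recent_declines"] < 3

    def authorize(card_key, amount):
        start = time.perf_counter()
        features = fetch_features(card_key)         # IO
        approved = score(features, amount)          # compute
        elapsed_ms = (time.perf_counter() - start) * 1000
        return approved, elapsed_ms

    print(authorize("demo-card", 125.00))           # (True, ~5 ms, almost all of it IO)

Caching or batching the feature reads is what moves the needle; making the scoring itself faster barely registers.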