We used to use the BCD opcodes for this kind of thing. Masking off the 0x30, shi...

jsheard · 2024-03-09T17:28:54 1710005334

The BCD instructions are one of the few things that was dropped in the x86-64 transition, so it wouldn't work at all anymore.

kragen · 2024-03-09T18:33:05 1710009185

they'd never been extended to work on more than 8 bits, even in the 8086; they only really existed for 8080 compatibility, and arguably the 8080 primarily had them to ease the path from earlier intel processors designed for, if i'm not mistaken, literal pocket calculators

in pocket calculators you have to display the result in decimal after performing a single arithmetic operation, so it's much more efficient to do the arithmetic in bcd than to convert from decimal to binary, perform the arithmetic operation, and then convert back

pacaro · 2024-03-09T18:57:00 1710010620

Yeah, if you wanted numbers greater than 99 you had to use them in the x87 (if you had one)

pcwalton · 2024-03-09T22:22:09 1710022929

Agner says that AAA has latency 5 on Cannon Lake, so using that instruction is a bit faster than doing the operations manually. But if you vectorize (or use SWAR) I imagine you can start to beat the legacy instructions with larger numbers.