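In C or C++ you should be able to call the standard library `fma()` function (from `<math.h>` / `<cmath>`) and have the compiler lower the call to the hardware instruction, if it isn't completely out to lunch and you're compiling for a target that actually has FMA. No intrinsics should be necessary.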
Alternatively, set `#pragma STDC FP_CONTRACT ON` and just write `x*y + z`; the compiler is then allowed, though not required, to fuse the expression.