If you call a dynamically-linked library, you may get different code on different processors. (For example, the Accelerate library on Mac OS X uses different implementations of its routines on different processors.)
However, suppose you use identical executable images (including all libraries) that do not dispatch based on processor model, and you give them identical inputs, including any changes made to floating-point modes or other global state that can affect floating-point behavior. Then the processor produces identical results for all elementary floating-point operations (add, subtract, multiply, divide, compare, convert).
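To illustrate why global floating-point state is part of the "identical inputs" requirement, here is a minimal sketch (plain C, using `<fenv.h>`) in which the same code and the same operands produce different results solely because the rounding mode differs:

```c
#include <fenv.h>
#include <stdio.h>

#pragma STDC FENV_ACCESS ON

int main(void) {
    volatile double x = 1.0, y = 3.0;   /* volatile prevents constant folding */

    fesetround(FE_TONEAREST);
    double a = x / y;                   /* rounded to nearest */

    fesetround(FE_UPWARD);
    double b = x / y;                   /* rounded toward +infinity */

    /* The two quotients differ in the last bit, even though the
       instruction and operands are identical. */
    printf("%.17g\n%.17g\n", a, b);
    return 0;
}
```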
Certain operations, such as the inverse-square-root-estimate instruction, are not fully specified and are allowed to return different results on different processors.
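As a sketch of what such an operation looks like (assuming an x86 target with SSE), the `rsqrtss` estimate is only required to be approximately correct, so the value you read back may legitimately vary by processor model:

```c
#include <stdio.h>
#include <xmmintrin.h>

int main(void) {
    __m128 x = _mm_set_ss(2.0f);
    __m128 e = _mm_rsqrt_ss(x);               /* approximate 1/sqrt(2) */
    /* Only an estimate is guaranteed; different CPU models may print
       slightly different values for the same input. */
    printf("estimate: %.9g\n", _mm_cvtss_f32(e));
    return 0;
}
```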
Concerns mentioned in ecatmur’s answer about compiler optimizations, fused multiply-add, and SSE/SSE2/FPU use do not apply to identical binaries. Those concerns apply only when different compilations (different switches, different target platforms, different compiler versions) might produce different code. Since you have excluded different compilations, they are not relevant.
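To see why fused multiply-add is a compilation issue rather than a processor issue, here is a small sketch: an FMA rounds `a*b + c` once, while separate multiply and add round twice, so a compiler that contracts the expression changes the result. The example uses `fma()` from `<math.h>` to make the single-rounding value explicit, and `volatile` to keep the separate version from being contracted:

```c
#include <math.h>
#include <stdio.h>

int main(void) {
    double a = 1.0 + 0x1p-52;          /* 1 + DBL_EPSILON */
    double b = 1.0 - 0x1p-53;
    double c = -1.0;

    volatile double product = a * b;   /* rounded here: product becomes exactly 1.0 */
    double separate = product + c;     /* 0.0 */
    double fused    = fma(a, b, c);    /* single rounding: about 1.1e-16 */

    printf("separate: %g\nfused:    %g\n", separate, fused);
    return 0;
}
```

Both results are correct IEEE 754 arithmetic; they differ only because the two compilations perform different sequences of rounded operations.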
If you build for both a 32-bit target (i386) and a 64-bit target (x86_64), you are producing two executable images (packaged in one “fat” file), and the concerns about different compilations do apply between those two images.
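A minimal sketch of why the two slices count as different compilations: each slice is separate machine code, and even the way intermediate expressions are evaluated (reported by `FLT_EVAL_METHOD` in `<float.h>`) can differ between them, for example x87 extended precision on some 32-bit toolchains versus SSE on x86_64:

```c
#include <float.h>
#include <stdio.h>

int main(void) {
#if defined(__x86_64__)
    printf("x86_64 slice, FLT_EVAL_METHOD = %d\n", FLT_EVAL_METHOD);
#elif defined(__i386__)
    printf("i386 slice,   FLT_EVAL_METHOD = %d\n", FLT_EVAL_METHOD);
#else
    printf("other target, FLT_EVAL_METHOD = %d\n", FLT_EVAL_METHOD);
#endif
    return 0;
}
```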