I have an Intel Xeon Silver 4210 @ 2.20 GHz with 40 cores spread across 2 NUMA nodes. I need to know the maximum theoretical GFLOPS of this architecture for single- and double-precision arithmetic.
The values I have found online differ widely, so I don't know which one to trust. The formulas I have found also differ and lead to different results: some sources give 1760 GFLOPS for single precision and 352 GFLOPS for double precision, while others give 2816 GFLOPS for double precision.
Moreover, Intel reports a value of 153.6 GFLOPS in this document: https://www.intel.com/content/dam/support/us/en/documents/processors/APP-for-Intel-Xeon-Processors.pdf
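For reference, the formulas I have seen mostly take the form cores × clock × FLOPs per core per cycle, and the disagreement seems to come from what is plugged in. Below is a minimal sketch of that arithmetic. The function names are my own, and the inputs (512-bit vectors, the number of FMA units per core, the core count, and the use of the 2.2 GHz base clock rather than the clock under vector load) are assumptions I have not been able to confirm for this CPU.

```python
def flops_per_cycle(vector_bits: int, element_bits: int, fma_units: int) -> int:
    """FLOPs per core per cycle: SIMD lanes x 2 (an FMA is a multiply plus an add) x FMA units."""
    lanes = vector_bits // element_bits
    return lanes * 2 * fma_units


def peak_gflops(cores: int, clock_ghz: float, flops_cycle: int) -> float:
    """Theoretical peak in GFLOPS = cores x clock (GHz) x FLOPs per core per cycle."""
    return cores * clock_ghz * flops_cycle


# Assumption: 512-bit vectors, 1 FMA unit per core, 10 cores, base clock.
dp_one_fma = flops_per_cycle(512, 64, fma_units=1)   # 16 double-precision FLOPs per cycle
sp_one_fma = flops_per_cycle(512, 32, fma_units=1)   # 32 single-precision FLOPs per cycle
print("DP, 10 cores, 1 FMA unit:", peak_gflops(10, 2.2, dp_one_fma))

# Assumption: 512-bit vectors, 2 FMA units per core, 40 cores, base clock.
dp_two_fma = flops_per_cycle(512, 64, fma_units=2)   # 32 double-precision FLOPs per cycle
print("DP, 40 cores, 2 FMA units:", peak_gflops(40, 2.2, dp_two_fma))
```

With these inputs the first case comes out at about 352 GFLOPS and the second at about 2816 GFLOPS, so both double-precision figures fall out of the same formula with different assumptions about core count and FMA units.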
What should I expect the correct value to be?