How do I use the Intel AVX vector instruction set from Java? It's a simple question but the answer seems to be hard to find.
-
What exactly are you trying to achieve? Can you clarify your question? – Mureinik Dec 27 '14 at 09:21
-
I want to perform bitwise AND and XOR computations using the AVX rather than Java int arrays. – Albert Hendriks Dec 27 '14 at 10:04
-
Possible duplicate: http://stackoverflow.com/q/10784951/523744 – Samuel Audet Feb 27 '15 at 03:09
-
A related keyword is SIMD – Software Engineer Nov 19 '20 at 10:01
6 Answers
As I know, most current Java JVM JITters don't support automatic vectorization or just do that for very simple loops, so you're out of luck.
In Mono's .NET implementation there's Mono.Simd
for manual vector code emission and then later MS introduced the System.Numeric.Vectors
. Unfortunately there's nothing similar in Java. I don't know if Java's vector class is vectorized using SIMD or not but I don't think it is.
If you want to use CPU-specific features like AVX then your only choice is JNI. Write your bottle neck part in C or C++ and call it from Java
There's another solution by Scala to use vectorized code without modifying the JVM that you can read in How we made the JVM 40x faster
Update:
Now there's a new Vector API being developed for writing vector code manually
Provide an initial iteration of an incubator module, jdk.incubator.vector, to express vector computations that reliably compile at runtime to optimal vector hardware instructions on supported CPU architectures and thus achieve superior performance to equivalent scalar computations.
- Vector API Developer Program for Java* Software
- Oracle and Intel seek to build a Java API for SIMD support
Read more:
-
7UPDATE: JEP-338 was delivered in java 16 at 2021/02/18. @SoftwareEngineer – Goofy Feb 20 '21 at 09:20
Depending on the work, you may not have to do much. AVX2 is used automatically by JVM on some operations on Arrays and String, on supporting platforms like Haswell onwards and Xeon v3 onwards.
https://en.wikipedia.org/wiki/Advanced_Vector_Extensions#CPUs_with_AVX2
From Ryzen 3 / Epyc Rome, you'll also have single cycle AVX2 on AMD processors: https://www.anandtech.com/print/14525/amd-zen-2-microarchitecture-analysis-ryzen-3000-and-epyc-rome
Direct use of the instructions and intrinsics are not easily available though.

- 2,417
- 1
- 23
- 22
Java 19 now supports SIMD with new Vector Intrinsics API , this API much faster then JNI implementations. Its directly converting your java method calls to inline assembly instructions. Checkout : https://vksegfault.github.io/posts/java-simd/

- 729
- 2
- 10
- 20
To use these operations from Java you need a library like JavaCV This library enables vector operations for both Intel Processors and GPUs like NVIDIA.

- 525,659
- 79
- 751
- 1,130
-
1This seems a collection of special-purpose routines, not a general toolkit for vector operations. Am I missing something? – Aleksandr Dubinsky Feb 18 '15 at 21:19
-
@AleksandrDubinsky for a general purpose API I would use JavaCL which is also a wrapper for OpenCL. . – Peter Lawrey Feb 18 '15 at 21:48
One option is to use Aparapi and install the Intel OpenCL driver. (Your code will be vectorized to work on both CPUs and GPUs, as long as an appropriate OpenCL driver is installed.)
Another option is to use JNI and call a C++ function that uses AVX intrinsics or was auto-vectorized by a compiler.

- 22,436
- 15
- 82
- 99
Check for yeppp library. It has java binding and its very fast cross platform SIMD Library.

- 729
- 2
- 10
- 20
-
the above link has been dead. Check https://sites.google.com/site/yeppplibrary/ – phuclv Sep 27 '20 at 02:37