I am writing a C code in Linux (maybe later in Windows) and the code includes massive Linear Algebra computations with parallel processing. I am using GotoBLAS2 which handles lots of jobs. Still there are some matrix computations (especially sparse matrices) which should be carried out manually. I searched the internet and found out some libraries such as Open MP. Also, it seems that threading can be helpful. What would be the best approach for parallel computation in C? Thanks
Additional information about my code: I am specially concerned about Sparse Matrix-Vector multiplications. Also, I have a large set of vectors {v1, v2, v3, v4, ...} and these vectors should be multiplied by different numbers, which can be carried out using parallel computation. Thanks