I am currently trying to speed up my large sparse (scipy) matrix multiplications. I have successfully linked my numpy installation with OpenBLAS, and hence scipy as well. I have run these tests with success.
When I use numpy.dot(X,Y)
I can clearly see a performance boost, and multiple cores are used simultaneously. However, when I use scipy's dot functionality, there is no such boost and still only one core is used. For example:
import numpy
import scipy.sparse
x = scipy.sparse.csr_matrix(numpy.random.random((1000,1000)))
x.dot(x.T)
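The difference can be made measurable with a small timing sketch like the one below. It is only an illustration: the helper `time_call`, the 1% density, and the repeat count are assumptions and not part of my actual setup, and wall-clock timings will vary by machine.

```python
import time

import numpy
import scipy.sparse


def time_call(fn, repeats=3):
    """Return the best wall-clock time in seconds over a few runs of fn()."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - start)
    return best


n = 1000

# Dense operand: numpy.dot on float arrays is the BLAS-backed path.
dense = numpy.random.random((n, n))

# Sparse operand: about 1% non-zero entries, stored in CSR format.
# (The 1% density is an assumed value chosen for illustration.)
sparse = scipy.sparse.random(n, n, density=0.01, format="csr")

dense_time = time_call(lambda: numpy.dot(dense, dense.T))
sparse_time = time_call(lambda: sparse.dot(sparse.T))

print("numpy.dot (dense):  %.4f s" % dense_time)
print("csr.dot   (sparse): %.4f s" % sparse_time)
```

Watching CPU usage (for example in `top`) while each call runs shows whether more than one core is busy.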
Does anyone know how I can make BLAS also work with scipy's dot functionality?