Questions tagged [cudpp]
14 questions
8
votes
5 answers
Sorting objects with Thrust CUDA
Is it possible to sort objects using the Thrust library?
I have the following struct:
struct OB {
    int N;
    Cls *C; // Cls is another struct.
};
Is it possible to use Thrust to sort an array of OB according to N? Can you provide a simple…

liz
- 241
- 2
- 4
- 5
7
votes
2 answers
High-performance prefix sum / scan function in CUDA, looking for Thrust, cuDPP library alternative
I'm looking for a high-performance multiscan / multi prefix-sum (many rows in one kernel execution) function for my CUDA project.
I've tried the one from the Thrust library, but it's way too slow. Also, Thrust crashes after being compiled with nvcc…

user1946472
- 91
- 2
- 3
5
votes
1 answer
CUDA Stream compaction: understanding the concept
I am using CUDA/Thrust/CUDPP. As I understand it, in stream compaction, certain items in an array are marked as invalid and then "removed".
Now what does "removal" really mean here? Suppose the original array A has length 6. If 2 elements are…

smilingbuddha
- 14,334
- 33
- 112
- 189
5
votes
1 answer
Thrust vs CUDPP
For data-parallel algorithms on the GPU with CUDA, there are two standard libraries,
CUDPP and Thrust, which implement sorting, reduction, prefix sum, etc.
So what are the main differences between the libraries, in terms of performance and…

smilingbuddha
- 14,334
- 33
- 112
- 189
2
votes
1 answer
OpenMPI CUDA and CUDPP
I have an MPI program that distributes a large array among several processes on a cluster.
Each process calculates the sum of its own array elements and returns the result to the host.
I want to run a parallel prefix scan on the array elements of each process.
Any…

Coder
- 3,090
- 8
- 49
- 85
1
vote
1 answer
Does CUDPP only work for devices of a certain compute capability?
I want to use CUDPP. I am curious whether it only works for devices of a certain compute capability.
If so, what is the lower bound?
I found this error message in their hash table implementation code:
"Error in cudppHashTable call in testHashTable…

Programmer
- 6,565
- 25
- 78
- 125
1
vote
1 answer
CUDPP faster than Thrust in sorting? So what about structure arrays?
I have an array of structures and I want to sort them. I have been using thrust::sort and it works well. Then people told me CUDPP is faster at sorting, but I found it's not possible to sort structure arrays in a straightforward way.
Here they have…

Kiarash
- 7,378
- 10
- 44
- 69
1
vote
1 answer
cudppScan gives wrong output when input data size is greater than 700,000
I'm using CUDPP's cudppScan to calculate the prefix sum of an array of elements.
It works well when the input data size is small.
But when the data size is greater than about 700,000, some elements of the output turn out to have wrong values.
I'm doing cudppScan of an…

user435644
- 11
- 1
1
vote
1 answer
Is libcudpp.so missing from CUDA 7.0?
I am using JCuda and tried to compile and run one of its examples (JCudppHashSample), available on the JCuda site, on Ubuntu 14.04 with CUDA 7.0.
But at compile time the jcudpp library says that there is no libcudpp available. And I am wondering is it missing…

AmirSojoodi
- 1,080
- 2
- 12
- 31
1
vote
0 answers
Radix sort performance on K10 GPU
I'm looking for a fast implementation of a sort algorithm on the GPU for large arrays (hundreds of millions of elements). I've already tried the CUDPP one and got between 450M and 500M 4-byte keys + 4-byte fields per second. That did not look bad, however still in…

user3030851
- 123
- 11
1
vote
1 answer
Segmented Sort with CUDPP/Thrust
Is it possible to do a segmented sort with CUDPP in CUDA? By segmented sort, I mean sorting the elements of an array that are delimited by flags, as below.
A[10,9,8,7,6,5,4,3,2,1]
Flag array[1,0,1,0,0,1,0,0,0,0]
Sort elements of A which are between…

username_4567
- 4,737
- 12
- 56
- 92
0
votes
1 answer
Algorithm in Thrust/CUDPP to find kth smallest element
I am looking for an implementation of a kth-smallest-element algorithm in Thrust/CUDPP. I Googled for it but can't seem to find one. Does anyone know if such an algorithm exists?
I saw that there is reordering, but it does not mention kth smallest.

Programmer
- 6,565
- 25
- 78
- 125
0
votes
1 answer
How to use CUDPP without CMake?
I need to use the scan function from CUDPP. However, CUDPP requires CMake to compile. Is there a way to use the scan function from CUDPP directly in C code for CUDA? If so, how do I do that?

user1760748
- 149
- 2
- 9
-1
votes
1 answer
Error when using CUDPPHandle
I'm a CUDA beginner trying to run some pre-written code, but the compiler gives an error for every usage of CUDPPHandle... for example
void HPGMST()
{
//Reinitialize the ranking arrays, must be orig but this also works
…

soodankit
- 81
- 8