Questions tagged [rapids]

RAPIDS is a framework for accelerated machine learning and data science on GPUs

Questions pertaining to RAPIDS. From https://rapids.ai/ :

The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline accelerations without paying typical serialization costs. RAPIDS also includes support for multi-node, multi-GPU deployments, enabling vastly accelerated processing and training on much larger dataset sizes.

195 questions
9
votes
4 answers

Rapids / docker: could not select device driver "" with capabilities: [[gpu]]

I'm new to Rapids, and rarely have had a good experience with conda. So I'm trying to work with a containerized version. I'm new to Docker, and the combination of unknowns leaves me unable to sort things out. I have an Ubuntu 18.04 server, # uname…
user3897315
  • 487
  • 2
  • 5
  • 14
8
votes
2 answers

Dask Vs Rapids. What does rapids provide which dask doesn't have?

I want to understand what is the difference between dask and rapids, what benefits does rapids provides which dask doesn't have. Does rapids internally use dask code? If so then why do we have dask, cause even dask can interact with GPU.
DjVasu
  • 113
  • 9
8
votes
2 answers

Is there a way to run RAPIDS on windows pc?

I am trying to run Nvidia rapids on a windows computer but haven't had any luck. I have installed docker desktop for windows and downloaded the rapids image. Cuda 10.0 is installed, and Nvidia-container-toolkit isn't. I haven't been able to make it…
Martin Talero
  • 83
  • 1
  • 3
7
votes
1 answer

Precomputed distance matrix in DBSCAN

Reading around, I find it is possible to pass a precomputed distance matrix into SKLearn DBSCAN. Unfortunately, I don't know how to pass it for calculation. Say I have a 1D array with 100 elements, with just the names of the nodes. Then I have a 2D…
Jaime Nebrera
  • 79
  • 1
  • 2
6
votes
2 answers

How to do a matrix dot product in the GPU with rapids.ai

I'm using CUDF it's part of the rapids ML suite from Nvidia. Using this suite how would I do a dot product? df = cudf.DataFrame([('a', list(range(20))), ('b', list(reversed(range(20)))), ('c', list(range(20)))]) e.g. how would I perform a dot…
Pablojim
  • 8,542
  • 8
  • 45
  • 69
4
votes
2 answers

ERROR: Could not find a version that satisfies the requirement dask-cudf (from versions: none)

Describe the bug When I am trying to import dask_cudf I get the following ERROR: --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call…
sogu
  • 2,738
  • 5
  • 31
  • 90
4
votes
2 answers

MultiGPU Kmeans clustering with RAPIDs freezes

I am new into Python and Rapids.AI and I am trying to recreate SKLearn KMeans in a multinode GPU (I have 2 GPUs) using Dask and RAPIDs (I am using rapids with its docker, which mounts a Jupyter Notebook too). The code I show below (also I show an…
JuMoGar
  • 1,740
  • 2
  • 19
  • 46
4
votes
2 answers

Installing cuDF & cuML into Colab with Rapids.ai version 0.11+

I'm trying to install Rapids library with cuDF and cuML to Colab session, and executing code accroding to this example: from Install RAPIDS library on Googe Colab notebook !wget -nc…
try
  • 348
  • 3
  • 8
4
votes
2 answers

Recommended cudf Dataframe Construction

I'm interested in recommended and fast ways of creating cudf DataFrames from dense numpy objects. I have seen many examples of splitting out columns of a 2d numpy matrix to tuples then calling cudf.DataFrame on a list of tuples -- this is rather…
quasiben
  • 1,444
  • 1
  • 11
  • 19
4
votes
2 answers

How to use RAPIDS to speed up the modules separated by container, in a pipeline

We have a function that allows users to drag and drop a module through the UI interface to form a data processing pipeline, such as reading data, doing preprocessing, doing classification training, etc. After dragging/dropping, these modules will be…
gnuwind
  • 41
  • 4
3
votes
1 answer

Does Apache Spark 3 support GPU usage for Spark RDDs?

I am currently trying to run genomic analyses pipelines using Hail(library for genomics analyses written in python and Scala). Recently, Apache Spark 3 was released and it supported GPU usage. I tried spark-rapids library start an on-premise slurm…
3
votes
1 answer

CuGraph implementation of NetworkX all_pairs_dijkstras

I'm trying to convert a cpu bound algorithm I have to a GPU one, and I'm having all sorts of trouble with cugraph. Some of it is my ignorance, the other part is just the infancy and underdevelopment of cugraph, and the final part is me just sucking…
Moose Sims
  • 141
  • 2
  • 11
3
votes
1 answer

Rolling linear regression for use with groupby operation on a cuDF dataframe

I would like to calculate the rolling slope of y_value over x_value using cuML LinearRegression. Sample data (cuDF dataframe): | date | x_value | y_value | | ------ | ------ | ---- | | 2020-01-01 | 900 | 10 | | 2020-01-01 |…
nasiha
  • 31
  • 1
3
votes
0 answers

XGBOOST GridSearchCV with GPU_HIST uses both GPU and CPU simultaneously

I am using this code to train an xgboost model on a GPU The problem is that both GPU (NVIDIA 1050) and CPU cores are being used at the same time. NVIDIA system monitor shows a utilization of 85 to 90% and linux system monitor shows all cores…
AntonisA
  • 33
  • 5
3
votes
1 answer

How do you determine memory stats while using rapids.ai?

I'm using python libraries of rapids.ai and one of the key things I'm starting to wonder is: how do I inspect memory allocation programatically? I know I can use nvidia-smi to look at some overall high level stats, but specifically I woud like to…
Robert
  • 1,220
  • 16
  • 19
1
2 3
12 13