Questions tagged [data-science-experience]

IBM Data Science Experience is an interactive, collaborative, cloud-based environment where data scientists can use multiple tools to activate their insights.

Source: http://datascience.ibm.com/blog/welcome-to-the-data-science-experience/

261 questions
38
votes
2 answers

How do I read a parquet in PySpark written from Spark?

I am using two Jupyter notebooks to do different things in an analysis. In my Scala notebook, I write some of my cleaned data to…
Ross Lewis
  • 755
  • 2
  • 7
  • 17
9
votes
2 answers

ValueError: Invalid endpoint: s3-api.xxxx.objectstorage.service.networklayer.com

I'm trying to access a csv file in my Watson Data Platform catalog. I used the code generation functionality from my DSX notebook: Insert to code > Insert StreamingBody object. The generated code was: import os import types import pandas as…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
5
votes
0 answers

Unable to connect to Cloud object storage instance IBM Watson Studio

I am trying to connect to COS from IBM Watson studio but I get an error... When I hit enter I get the following error: Unable to find products data_catalog or data_science_experience in the entitlements response for account id:…
5
votes
2 answers

How do I load data from a StreamingBody object using Insert to Code to pandas in Watson Studio?

The Insert to Code feature enables you to access data stored in Cloud Object Storage when working in Jupyter notebooks in Watson Studio. Some file types (e.g. txt files) will have just StreamingBody and Credentials as insert to code options: How…
4
votes
1 answer

how to set checkpoint dir PySpark Data Science Experience

Could you help me with instructions on how to set the checkpoint dir for a PySpark session on IBM's Data Science Experience? The need came because I have to run connectedComponents() from GraphFrames and it raises the following error: Py4JJavaError:…
ElBrocas
  • 399
  • 4
  • 13
3
votes
3 answers

SystemML: Cannot import the submodule mllearn (and therefore Keras2DML function)

I am using IBM Watson Studio (Default spark python environment) and trying to convert a Keras model to systemml DML and train it on Spark. !pip install systemml import systemml this executes just fine. But this - from systemml import mllearn…
3
votes
5 answers

No FileSystem for scheme: cos

I'm trying to connect to IBM Cloud Object Storage from IBM Data Science Experience: access_key = 'XXX' secret_key = 'XXX' bucket = 'mybucket' host = 'lon.ibmselect.objstor.com' service = 'mycos' sqlCxt = SQLContext(sc) hconf =…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
3
votes
1 answer

DSX Python import error : undefined symbol: PyUnicodeUCS2_AsUTF8String

On IBM DSX, I have a spark service instance on which I have installed a few newer versions of packages such as numpy. I am facing an issue with the import of numpy. The following code: import numpy raises this error message: ImportError:…
Sven Hafeneger
  • 801
  • 6
  • 13
3
votes
2 answers

how to use the "display" function in a scala 2.11 with Spark 2.0 notebook in dsx

In DSX, is there a way to use "display" in a Scala 2.11 with Spark 2.0 notebook (I know it can be done in a Python notebook with PixieDust)? E.g.: display(spark.sql("SELECT COUNT(zip), SUM(pop), city FROM hive_zips_table WHERE state…
Vik M
  • 45
  • 1
  • 4
3
votes
2 answers

How to supply user functions/modules in DSX

I have some helper utilities defined in a separate python script. I would like to make the script available to the DSX notebook, so I can call them in the cell, but I don't want to put the script into the cell directly. What are some of the ways to…
3
votes
2 answers

Bluemix Analytics for Apache Spark log file information required

I would like more information when debugging my spark notebook. I have found some log files: !ls $HOME/notebook/logs/ The files are: bootstrap-nnnnnnnn_nnnnnn.log jupyter-nnnnnnnn_nnnnnn.log …
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
2
votes
1 answer

how to perform box-cox transformation to single column in python

I am trying to apply a box-cox transformation to a single column but I am unable to do that. Can somebody help me with this issue? from sklearn.datasets import fetch_california_housing import pandas as pd import seaborn as sns import…
2
votes
1 answer

IBM DSX Local : ImportError: No module named project_lib

I am trying to use the IBM DSX built-in package "project_lib" but got the error "No module named project_lib". The package is for accessing the assets (i.e. files) of my projects. I am using IBM DSX local, not sure if the package is available for…
Jun Luo
  • 23
  • 2
2
votes
1 answer

Deploy a model from an R notebook in IBM Watson Studio

I'm trying to find out how I can deploy an R notebook as a model in Watson Studio. According to the documentation, there is a bit of deployment code which needs to go in a cell of the notebook, however I can only find documentation for Python and…
2
votes
1 answer

Can not install the CRAN package `caret` on IBM Watson Studio R notebooks

While trying to install the R package caret on IBM Watson Studio R notebooks, it installs a lot of dependencies but in the end fails with non-zero exit status errors. The dependency recipes cannot be installed for some…
Sumit Goyal
  • 575
  • 3
  • 16