Questions tagged [dsx]

IBM Data Science Experience is an interactive, collaborative, cloud-based environment where data scientists can use multiple tools to activate their insights. Data scientists can use the best of open source, tap into IBM's unique features, grow their capabilities, and share their successes. Source: http://datascience.ibm.com/blog/welcome-to-the-data-science-experience/

22 questions
3
votes
2 answers

how to use the "display" function in a scala 2.11 with Spark 2.0 notebook in dsx

In dsx is there a way to use "display" in a scala 2.11 with Spark 2.0 notebook (I know it can be done in a python notebook with pixiedust). Eg: display(spark.sql("SELECT COUNT(zip), SUM(pop), city FROM hive_zips_table WHERE state…
Vik M
  • 45
  • 1
  • 4
3
votes
2 answers

How to supply user functions/modules in DSX

I have some helper utilities defined in a separate python script. I would like to make the script available to the DSX notebook, so I can call them in the cell, but I don't want to put the script into the cell directly. What are some of the ways to…
2
votes
1 answer

Write csv to Ibm bluemix object storage from DSX python 2.7 notebook

I am trying to write a pandas dataframe as CSV to Bluemix Object Storage from a DSX Python notebook. I first save the dataframe to a 'local' CSV file. I then have a routine that attempts to write the file to Object Storage. I get a 413 response -…
Ted Morris
  • 21
  • 2
2
votes
2 answers

How to share information across notebooks in a DSX project

Is it possible to share information (such as credentials) across multiple notebooks in a DSX project, e.g. with environment variables? For example a Cloud Foundry application in Bluemix has a control setting where environment variables can be…
2
votes
3 answers

Netezza Drivers not available in Spark (Python Notebook) in DataScienceExperience

I have a project code in Python Notebook and it ran all good when Spark was hosted in Bluemix. We are running the following code to connect to Netezza (on premises) which worked fine in Bluemix. VT = …
1
vote
2 answers

Converting sensor tag data in DSX

I'm working on converting the existing recipe for Data Science Experience (DSX) to use data from a connected Sensor Tag device. However the mobile applications for that device send the data as strings rather than numerics - this is causing the DSX…
1
vote
2 answers

Programmatically determine if running in DSX

How can I programmatically determine if the python code in my notebook is running under DSX? I'd like to be able to do different things under a local Jupyter notebook vs. DSX.
1
vote
1 answer

DSX notification if scheduled notebook does not run?

I'm trying to troubleshoot an hourly scheduled notebook as per this question: How to troubleshoot a DSX scheduled notebook? When listing the kernel logs I noticed at 3am the notebook was not…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
3 answers

How to troubleshoot a DSX scheduled notebook?

I have a DSX notebook that I can run manually usng the DSX user interface and it populates some data in a Cloudant database. I have scheduled the notebook to run hourly. Overnight I would have expected the job to have run many times, but the…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
2 answers

unable to save changes in jupyter notebook on DSX

Occasionally, I'm unable to save changes to my notebook in DSX. I believe this is because my session has timed out. How can I prevent my changes from being lost?
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

matplotlib - ImportError: No module named _tkinter

I have a simple notebook with the following code: %matplotlib inline However, when running it I get the following error: ImportError: No module named _tkinter I have another notebook in the same project, and that one is able to run the statement…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

Is it possible for a spark job on bluemix to see a list of the other processes on the operating system?

A common approach for connecting to third party systems from spark is to provide the credentials for the systems as arguments to the spark script. However, this raises some questions about security. E.g. See this question Bluemix spark-submit --…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
2 answers

How do I access a postgres table from pyspark on IBM's Data Science Experience?

Here is my code: uname = "xxxxx" pword = "xxxxx" dbUrl = "jdbc:postgresql:dbserver" table = "xxxxx" jdbcDF = spark.read.format("jdbc").option("url", dbUrl).option("dbtable",table).option("user", uname).option("password", pword).load() I'm…
0
votes
1 answer

DSX images generated by pixiedust display command are ugly

Any ideas why the display command in DSX gives such ugly image unlike the ones in Databricks? Plus I don't see that I can even add a hue color (even default) is ok like Databricks. Databricks Data Science Experience
Vik M
  • 45
  • 1
  • 4
0
votes
1 answer

Scheduled job does not appear to run and no kernel files are created

I have a scheduled notebook job that has been running without issue for a number of days, however, last night it stopped running. Note that I am able to run the job manually without issue. I raised a previous question on this topic: How to…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
2