Questions tagged [sframe]

SFrame is a tabular data structure used to store millions of rows of data that do not necessarily fit into memory for analysis.

SFrame is a Python package that provides a scalable, out-of-core dataframe, which lets you work with datasets that are larger than the amount of RAM on your system.

Homepage: https://pypi.python.org/pypi/SFrame

82 questions
13
votes
4 answers

Sampling a dataframe based on a given distribution

How can I sample a pandas DataFrame or GraphLab SFrame based on a given class/label distribution? E.g., I want to sample a data frame that has a label/class column, selecting rows such that each class label is fetched equally, thereby having a…
stackit
  • 3,036
  • 9
  • 34
  • 62
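
For readers skimming this listing, here is a minimal sketch of one way to draw the same number of rows per class label. It uses only the standard library on a list of dict rows rather than the pandas or SFrame API; the function name `sample_equal_per_class`, the `label` key, and the toy data are all hypothetical.

```python
import random
from collections import defaultdict


def sample_equal_per_class(rows, label_key, n_per_class, seed=None):
    """Return n_per_class randomly chosen rows for every distinct label."""
    rng = random.Random(seed)

    # Group the rows by their class label.
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)

    sampled = []
    for label, group in by_label.items():
        if len(group) < n_per_class:
            raise ValueError(
                "label %r has only %d rows, need %d" % (label, len(group), n_per_class)
            )
        # Sample without replacement inside each class.
        sampled.extend(rng.sample(group, n_per_class))
    return sampled


# Hypothetical toy data: 20 rows labelled "a" and 10 rows labelled "b".
rows = [{"id": i, "label": "a" if i % 3 else "b"} for i in range(30)]
balanced = sample_equal_per_class(rows, "label", n_per_class=5, seed=0)
```

The same group-then-sample idea should carry over to a dataframe library, but the exact calls depend on the library and version in use.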
12
votes
6 answers

Efficient way to get the unique values from 2 or more columns in a Dataframe

Given a matrix from an SFrame:

>>> from sframe import SFrame
>>> sf = SFrame({'x':[1,1,2,5,7], 'y':[2,4,6,8,2], 'z':[2,5,8,6,2]})
>>> sf
Columns:
    x int
    y int
    z int
Rows: 5
Data:
+---+---+---+
| x | y | z |
+---+---+---+
| 1 | 2 |…
alvas
  • 115,346
  • 109
  • 446
  • 738
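
As a rough illustration of what the question is after, the sketch below collects the distinct values across several columns using plain Python on the same toy data. It does not use the SFrame API; the dict-of-lists layout and the helper name `unique_values` are assumptions made for the example.

```python
from itertools import chain

# Same toy data as in the question, held as a plain dict of columns.
columns = {
    "x": [1, 1, 2, 5, 7],
    "y": [2, 4, 6, 8, 2],
    "z": [2, 5, 8, 6, 2],
}


def unique_values(table, names):
    """Return the set of distinct values found in the named columns."""
    return set(chain.from_iterable(table[name] for name in names))


unique_xy = unique_values(columns, ["x", "y"])
unique_all = unique_values(columns, ["x", "y", "z"])
```

Chaining the columns into one pass keeps the work linear in the number of cells, which is the property an efficient SFrame-based answer would presumably also want.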
8
votes
3 answers

How can I use `apply` with a function that takes multiple inputs

I have a function that has multiple inputs, and would like to use SFrame.apply to create a new column. I can't find a way to pass two arguments into SFrame.apply. Ideally, it would take the entry in the column as the first argument, and I would…
user3600497
  • 1,621
  • 1
  • 18
  • 22
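
A common way around a single-argument `apply` is to bind the extra arguments up front, or to apply over whole rows and pick the fields inside the callable. The sketch below shows both patterns with plain Python lists standing in for a column and for rows; `scale_and_shift` and the sample values are hypothetical, and the actual call on an SFrame or SArray is left out.

```python
from functools import partial


def scale_and_shift(value, factor, offset):
    """Hypothetical multi-argument function to apply to each entry."""
    return value * factor + offset


# Pattern 1: bind the extra arguments so the callable takes one input.
column = [1, 2, 3]
transform = partial(scale_and_shift, factor=10, offset=1)
new_column = [transform(v) for v in column]

# Pattern 2: apply over rows (dicts) and read several fields per row.
rows = [{"a": 1, "b": 2}, {"a": 3, "b": 4}]
row_fn = lambda row: scale_and_shift(row["a"], row["b"], offset=0)
row_results = [row_fn(row) for row in rows]
```

Either `transform` or `row_fn` is a one-argument callable, which is the shape an `apply`-style method expects.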
7
votes
2 answers

String object to dateTime object in SFrame

I have a huge dataset of around 20 GB. I have read the data using graphlab.SFrame.read_csv(). I have a date column which is read as a string in the format yyyy-dd-mm, but I want the column to be read as a datetime object. How can I do it? I understand…
Dreams
  • 5,854
  • 9
  • 48
  • 71
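
To make the unusual yyyy-dd-mm layout concrete, here is a small standard-library sketch that parses such strings with `datetime.strptime`. It is not the GraphLab API: the sample strings are made up, and any SFrame-side conversion method (and its format syntax) would need to be checked against the library's own documentation.

```python
from datetime import datetime

# Day comes before month in this layout, so the directive order is %Y-%d-%m.
DATE_FORMAT = "%Y-%d-%m"


def parse_date(text):
    """Parse a 'yyyy-dd-mm' string into a datetime object."""
    return datetime.strptime(text, DATE_FORMAT)


# Hypothetical sample values in the question's format.
raw_dates = ["2015-31-01", "2016-02-12"]
parsed = [parse_date(s) for s in raw_dates]
```

Note that a string such as `2016-02-12` is silently valid under both the yyyy-dd-mm and yyyy-mm-dd readings, so getting the directive order right matters.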
6
votes
4 answers

Filtering and displaying values in GraphLab SFrame?

So, I started working with GraphLab for my machine learning class a week ago. I am still very new to GraphLab, and I read through the API but couldn't quite find the solution I was looking for. So, here is the question. I have this data with multiple…
Lesley
  • 73
  • 1
  • 6
5
votes
1 answer

Is there a GraphLab SFrame.show() equivalent in Pandas?

GraphLab's SFrame.show() displays a comprehensive view of each column in the SFrame, and I was wondering if there is an equivalent in Pandas? df.describe() just doesn't provide the level of detail at a glance that sf.show() does.
wgpubs
  • 8,131
  • 15
  • 62
  • 109
5
votes
4 answers

How to extract rows in sframe where there's a joint condition and two separate conditions?

I have an sframe as such:

+---------+------+-------------------------------+-----------+------------------+
| term_id | lang | term_str | term_type | reliability_code…
alvas
  • 115,346
  • 109
  • 446
  • 738
5
votes
2 answers

Convert graphlab sframe into a dictionary of {key: values}

Given an SFrame as such:

+------+-----------+-----------+-----------+-----------+-----------+-----------+
| X1 | X2 | X3 | X4 | X5 | X6 | X7 …
alvas
  • 115,346
  • 109
  • 446
  • 738
5
votes
1 answer

Efficient splitting of data in Python

Consider the following code:

one, two = sales.random_split(0.5, seed=0)
set_1, set_2 = one.random_split(0.5, seed=0)
set_3, set_4 = two.random_split(0.5, seed=0)

What I am trying to do in this code is to randomly split my data in the Sales SFrame (which is…
Khurram Majeed
  • 2,291
  • 8
  • 37
  • 59
4
votes
0 answers

SFrame for Python 3.4

The project's website says that Python 3.4 is supported (in Readme.md, which is shown as text on that page). However, I fail to see where I could get that Python 3.4-compatible version of SFrame. It's not on anaconda and not on…
Zelphir Kaltstahl
  • 5,722
  • 10
  • 57
  • 86
4
votes
1 answer

Group by in SFrame without installing graphlab

How can I use the groupby operation in SFrame without installing graphlab? I would love to do some aggregation, but in all the examples I have seen on the internet, the aggregation functions come from GraphLab. Like: import graphlab.aggregate as…
Vladimir
  • 369
  • 1
  • 3
  • 12
3
votes
1 answer

How to load sframe format file in pandas?

Is there any way to directly open a file with the .sframe extension in pandas? Something as easy as df = pd.read_csv('people.sframe'). Thank you.
user14596741
3
votes
1 answer

How to create an SFrame compatible with TuriCreate for object detection task

I am trying to create an SFrame containing images and bounding boxes' coordinates, in order to perform object detection using TuriCreate. I have created my own dataset with IBM Cloud Annotations, exported in CreateML format. When I run: usage_data =…
Simone
  • 4,800
  • 12
  • 30
  • 46
3
votes
2 answers

graphlab.SFrame.show() Error: Requested port is unavailable: None

Upon entering the command graphlab.show(); I get the following error: Error: Requested port is unavailable: None. So, I tried to set the port by typing gl.canvas.set_target('ipynb', 8888), but that was not fruitful. I checked the open ports in…
Bakhtawar
  • 107
  • 2
  • 14
3
votes
1 answer

Finding rows with "Not Applicable" value from a specific column from Graphlab SFrame

Given a graphlab.SFrame object with the following column names:

>>> import graphlab
>>> sf = graphlab.SFrame.read_csv('some.csv')
>>> sf.column_names()
['Dataset', 'Domain', 'Score', 'Sent1', 'Sent2']

One could easily drop the rows with "not…
alvas
  • 115,346
  • 109
  • 446
  • 738