Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
16
votes
1 answer

Featuretools: Can it be applied on a single table to generate features even when there is no datetime related column?

The featuretools documentation states in its very first sentence: "Featuretools is a framework to perform automated feature engineering. It excels at transforming temporal and relational datasets into feature matrices for machine learning." This…
Ashok K Harnal
  • 1,191
  • 2
  • 15
  • 28
11
votes
1 answer

How to apply Deep Feature Synthesis to a single table

After processing, my data is one table with several columns that are features and one column which is a label. I would like to use featuretools.dfs to help me predict the label. Is it possible to do it directly, or do I need to split my single table…
The Anh Nguyen
  • 119
  • 1
  • 5
6
votes
2 answers

[featuretools]'EntitySet' object has no attribute 'entity_from_dataframe'

I tried to learn featuretools following documentation from featuretools.com. A error came up: AttributeError: 'EntitySet' object has no attribute 'entity_from_dataframe' Could you help me? Thank you. Code: import featuretools as ft data =…
Enjoy
  • 143
  • 1
  • 6
6
votes
2 answers

Should we exclude target variable from DFS in featuretools?

While passing the dataframes as entities in an entityset and use DFS on that, are we supposed to exclude target variable from the DFS? I have a model that had 0.76 roc_auc score after traditional feature selection methods tried manually and used…
Harish Rajula
  • 699
  • 6
  • 11
5
votes
2 answers

how to make features using featuretools, for the new data(on which we want to make prediction)

I have a single dataframe and want to use featuretools for auto feature engineering part. I am able to do it with normalize entities function. code snippet is below: es = ft.EntitySet(id = 'obs_data') es = es.entity_from_dataframe(entity_id = 'obs',…
Mohit Sharma
  • 590
  • 3
  • 10
5
votes
1 answer

featuretools last_time_index is not set

I've built an entity set and one of the tables in this entity set is called "inspections". I've set the time_index column for this table, but when running dfs, I'm getting the warning "Using training_window but last_time_index is not set on entity…
Nick Bernini
  • 121
  • 4
4
votes
1 answer

Featuretools categorical handling

Featuretools offers integrated functionality to handle categorical variables variable_types={"product_id": ft.variable_types.Categorical} https://docs.featuretools.com/loading_data/using_entitysets.html However should these be strings or…
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
4
votes
2 answers

Creating Entity Set in Featuretools error TypeError: 'str' object does not support item assignment

I have this 3 dataframes: df_train cortado:____________________ SK_ID_CURR TARGET NAME_CONTRACT_TYPE_Cash loans \ 0 100002 1 1 1 100003 0 1 2 100004 …
4
votes
1 answer

Increase performance when calculating feature matrix?

Does calculate_feature_matrix use any libraries such as numba to increase performance?
Jeff Hernandez
  • 2,063
  • 16
  • 20
3
votes
1 answer

Legacy Problems trying FeatureTools exercises

I was trying this exercise in GitHub: https://github.com/WillKoehrsen/automated-feature-engineering/blob/master/walk_through/Automated_Feature_Engineering.ipynb but the code is too old to run on the newest version. The first problem was…
jasonkim
  • 31
  • 1
3
votes
1 answer

Using multiple parent IDs for cutoff times in deep feature synthesis

My data looks like: People <-- Events <--Activities. The parent is People, of which the only variable is the person_id. Events and Activities both have a time index, along with event_id and activity_id, both which have a few features. Members of the…
Gabe
  • 89
  • 5
3
votes
1 answer

featuretools progress bar when running dfs

When using featuretools is there a way to show a progress bar when running dfs?
Ed Polanco
  • 31
  • 1
3
votes
1 answer

using featuretools to create "time window features"

I was trying to automate the creation of "row window features", using featuretools package but I couldn't find an easy way to create them. What I mean with "row window features" is that for each cutoff point I want to create features that extract…
Pablo
  • 3,135
  • 4
  • 27
  • 43
3
votes
1 answer

Create features based on cutoff times in featuretools

Im using featuretools and I need to create a feature that uses the cutoff time for its calculation. My entityset consist in a client table and a subscription table (it has more but for the question only these are necessary): import featuretools as…
3
votes
0 answers

AssertionError: Index is not unique on dataframe (Entity cust) using Featuretools

I have a dataframe as below customerid term age 08a858899538ddb8e015390510b321f0830199897 30 24 18a858959537a097401537a4e316e25f730196361 60 72 a8589c253ace09b0153af6ba58f1f313019822366 45 …
Ian Okeyo
  • 299
  • 1
  • 4
  • 7
1
2 3
14 15