I have a pandas DataFrame that I need to convert to an H2O frame. I use the following code:
Code:
# Convert pandas dataframe to H2O frame
start_time = time.time()
input_data_matrix = h2o.H2OFrame(input_df)
logger.debug("3. Time taken to convert H2O Frame- " + str(time.time() - start_time))
Output:
2019-02-05 04:38:55,238 logger DEBUG 3. Time taken to convert H2O Frame- 9320.119945764542
The data frame (i.e. input_df) is 183K rows x 435 columns, with no null or NaN values.
The conversion takes about 2.6 hours (9320 seconds, per the log output above). Is there a better way to perform this operation?
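For comparison, one alternative I am considering is to write the data frame to a CSV file and let H2O parse that file itself, instead of passing the pandas object to h2o.H2OFrame. Below is a minimal sketch of that idea, not something I have verified on this data. It assumes h2o.init() has already been called, that the H2O cluster runs on the same machine so it can read csv_path, and the names timed and pandas_to_h2o_via_csv are my own hypothetical helpers, not part of H2O.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(label, results):
    """Store the wall-clock seconds spent in the with-block under results[label]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        results[label] = time.perf_counter() - start


def pandas_to_h2o_via_csv(df, csv_path):
    """Hypothetical helper: dump df to csv_path, then have H2O parse the file.

    Assumes a running H2O cluster that can see csv_path on its own filesystem.
    """
    import h2o  # imported here so the timing helper works without h2o installed

    df.to_csv(csv_path, index=False)  # drop the pandas index column
    return h2o.import_file(csv_path)
```

Timing both routes with the same helper, for example `with timed("direct", results): h2o.H2OFrame(input_df)` and `with timed("via_csv", results): pandas_to_h2o_via_csv(input_df, "input_df.csv")`, should show whether the CSV route is actually faster here.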