I have defined a UserDefinedFunction like this:
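    def function1(instance):
        # Map '--' to '++' and any other value to '++++'
        if instance['Atr1'] == '--':
            return '++'
        else:
            return '++++'

    from pyspark.sql.functions import UserDefinedFunction
    from pyspark.sql.types import StringType

    udf = UserDefinedFunction(lambda instance: function1(instance), StringType())
    udf(df)  # this call raises the error shown below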
Here df is my DataFrame, which has several attributes (columns): 'Atr1', 'Atr2', 'AtrN'...
I get the error:
AttributeError: 'DataFrame' object has no attribute '_get_object_id'
I want to get a column computed from only that attribute ('Atr1'). How could I do it?
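To make the expected result concrete, here is a plain-Python sketch (no Spark involved) of the values I would like the new column to contain. The sample rows and the names `rows` and `expected_column` are made up for illustration; only `function1` and the attribute names come from my code above.

```python
# Hypothetical sample rows standing in for the DataFrame (values are made up).
rows = [
    {"Atr1": "--", "Atr2": 1},
    {"Atr1": "ab", "Atr2": 2},
]


def function1(instance):
    # Same logic as in the question: '--' maps to '++', anything else to '++++'.
    if instance["Atr1"] == "--":
        return "++"
    return "++++"


# One output value per row: this is what I would expect the new column to hold.
expected_column = [function1(row) for row in rows]
print(expected_column)
```

So for each row I only need the value of 'Atr1', not the whole DataFrame.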