I ran into an issue while using a conditional function to create a new column in my dataframe.
The function splits the text based on an if statement, but applying it raises ValueError: The truth value of a Series is ambiguous.
    def split_name(Name):
        if (test_data['Tussenvoegsel'].str.len() != 0):
            temp = str(re.findall(r'(\.[^.]*)$', Name))
            return (temp[3:100])
        else:
            return ("")

    test_data['Achternaam'] = test_data['Naam'].apply(lambda x: split_name(x))
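A note on where the error most likely comes from: the `if` inside `split_name` compares the entire `Tussenvoegsel` column rather than the value for the current row, so the condition is a Series of booleans and Python cannot reduce it to a single True/False. The snippet below is a minimal reproduction under that assumption; the small `tussenvoegsel` Series is made up for illustration and stands in for the real column.

```python
import pandas as pd

# Hypothetical stand-in for test_data['Tussenvoegsel']
tussenvoegsel = pd.Series(["", "van der", "van"])

# Comparing a whole column yields one boolean per row, not a single value
condition = tussenvoegsel.str.len() != 0

raised = False
message = ""
try:
    if condition:  # Python calls bool(condition) here, which pandas refuses
        pass
except ValueError as exc:
    raised = True
    message = str(exc)
    print(message)
```

So the condition needs to look at a single row's value instead of the whole column.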
The dataframe it is applied to looks like this:
       Naam                      Tussenvoegsel
    0  Dhr. V. Andersen
    1  Mevr. J.C. van der Kosan  van der
    2  Dhr. P.M.M. van Zomer     van
    3  Mevr. M.J.J. Raimondo
    4  Mevr. E. van de Doorn     van de
Example of the expected outcome:
       Naam                      Tussenvoegsel  Achternaam
    0  Dhr. V. Andersen                         Andersen
    1  Mevr. J.C. van der Kosan  van der
    2  Dhr. P.M.M. van Zomer     van
    3  Mevr. M.J.J. Raimondo                    Raimondo
    4  Mevr. E. van de Doorn     van de
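For reference, here is a minimal sketch of one way this outcome could be produced. It rests on a few assumptions: `Achternaam` should only be filled when the row has no tussenvoegsel (as in the expected outcome above), a missing tussenvoegsel is stored as an empty string or NaN, and the surname is whatever follows the last dot in `Naam`. The function receives a whole row via `apply(..., axis=1)`, so the `if` tests one value at a time.

```python
import re
import pandas as pd

# Sample data rebuilt from the rows shown in the question
test_data = pd.DataFrame({
    "Naam": [
        "Dhr. V. Andersen",
        "Mevr. J.C. van der Kosan",
        "Dhr. P.M.M. van Zomer",
        "Mevr. M.J.J. Raimondo",
        "Mevr. E. van de Doorn",
    ],
    # Assumption: an absent tussenvoegsel is an empty string
    "Tussenvoegsel": ["", "van der", "van", "", "van de"],
})


def split_name(row):
    """Return the text after the last '.' in Naam when the row has no tussenvoegsel."""
    tussenvoegsel = row["Tussenvoegsel"]
    # Non-empty string: leave Achternaam blank (NaN or "" falls through)
    if isinstance(tussenvoegsel, str) and tussenvoegsel.strip():
        return ""
    match = re.search(r"\.([^.]*)$", row["Naam"])
    return match.group(1).strip() if match else ""


# axis=1 passes each row to split_name, so the condition is a plain bool
test_data["Achternaam"] = test_data.apply(split_name, axis=1)
print(test_data)
```

Row-wise `apply` stays close to the original function; a vectorized alternative with `str.extract` and a boolean mask would likely be faster on large frames.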
I've tried the split method, both with and without a lambda, but I can't get it working.
Thanks!