How do I derive the surnames of everyone whose first names have a condition in a pandas data frame?

Question

Using pandas, I want to derive a last name column for a set of first names that are 4 or more characters long.

I have tried these:

data = pd.read_csv("Data.csv")
data

#split the EmployeeName into firstname and lastname

flname = data['EmployeeName'].str.split(expand=True)
flname

#add first name column to data frame
data['FirstName'] = flname[0]

#apply condition on first name
dfname = data['FirstName'].apply(lambda x:x if len(x) \> 4 else None)
dfname = dfname.dropna()

#add last name and new first name columns to data frame
data['LastName'] = flname[0]
data['NewFirstName'] = dfname
data

#This is the wrong bit that throws an error
derived_name = data.apply(lambda x:x if data\['FirstName'\] in data\['NewFirstName'\] else None)
derived_name.dropna()

#TypeError: unhashable type: 'Series'

#Are there shorter ways of writing these code lines with pandas?

Please add an example of your data (csv) – gtomer May 29 '23 at 10:29 — gtomer, May 29 '23 at 10:29

score 0 · Answer 1 · edited May 29 '23 at 10:40

0

I resolved this with answer to question 1387.

df = data[data['NewFirstName'].notna()]
df
df['LastName']

Thanks all. But are there shorter ways to answer the question?

edited May 29 '23 at 10:40

gtomer

5,643
1
10
21

answered May 29 '23 at 10:34

Bumze

1
2

Do not post redundant (line 2) code that might mislead others. – ניר May 30 '23 at 04:18
Noted. Thanks for mentioning. – Bumze May 31 '23 at 09:18

Ajey Dikshit · Answer 2 · 2023-05-29T11:15:00.847

0

Splitting the data

data[['Firstname', 'Lastname']] = data['EmployeeName].str.split(expand=True)

After splitting the name columns, you should use masking as it makes this very easy.

data[data['Firstname'].str.len() >= 4]['Lastname']

should give you the desired output

edited May 29 '23 at 11:15

answered May 29 '23 at 11:08

Ajey Dikshit

21
3

Thanks for the short method @Ajey D. `data[data['FirstName'].str.len() >= 4]['LastName']` – Bumze May 29 '23 at 14:47

How do I derive the surnames of everyone whose first names have a condition in a pandas data frame?

2 Answers2