How to apply LabelEncoder for a specific column in Pandas dataframe

Question

I have a dataset loaded by dataframe where the class label needs to be encoded using LabelEncoder from scikit-learn. The column label is the class label column which has the following classes:

[‘Standing’, ‘Walking’, ‘Running’, ‘null’]

To perform label encoding, I tried the following but it does not work. How can I fix it?

from sklearn import preprocessing
import pandas as pd

df = pd.read_csv('dataset.csv', sep=',') 
df.apply(preprocessing.LabelEncoder().fit_transform(df['label']))

If you just run `preprocessing.LabelEncoder().fit_transform(df['label'])` on its own, outside of `apply()`, do you get the encoded labels? — andrew_reece, May 09 '18 at 17:29
Yes you are right, the error disappears but I don't see encoding! The classes are not transformed. That's why I use `apply()` so that the transformation applied in the dataframe — Kristofer, May 09 '18 at 17:34
`apply()` accepts a function, which it will apply to the each point. Here you are sending the transformed data to `apply()`, not a function and hence the error. — Vivek Kumar, May 10 '18 at 05:35

niraj · Accepted Answer · 2018-05-09T17:45:00.800

61

You can try as following:

le = preprocessing.LabelEncoder()
df['label'] = le.fit_transform(df.label.values)

Or following would work too:

df['label'] = le.fit_transform(df['label'])

It will replace original label values in dataframe with encoded labels.

edited May 09 '18 at 17:45

answered May 09 '18 at 17:39

niraj

17,498
4
33
48

Thank you for your answer. I think there is an error `AttributeError: 'DataFrame' object has no attribute 'label'`. I am using Python 3.6 – Kristofer May 09 '18 at 17:42
Is `label` not the column in `dataframe`? or did it work? – niraj May 09 '18 at 17:43
The column `label` is the class label containing one of these values `[‘Standing’, ‘Walking’, ‘Running’, ‘null’]` – Kristofer May 09 '18 at 17:44
YES! `df['label'] = le.fit_transform(df['label'])` works! Thank you very much – Kristofer May 09 '18 at 17:45
1

`df['label'] = le.fit_transform(df['label'])` worked perfectly, thanks. – Sameen Mar 09 '23 at 11:00

Darshan Jain · Answer 2 · 2020-04-28T07:45:43.483

4

You can also do:

from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
df.col_name= le.fit_transform(df.col_name.values)

where col_name = the feature that you want to label encode

edited Apr 28 '20 at 07:45

answered Apr 27 '20 at 14:05

Darshan Jain

781
9
19

1

even better it would be `df.col_name.values` – seralouk Apr 28 '20 at 07:43

score 2 · Answer 3 · answered Oct 01 '21 at 10:08

2

 from sklearn.preprocessing import LabelEncoder
 le = LabelEncoder()
 X[:, 2] = le.fit_transform(X[:, 2])

this could be helpful if you want to change the particular column in your CSV data

answered Oct 01 '21 at 10:08

METTA APPALA GANESH KUMAR

21
2

How to apply LabelEncoder for a specific column in Pandas dataframe

3 Answers3