How to convert object columns to string and use replace?

Question

I have some columns ['subject', 'H.period', 'DD.period.t'] etc. Actually all columns are object type.

dtype printscreen

How can i convert these columns into string type?

And how can i use .replace for converting "," into "." in a csv file? I need to use these data in machine learning K Neighbors algorithm.

sacuL · Accepted Answer · 2018-09-12T16:40:39.787

There is no string dtype in pandas. As noted in the docs:

Note When working with heterogeneous data, the dtype of the resulting ndarray will be chosen to accommodate all of the data involved. For example, if strings are involved, the result will be of object dtype. If there are only floats and integers, the resulting array will be of float dtype.

As far as replacing , for . in your whole dataframe, Use replace with regex = True:

df = df.replace(',','.',regex=True)
# or
df.replace(',','.',regex=True, inplace = True)

For example: If your dataframe df looks like:

>>> df
  col1         col2
0  x,x    blah,blah
1  y,z  hello,world
2  z.z       ,.,.,.

Then:

df = df.replace(',','.',regex=True)
>>> df
  col1         col2
0  x.x    blah.blah
1  y.z  hello.world
2  z.z       ......

Thanks a lot, solved my problem! – Alex Colombari Sep 12 '18 at 16:49 — Alex Colombari, Sep 12 '18 at 16:49

score 0 · Answer 2 · answered Sep 12 '18 at 16:44

Although the dtype is indeed 'object', when applying the type() function to the columns labels individually you will find out that they do actually belong to the class 'str'. So that is fine.

As to your question about replacement I would suggest something like this:

length = len(df[df.columns[0]])
for column in df.columns:
     for index in range(length):
          df[column][index] = df[column][index].replace(",",".")

How to convert object columns to string and use replace?

2 Answers2