Extracting information from Pandas dataframe column headers

Question

I have a pandas dataframe with column headers, which contain information. I want to loop through the column headers and use logical operations on each header to extract the columns with the relevant information that I have.

my df.columns command gives something like this:

['(param1:x)-(param2:y)-(param3:z1)',
'(param1:x)-(param2:y)-(param3:z2)',
'(param1:x)-(param2:y)-(param3:z3)']

I want to select only the columns, which contain (param3:z1) and (param3:z3).

Is this possible?

Try `df.columns.str.contains('param3:z1|param3:z3')` – Space Impact Aug 19 '20 at 20:34 — Space Impact, Aug 19 '20 at 20:34

score 1 · Answer 1 · answered Aug 19 '20 at 20:42

1

You can use filter:

df = df.filter(regex='z1|z3')

answered Aug 19 '20 at 20:42

NYC Coder

7,424
2
11
24

Hi NYC Coder, this has been successful on my sample data. Tomorrow I shall try it on my full dataset and see if it works. Many thanks. – mel el Aug 19 '20 at 21:31

Extracting information from Pandas dataframe column headers

1 Answers1