I have dataframe as follows: I wanted to get the count of D, T and N in new columns of dataframe as Dcount TCount Ncount
data = {'CHROM':['chr1', 'chr2', 'chr1', 'chr3', 'chr1','chr1', 'chr2', 'chr1'],
'POS':[939570,3411794,1043223,22511093,24454031,3411794,22511093,1043223],
'MI':['T', 'T', 'D', 'D', 'T', 'N', 'D', 'N'],
'CSK':['D', 'D', 'N', 'T', 'N', 'D', 'T', 'T'],
'DD':['N', 'D', 'D', 'D', 'T', 'N', 'D', 'N'],
'RR':['D', 'T', 'N', 'T', 'D', 'D', 'T', 'N'],
'RCB':['D', 'D', 'D', 'D', 'D', 'D', 'D', 'D'],
'DC':['D', 'D', 'T', 'D', 'D', 'D', 'N', 'D']
}
df1 = pd.DataFrame(data)
df1
CHROM POS MI CSK DD RR RCB DC
0 chr1 939570 T D N D D D
1 chr2 3411794 T D D T D D
2 chr1 1043223 D N D N D T
3 chr3 22511093 D T D T D D
4 chr1 24454031 T N T D D D
5 chr1 3411794 N D N D D D
6 chr2 22511093 D T D T D N
7 chr1 1043223 N T N N D D
I want to get the count of T
, D
, N
in a new dataframe.
Expected output:
CHROM POS MI CSK DD RR RCB DC Dcount Tcount Ncount
0 chr1 939570 T D N D D D 4 1 1
1 chr2 3411794 T D D T D D 4 2 0
2 chr1 1043223 D N D N D T 3 1 2
3 chr3 22511093 D T D T D D 4 2 0
4 chr1 24454031 T N T D D D 3 2 1
5 chr1 3411794 N D N D D D 4 0 2
6 chr2 22511093 D T D T D N 3 2 1
7 chr1 1043223 N T N N D D 2 1 3