Hi Stack Overflow community. I think I am trying to code the impossible with matplotlib, so if a different Python library would suit me better, please let me know!
I have the entire amino acid sequence of a protein (protein x), represented as capital letters in the image below. This will be my x-axis.
I have two Excel columns: Disease and Control. Each column contains fragments of protein x's amino acid sequence. Sometimes there are multiple hits, where the Disease or Control column contains the same fragment of protein x more than once. I want these to stack on top of each other so that one can see how many hits the disease and control have on protein x.
Confusing? Sorry, here's a sample of what I was able to come up with using PowerPoint.
[Image: Amino Acid Comparison]
The black text is the reference sequence. Purple is control. Pink is disease. Make sense now?
I need to do this with a HUGE dataset, so no, I do not want to "just use PowerPoint for hours". I also want to be able to do it with any reference sequence of my choosing.
I'm not asking someone to do my job for me. I need someone to point me in the right direction. Is there a special library? Should I be converting everything into numbers and then relabeling as text?
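To make the "convert everything into numbers" idea concrete, here is a rough sketch of what I imagine, not a working solution. It looks up where each fragment starts in the reference and assigns it a row so that overlapping or duplicate hits stack. The function names `place_fragments` and `render_rows` and the toy sequence are made up for illustration, and I am assuming each fragment matches the reference at a single position (`str.find` only returns the first match).

```python
def place_fragments(reference, fragments):
    """Return a list of (start, row, fragment) tuples.

    start is the 0-based index of the fragment in the reference;
    row is the stacking level, chosen so fragments in one row never overlap.
    Fragments that do not occur in the reference are skipped.
    """
    located = []
    for frag in fragments:
        start = reference.find(frag)  # first occurrence only (assumption)
        if start != -1:
            located.append((start, frag))
    located.sort()

    placements = []
    row_ends = []  # row_ends[r] = index just past the last fragment in row r
    for start, frag in located:
        for row, end in enumerate(row_ends):
            if start >= end:  # fits after the last fragment in this row
                row_ends[row] = start + len(frag)
                break
        else:  # no existing row has room, so open a new one
            row = len(row_ends)
            row_ends.append(start + len(frag))
        placements.append((start, row, frag))
    return placements


def render_rows(reference, placements):
    """Turn the numeric placements back into text rows aligned to the reference."""
    n_rows = max((row for _, row, _ in placements), default=-1) + 1
    grid = [[" "] * len(reference) for _ in range(n_rows)]
    for start, row, frag in placements:
        grid[row][start:start + len(frag)] = frag
    return ["".join(line) for line in grid]


# Toy data, invented for this example (not my real protein).
reference = "MKTAYIAKQRQISFVK"
disease = ["TAYIA", "TAYIA", "QRQIS"]

print(reference)
for line in render_rows(reference, place_fragments(reference, disease)):
    print(line)
```

If this is a sensible direction, I assume the same `(start, row)` pairs could be used as x/y coordinates for matplotlib's `Axes.text`, with one color for Disease and another for Control, but I have not gotten that far.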
Thanks and I appreciate any advice.