I have three data frames (Forest, Agriculture, and Urban), each with 1 row and 24145 columns (see the example at the bottom). Each column represents a different molecular formula, and the value in each cell is the relative amount of that formula in the corresponding sample.
I'm trying to figure out the best way to find which molecular formulae are unique to each of the three samples. For example, if one molecular formula (C10H10) has a value of 0.12 for Forest but 0 for both Agriculture and Urban, I want the final result to show that this formula was present only in the Forest sample.
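To make the goal concrete, here is a minimal sketch of one possible approach in Python with pandas. The choice of library is an assumption, and the same logic carries over to other data-frame tools. It assumes the three data frames have already been combined into a single frame with one row per sample, that a value greater than 0 means "present", and that the second example column is meant to read C10H10O4N1S0. The names `samples`, `present`, `in_one_sample`, and `unique_to` are hypothetical.

```python
import pandas as pd

# Toy data mirroring the small example in this post (relative amounts).
# Assumption: rows are samples, columns are molecular formulae.
samples = pd.DataFrame(
    {
        "C10H10O3N1S0": [0.00, 0.00, 1.29],
        "C10H10O4N1S0": [1.44, 0.00, 0.00],
        "C10H10O5N1S0": [0.00, 1.11, 1.33],
        "C10H10O5N1S1": [0.00, 4.94, 0.00],
    },
    index=["Forest", "Agriculture", "Urban"],
)

# Assumption: any value above 0 counts as "present in that sample".
present = samples > 0

# Keep only formulae detected in exactly one of the three samples.
in_one_sample = present.sum(axis=0) == 1

# For those columns every other entry is 0, so the row label of the
# maximum value is the single sample that contains the formula.
unique_to = samples.loc[:, in_one_sample].idxmax(axis=0)

print(unique_to)
```

With the toy data, `unique_to` is a Series indexed by formula whose values are sample names. C10H10O5N1S0 is dropped because it occurs in both Agriculture and Urban.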
Ultimately, I want to plot this result with the oxygen-to-carbon ratio (O/C) on the x-axis and the hydrogen-to-carbon ratio (H/C) on the y-axis, with one point per unique formula, color-coded by the sample in which it was uniquely found.
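The plot coordinates can be derived from the column names themselves. Below is a rough sketch using only Python's standard library. It assumes every element in a formula is followed by an explicit count (as in C10H10O3N1S0) and that the carbon count is never 0. The helper names `element_counts` and `oc_hc_ratios` and the small `unique_to` mapping are hypothetical and stand in for whatever the uniqueness step produces.

```python
import re

def element_counts(formula):
    """Parse e.g. 'C10H10O3N1S0' into {'C': 10, 'H': 10, 'O': 3, 'N': 1, 'S': 0}.

    Assumption: every element symbol is followed by an explicit integer count.
    """
    return {el: int(n) for el, n in re.findall(r"([A-Z][a-z]?)(\d+)", formula)}

def oc_hc_ratios(formula):
    """Return (O/C, H/C) for a formula; assumes the carbon count is non-zero."""
    counts = element_counts(formula)
    carbon = counts["C"]
    return counts.get("O", 0) / carbon, counts.get("H", 0) / carbon

# Hypothetical result of the uniqueness step: formula -> sample it is unique to.
unique_to = {
    "C10H10O3N1S0": "Urban",
    "C10H10O4N1S0": "Forest",
    "C10H10O5N1S1": "Agriculture",
}

# One (x, y, sample) tuple per unique formula, ready for a scatter plot.
points = [(*oc_hc_ratios(formula), sample) for formula, sample in unique_to.items()]

for x, y, sample in points:
    print(f"O/C={x:.2f}  H/C={y:.2f}  {sample}")
```

Each tuple gives the x value, the y value, and the sample label to use for the point's color, so the list could be passed to any scatter-plot routine (for example matplotlib's `scatter`, called once per sample).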
Thanks in advance!
Small example of the input, with the three separate data frames combined into one called Samples (the real input has 24145 molecular formulae, not just the 4 listed here):
             C10H10O3N1S0  C10H10O4N1S0  C10H10O5N1S0  C10H10O5N1S1
Forest               0.00          1.44          0.00          0.00
Agriculture          0.00          0.00          1.11          4.94
Urban                1.29          0.00          1.33          0.00