I have two columns of data (say id and master_id) in R. Every value in id should also be present in master_id, but I suspect that is not the case, and I want to identify which values are erroneous. I cannot just inspect the data by eye, as I am dealing with on the order of 100k rows.
How do I go about finding the erroneous values?
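
To make the question concrete, here is a minimal sketch of the kind of check I mean. It assumes the two columns sit in a data frame called df (a made-up name, with made-up toy values), and uses base R's setdiff and %in% as one possible approach, not necessarily the best one for the real data:

```r
# Toy data: df and its values are hypothetical stand-ins for the real ~100k rows
df <- data.frame(
  id        = c(1, 2, 3, 7, 9),
  master_id = c(1, 2, 3, 4, 5)
)

# Distinct id values that never appear anywhere in master_id
missing_ids <- setdiff(df$id, df$master_id)

# Full rows whose id is absent from master_id (keeps duplicates and row positions)
bad_rows <- df[!(df$id %in% df$master_id), ]

print(missing_ids)
print(bad_rows)
```

setdiff gives the unique offending values, while the %in% filter keeps every offending row, which may be more useful if the same bad id occurs more than once.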