I can successfully use foverlaps
with a small sample of my dataset, but when use the full data (data.tables with over 30k rows), it breaks down and throws the following error:
Error message:
Error in if (any(x[[xintervals[2L]]] - x[[xintervals[1L]]] < 0L)) stop("All entries in column ", :
missing value where TRUE/FALSE needed
The way I am interpreting the error message is that there are no overlaps between the two data.tables.
Q1-Am I interpreting the message well?
Q2-Any idea why this might happen with the larger dataset? Is it possible that this is due to the size of the dataset?
I do have a lot of unique values, which according to foverlaps
help file, can be expected to slow things down proportionally, but not before it get into millions of rows, which is far from being the case here. Thank you.