I'm curious what the best / fastest ways to combine multiple tables / data.frames is if you do not want duplicate rows.
I have files with tens of millions of entries, however, there may be duplicates across files.
File A
IDNum Name Time
123 Light 12:30
234 Lind 12:30
123 Light 1:00
File B
IDNum Name Time
123 Light 1:00
123 Light 12:30
234 Lind 12:30
File C
IDNum Name Time
666 Ryuk 2:00
123 Light NA
Desired Output
IDNum Name Time
123 Light NA
123 Light 12:30
123 Light 1:00
234 Lind 12:30
666 Ryuk 2:00
I know there are a lot of ways to do this using various methods of join/merge, but is there any specific way which is fast? I have hundreds of files, with multiple millions of rows, with some unknown, likely large, amount of duplicates.