I have a dataset like:
order_id | order_item_id | category
1 | 1 | book
1 | 2 | pen
1 | 3 | book
now I have to remove the order_item_id
and its row that has duplicate value on category columns but still leave 1 of them. How can i achieve that?