I have a dataframe (df
) containing approximately 100 soccer player numbers (if more players sign-up, the number increases). Each player_number
consists of 6 digits (e.g. 178530).
Every player should review 5 other players, so eventually all players are reviewed by 5 others. Therefore I would like to randomly assign 5 different player numbers (from the player_number
column) to each player_number
. To prevent assigning reviews to themselves and/or players having to review the same player twice (or more), each player_number
should only occur once in every column and in every row. The dataframe should look like this:
player_number review1 review2 review3 review4 review5
178530 207145 655600 443274 604060 804226
245678 947821 214525 332324 174589 868954
…
Player 178530 needs to review players 207145, 655600 etc.
For review1 column, I have used:
set.seed(1)
df$review1 <- sample(df$player_number, nrow(df), replace=F)
This works for review1
, but applying it to the other review columns leads to duplicate player_number
in several rows. Can anyone help me out so each player_number
only occurs once in every column and in every row? Thanks in advance.
Edit: in a previous version I simplified the player_number too much (1:100)