This is related to:
How to create example data set from private data (replacing variable names and levels with uninformative place holders)?
Which stems from:
How to make a great R reproducible example?

I have come to realize that having an anonymous data set (where the data and labels are uninformative but consistent with the original data) is half the battle in producing a reproducible example (for questions or bug reports) from a script and data you cannot share (e.g. proprietary information, unpublished findings, etc.).

Any suggestions as to how to automate the translation of a script so that it matches the variables in the anonymous data frame created using one of the answers provided at this Stack Overflow answer?

This may be more of a challenge than it is worth, but if it went into widespread use (say as a package), it could reduce the occurrence of questions and bug reports that do not have a reproducible example.
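To make the idea concrete, here is a minimal sketch of the kind of translation meant above, assuming the anonymisation step yields a lookup from original names to placeholder names. The function `anonymise_script` and the names in `name_map` are hypothetical, made up for illustration. It replaces whole identifiers only, using plain regular expressions, so it would also rewrite matching words inside comments and unrelated strings.

```r
# Hypothetical helper: rename identifiers in R source text.
# `code` is a character vector of script lines; `name_map` is a named
# character vector (names = original identifiers, values = placeholders).
anonymise_script <- function(code, name_map) {
  # Match an original name only when it is not part of a longer identifier.
  # \Q...\E quotes each name literally, so dots in names are not wildcards.
  pattern <- paste0(
    "(?<![A-Za-z0-9._])(?:",
    paste0("\\Q", names(name_map), "\\E", collapse = "|"),
    ")(?![A-Za-z0-9._])"
  )
  hits <- gregexpr(pattern, code, perl = TRUE)
  # Replace every match in a single pass, so a placeholder that happens to
  # equal another original name is never renamed a second time.
  regmatches(code, hits) <- lapply(
    regmatches(code, hits),
    function(found) unname(name_map[found])
  )
  code
}

# Made-up script and mapping, for illustration only.
script <- c(
  'fit <- lm(yield ~ treatment, data = trial)',
  'plot(trial$yield, trial[["treatment"]])'
)
name_map <- c(yield = "V1", treatment = "V2", trial = "df")

writeLines(anonymise_script(script, name_map))
# fit <- lm(V1 ~ V2, data = df)
# plot(df$V1, df[["V2"]])
```

A more robust version would probably work on the parse tree (for example via `utils::getParseData`) instead of on raw text, so that only symbols and column-name strings are touched.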

Etienne Low-Décarie
  • There are so many example datasets in R, I would be surprised if not one of them matched, at least to some extent, your scenario. – Brandon Bertelsen Nov 29 '12 at 00:36
  • It, of course, depends on the type of data. If you have a data set based on means and deviations around the mean, one option could be either to use `rnorm` or bootstrap on the values and change the axis labels to something generic. This is not a new trick in StackExchange. – Mikko Nov 29 '12 at 09:34
  • As indicated, the data is no longer the problem. It is matching the new anonymous data to a complex script that is the problem (mapping all the new variable names). – Etienne Low-Décarie Dec 05 '12 at 17:23

0 Answers