Questions tagged [reproducible-research]

Reproducible research is the idea that the result of scientific research should be published with data and code in order to make it possible for other researchers to verify the results.

Reproducible research may be especially important to you if your investigation involves large amount of data or very complex calculations.

One possible set of tools for reproducible research is using r with sweave or knitr.

Unit tests for functions in a Jupyter notebook?

I have a Jupyter notebook that I plan to run repeatedly. It has functions in it, the structure of the code is this: def construct_url(data): ... return url def scrape_url(url): ... # fetch url, extract data return parsed_data for i…

python unit-testing testing jupyter reproducible-research

asked Oct 21 '16 at 08:52

Richard

62,943
126
334
542

votes

3 answers

Fully reproducible parallel models using caret

When I run 2 random forests in caret, I get the exact same results if I set a random seed: library(caret) library(doParallel) set.seed(42) myControl <- trainControl(method='cv', index=createFolds(iris$Species)) set.seed(42) model1 <-…

r r-caret reproducible-research

asked Nov 15 '12 at 18:01

Zach

29,791
35
142
201

votes

1 answer

Example of using dput()

Being a new user here, my questions are not being fully answered due to not being reproducible. I read the thread relating to producing reproducible code but to avail. Specifically lost on how to use the dput() function. Could someone provide a step…

r reproducible-research

asked Apr 24 '18 at 05:47

Tyler

votes

10 answers

programmatically add cells to an ipython notebook for report generation

I have seen a few of the talks by iPython developers about how to convert an ipython notebook to a blog post, a pdf, or even to an entire book(~min 43). The PDF-to-X converter interprets the iPython cells which are written in markdown or code and…

ipython jupyter-notebook reproducible-research

asked Nov 28 '12 at 21:30

zach

29,475
16
67
88

votes

9 answers

Reproducible results in Tensorflow with tf.set_random_seed

I am trying to generate N sets of independent random numbers. I have a simple code that shows the problem for 3 sets of 10 random numbers. I notice that even though I use the tf.set_random_seed to set the seed, the results of different runs do not…

python tensorflow random-seed reproducible-research

asked Jul 09 '18 at 16:07

Mehdi Rezaie

votes

6 answers

Set working directory in Python / Spyder so that it's reproducible

Coming from R, using setwd to change the directory is a big no-no against reproducibility because others do not have the same directory structure as mine. Hence, it's recommended to use relative path from the location of the script. IDEs slightly…

python spyder reproducible-research

asked Jul 15 '16 at 10:18

Heisenberg

8,386
12
53
102

votes

1 answer

How to save and load random number generator state in Pytorch?

I am training a DL model in Pytorch, and want to train my model in a deterministic way. As written in this official guide, I set random seeds like this: np.random.seed(0) torch.manual_seed(0) torch.backends.cudnn.deterministic =…

pytorch random-seed reproducible-research

asked Mar 11 '19 at 08:17

hajduistvan

votes

2 answers

knitr - error when importing python module

I am having trouble when running the python engine in knitr. I can import some modules but not others. For example I can import numpy but not pandas. {r, engine='python'} import pandas I get the error. Quitting from lines 50-51 (prepayment.Rmd)…

python r knitr sys reproducible-research

asked May 11 '15 at 16:30

Glen Thompson

9,071
4
54
50

votes

11 answers

List of loaded/imported packages in Julia

How can I get a list of imported/used packages of a Julia session? Pkg.status() list all installed packages. I'm interested in the ones that that were imported/loaded via using ... or import ... It seems that whos() contains the relevant…

julia reproducible-research

asked Aug 29 '14 at 20:00

Julian

1,271
2
12
17

votes

1 answer

How can one use Binder (mybinder.org) with private Github repositories?

After reviewing this exact issue (https://github.com/jupyterhub/binderhub/issues/237) it seems that the functionality for this has been implemented with this merged pull request (https://github.com/jupyterhub/binderhub/pull/671). However I can not…

github jupyter-notebook jupyter android-binder reproducible-research

asked Feb 12 '19 at 10:56

tallamjr

1,272
1
16
21

votes

1 answer

An Overview of Nix/OS Architecture?

While the Nix/OS wiki and manuals provide a lot of excellent information, I am still having trouble getting an architectural overview. Apologies for the quantity and naivity of the questions; feel free to answer a subset: 1. What constitutes a Nix…

linux portability nix nixos reproducible-research

asked Aug 12 '16 at 06:22

Ixxie

1,393
1
9
17

votes

0 answers

Better reproductibility of rPackages (pin version of packages) in nix in comparison to guix

I'm actually evaluate different solution to enhance/explore reproductibility in my R/Python scientific workflow : data with reproductible analysis (plot, analysis) and paper. There is, as you know, two big linux flavours offer some solutions : Nix…

r nix reproducible-research nixpkgs guix

asked May 28 '21 at 19:41

reyman64

votes

1 answer

Why are my results still not reproducible?

I want to get reproducible results for a CNN. I use Keras and Google Colab with GPU. In addition to recommendations to insert certain code snippets, which should allow a reproducibility, I also added seeds to the layers. ###### This is the first…

tensorflow keras conv-neural-network google-colaboratory reproducible-research

asked Oct 17 '19 at 13:14

Code Now

votes

2 answers

Using BERT for next sentence prediction

Google's BERT is pretrained on next sentence prediction tasks, but I'm wondering if it's possible to call the next sentence prediction function on new data. The idea is: given sentence A and given sentence B, I want a probabilistic label for…

tensorflow deep-learning reproducible-research nlp

asked Mar 11 '19 at 22:29

Paul

votes

2 answers

Use rmarkdown/knitr to hold all code until the end

I'd like to be able to generate a document using knitr/rmarkdown that keeps all the output together, but leaves the code until the end, ideally as a referenced footnote of sorts (i.e. the code for each figure or output can be looked up in the…

r knitr r-markdown reproducible-research

asked Feb 11 '15 at 15:40

micturalgia

2 3

…

15 16 Next