Questions tagged [higher]

5 questions
3 votes, 0 answers

Adafactor from Hugging Face transformers only works with Transformers - does it not work with ResNets and MAML with higher?

To reproduce: I am running the MAML (with higher) meta-learning algorithm with a ResNet. I see this gives issues in my script (error message pasted below). Is Adafactor not supposed to work with ResNets or other models? Steps to reproduce the…
1 vote, 2 answers

What is the official implementation of first order MAML using the higher PyTorch library?

After noticing that my custom implementation of first order MAML might be wrong, I decided to google what the official way to do first order MAML is. I found a useful GitHub issue that suggests stopping the tracking of higher-order gradients. Which makes…
1 vote, 1 answer

When should one call .eval() and .train() when doing MAML with the PyTorch higher library?

I was going through the Omniglot MAML example and saw that they have net.train() at the top of their testing code. This seems like a mistake, since it means the stats from each task at meta-testing are shared: def test(db, net, device, epoch, log): …
Charlie Parker • 5,884 • 57 • 198 • 323
0 votes, 0 answers

How does one use the mean and std from training in Batch Norm?

I wanted to use the means and stds from training rather than batch stats, since it seems that if I use batch statistics my model diverges (as outlined here: When should one call .eval() and .train() when doing MAML with the PyTorch higher library?). How does…
Charlie Parker • 5,884 • 57 • 198 • 323
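
For readers skimming this listing, the difference between batch statistics and running (training) statistics that the question above asks about can be sketched without any dependencies. This is a toy, single-feature illustration and not PyTorch's implementation: the class name `ToyBatchNorm` and its attributes are hypothetical, and the momentum of 0.1 and the unbiased variance in the running update are assumptions chosen to mirror the usual PyTorch defaults.

```python
class ToyBatchNorm:
    """Toy single-feature batch norm; illustrative only, not the PyTorch implementation."""

    def __init__(self, momentum=0.1, eps=1e-5):
        self.momentum = momentum      # assumed default, mirrors nn.BatchNorm's usual value
        self.eps = eps
        self.running_mean = 0.0       # statistics accumulated during training
        self.running_var = 1.0
        self.training = True

    def train(self):
        self.training = True
        return self

    def eval(self):
        self.training = False
        return self

    def __call__(self, xs):
        if self.training:
            # Training mode: normalize with the statistics of the current batch...
            n = len(xs)
            mean = sum(xs) / n
            var = sum((x - mean) ** 2 for x in xs) / n
            # ...and fold them into the running averages as a side effect.
            unbiased_var = var * n / (n - 1) if n > 1 else var
            m = self.momentum
            self.running_mean = (1 - m) * self.running_mean + m * mean
            self.running_var = (1 - m) * self.running_var + m * unbiased_var
        else:
            # Eval mode: normalize with the stored running statistics only.
            mean, var = self.running_mean, self.running_var
        return [(x - mean) / (var + self.eps) ** 0.5 for x in xs]


bn = ToyBatchNorm()

bn.train()
bn([1.0, 2.0, 3.0])                   # updates running_mean and running_var
stats_after_training = (bn.running_mean, bn.running_var)

bn.eval()
eval_out = bn([10.0])                 # a single point works: no batch stats needed
stats_after_eval = (bn.running_mean, bn.running_var)
```

In this sketch, calling the layer in eval mode leaves the running statistics untouched and still works on a batch of one, whereas train mode would normalize a single point against itself and also shift the running averages.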
0 votes, 2 answers

How to have batch norm not forget the batch statistics it just used in PyTorch?

I am in an unusual setting where I should not use running statistics (as that would be considered cheating, e.g. in meta-learning). However, I often run a forward pass on a set of points (5 in fact) and then I want to evaluate only on 1 point using the…
Charlie Parker • 5,884 • 57 • 198 • 323