I want to train regression model(not classification) which outputs are continuous numbers.
Let's say I have input variable, X, which ranges between -70 to 70. And I have output variable, Y, which ranges between -5 to 5. X has 39 features and Y has 16 features and they have 50,000 examples each. Then I would like to train them with deep belief network(DBN) in python.
I used the script in theano homepage which described DBN with MNIST data(classification). http://deeplearning.net/tutorial/DBN.html
Could you tell me what specific lines that I have to change in order to apply regression problem which I explained above?
For example I have to change...
- sigmoid function to tanh function. : I heard tanh activation function works better than sigmoid in regression. right?
- instead of using negative-log likelihood, I have to use least-square error....?
- Input and output data normalization by zscore?
Please please let me know if you have any idea to solve this problem... Most of the machine learning examples are based on MNIST hand digit classification. I would be happy if you recommend me some nice blogs or homepage where I can get useful information related to regression.
Thank you advance.