Following "InfogainLoss layer", I replaced SoftmaxWithLoss with InfogainLoss from the branch https://github.com/shaibagon/caffe/tree/upgrade_infogain (which robustly combines the softmax layer and the infogain loss layer).
Now every prediction is the first class only.
Any ideas what could be causing this?
Additional info: my solver, net, H matrix generator, and log:
https://drive.google.com/a/smedx.com/file/d/0B4lunYl8YWUOQ3U3NzN6Tll5NEE/view?usp=sharing
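
For reference, here is a minimal NumPy sketch of what I understand the combined softmax + infogain loss to compute: the mean over samples of -sum_k H[label, k] * log(p_k). This is not the code from the branch; the function names, the toy scores, and the clipping threshold are assumptions made for illustration. It may be useful as a sanity check on H: with H set to the identity matrix, the value should match the plain softmax cross-entropy loss.

```python
import numpy as np

def softmax(scores):
    # Subtract the row-wise max before exponentiating for numerical stability.
    shifted = scores - scores.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def infogain_loss(scores, labels, H):
    # Mean over samples of -sum_k H[label, k] * log(p_k).
    probs = softmax(scores)
    # Clip to avoid log(0); the threshold here is an arbitrary assumption.
    log_probs = np.log(np.clip(probs, 1e-20, None))
    return -(H[labels] * log_probs).sum(axis=1).mean()

# Hypothetical toy data: 3 samples, 3 classes (not taken from my net).
scores = np.array([[2.0, 0.5, -1.0],
                   [0.1, 1.5, 0.3],
                   [0.0, 0.0, 3.0]])
labels = np.array([0, 1, 2])

# With H = identity, only the true-class term survives in each row.
loss_identity = infogain_loss(scores, labels, np.eye(3))

# Plain softmax cross-entropy computed directly, for comparison.
cross_entropy = -np.log(softmax(scores)[np.arange(3), labels]).mean()

print(loss_identity, cross_entropy)
```

If the two printed values differ for an identity H, the problem is in the loss computation; if they agree, the H matrix itself (for example, rows that put most of their weight on the first column) is the next thing I would check.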