I am trying to create a multi-view convolutional neural network that starts off separately applying convolutional and pooling layers to each of the inputs. The structure would look something like this.
Just as an example, I would want to have the top, bottom, left, and right view of a cat, apply convolutional and pooling layers to each separately, and then combine this info later.
I have been mostly using Keras on this project, however I cannot seem to figure this out with what Keras has to offer. Is there any way to do this with Keras, and if not, how can I create the first step of this multi-view CNN?