I am brand new to Tensorflow and trying to incorporate it into my project. I am using a raspberry pi4 with a camera to detect change from my bird feeder, take a picture, and then identify that bird with tensorflow. I am using the Birds_V1 to accomplish this.
I am curious as to how to interpret the input and output. According to the overview the input is:
expected to be 3-channel RGB color images of size 224 x 224, scaled to [0, 1].
I am confused by what is meant by scaled to [0,1] Furthermore, the output
image_classifier: A probability vector of dimension 965, corresponding to a background class and 964 bird species in the labelmap.
I am completely lost by what is meant here.
Lastly, I ran interpreter.get_input_details() and interpreter.get_output_details() to see what it would output to me.
I got this printed back, with the top being the input and bottom the output:
[{'name': 'module/hub_input/images_uint8', 'index': 170, 'shape': array([ 1, 224, 224, 3]), 'dtype': <class 'numpy.uint8'>, 'quantization': (0.0078125, 128), 'quantization_parameters': {'scales': array([0.0078125], dtype=float32), 'zero_points': array([128]), 'quantized_dimension': 0}}]
[{'name': 'module/prediction', 'index': 171, 'shape': array([ 1, 965]), 'dtype': <class 'numpy.uint8'>, 'quantization': (0.00390625, 0), 'quantization_parameters': {'scales': array([0.00390625], dtype=float32), 'zero_points': array([0]), 'quantized_dimension': 0}}]
I don't know how to interpret this, and was wondering if I should be going off of this instead or ignoring it if its not important.
I appreciate any clarification that can be given to any part of this, and appreciate any sources that you may deem useful for solving this. I have looked for focumentation for help, but haven't gotten anywhere yet.