Here's the link to the paper regarding MobileNet V3.
According to the paper, h-swish and Squeeze-and-excitation module are implemented in MobileNet V3, but they aim to enhance the accuracy and don't help boost the speed.
h-swish is faster than swish and helps enhance the accuracy, but is much slower than ReLU if I'm not mistaken.
SE also helps enhance the accuracy, but it increases the number of parameters of the network.
Am I missing something? I still have no idea how MobileNet V3 can be faster than V2 with what's said above implemented in V3.
I didn't mention the fact that they also modify the last part of their network as I plan to use MobileNet V3 as the backbone network and combine it with SSD layers for the detection purpose, so the last part of the network won't be used.
The following table, which can be found in the paper mentioned above, shows that V3 is still faster than V2 is.