When we say "non-linearity of deep neural networks", what do we actually mean by the term "non-linearity" in this context ?
Also, the purpose of the activation function is to introduce non-linearity into the network. What does this non-linearity means ? (I am new to Deep learning.)