I have read the thread "Should Feature Selection be done before Train-Test Split or after?", and the explanation given there was very good. However, is it a must? I mean, if I run feature selection on the whole dataset without splitting it first, do I always overfit?
For example, I ran Boruta on my whole dataset, and it selected 23 features. I also ran Boruta separately on only my train set and on only my test set, which gave 15 and 11 features, respectively.
How can I tell whether I am overfitting or not?
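
To make the question concrete, here is a minimal sketch of the kind of check I have in mind. It is not my actual setup: it does not use Boruta (a simple correlation filter stands in for it), the data is synthetic pure noise, and the helper names `top_k_by_correlation` and `centroid_accuracy` are made up for illustration. Since the labels are unrelated to the features, any test accuracy well above 50% should only be explainable by the selection step having seen the test rows.

```python
import random


def top_k_by_correlation(X, y, k):
    """Return indices of the k features with the largest |correlation| with y.

    Stand-in for Boruta (an assumption for this sketch, not the real algorithm).
    The score is proportional to the absolute Pearson correlation; the standard
    deviation of y is the same for every feature, so it is left out.
    """
    n = len(y)
    y_mean = sum(y) / n
    scores = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        x_mean = sum(col) / n
        cov = sum((a - x_mean) * (b - y_mean) for a, b in zip(col, y))
        var = sum((a - x_mean) ** 2 for a in col)
        scores.append(abs(cov) / (var ** 0.5 + 1e-12))
    return sorted(range(len(scores)), key=lambda j: -scores[j])[:k]


def centroid_accuracy(X_train, y_train, X_test, y_test, features):
    """Fit a nearest-centroid classifier on `features`; return test accuracy."""
    def centroid(label):
        rows = [r for r, t in zip(X_train, y_train) if t == label]
        return [sum(r[j] for r in rows) / len(rows) for j in features]

    c0, c1 = centroid(0), centroid(1)
    correct = 0
    for row, target in zip(X_test, y_test):
        d0 = sum((row[j] - c) ** 2 for j, c in zip(features, c0))
        d1 = sum((row[j] - c) ** 2 for j, c in zip(features, c1))
        correct += int((1 if d1 < d0 else 0) == target)
    return correct / len(y_test)


# Synthetic data: Gaussian noise features and balanced labels unrelated to them.
rng = random.Random(0)
n_samples, n_features, k = 100, 2000, 20
X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_samples)]
y = [i % 2 for i in range(n_samples)]

# Simple hold-out split: first 60 rows for training, last 40 for testing.
X_train, y_train = X[:60], y[:60]
X_test, y_test = X[60:], y[60:]

# Selection on the whole dataset (the test rows influence which features win).
leaky_features = top_k_by_correlation(X, y, k)
# Selection on the training rows only.
honest_features = top_k_by_correlation(X_train, y_train, k)

leaky_acc = centroid_accuracy(X_train, y_train, X_test, y_test, leaky_features)
honest_acc = centroid_accuracy(X_train, y_train, X_test, y_test, honest_features)

print("test accuracy, selection on whole dataset:", leaky_acc)
print("test accuracy, selection on train set only:", honest_acc)
```

If the first number comes out clearly higher than the second on data like this, I would read the gap as optimism introduced by selecting on the whole dataset rather than as real signal. Is that the right way to think about it for Boruta as well?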