I have an autocorrelation problem in my panel data. So I decided to use first difference method so deal with this problem.
Most of my independent variables are binary. So if I do the finite difference method over this, I get -1, 0, and 1 instead of 0 or 1 as before.
Is this ok?
Besides, my data set time flow is as follows which I am not sure how I can apply first difference method in this case when I have multiple difference incidents happening on the same day:
Date ID X Y Z L M A B C D E
01/01/2017 A 0 1 0 0 0 0 1 0 0 7.8
01/01/2017 A 0 1 0 0 0 1 0 0 1 6.5
01/01/2017 B 0 0 0 0 1 1 0 0 1 6.5
01/03/2017 A 0 1 0 0 0 0 0 0 0 7.8
01/04/2017 C 0 0 1 0 0 1 0 0 0 6.5
01/04/2017 C 0 0 0 0 0 0 1 0 0 7.3
I sort this again according to Date and ID which become as follows:
Date ID X Y Z L M A B C D E
01/01/2017 A 0 1 0 0 0 0 1 0 0 7.8
01/01/2017 A 0 1 0 0 0 1 0 0 1 6.5
01/01/2017 B 0 0 0 0 1 1 0 0 1 6.5
01/03/2017 A 0 1 0 0 0 0 0 0 0 7.8
01/04/2017 C 0 0 1 0 0 1 0 0 0 6.5
01/04/2017 C 0 0 0 0 0 0 1 0 0 7.3
Besides, Is this new data sorting ok to use in my Panel regression and also take the first difference over this utilizing this row sequence?