0
print(len(df))
264764
df = df.merge(df2[['id','Re']], on=['id'], suffixes=('', '_fwd1'), how='left')
print(len(df))
2714684

why would this result in more data than df originally had? Even when i do how='inner' which should be the intersection of keys i get more observations.

martineau
  • 119,623
  • 25
  • 170
  • 301
ben shapiro
  • 111
  • 1
  • 7

0 Answers0