I have a dataframe with a text column that looks like this:
product
"HI Celebrate Cake White 612612"
"GO Choc Celebrat bl 682222"
"BI Chocolate Buttercream f34 9282222"
"Graduation Cake 28"
"Slab Image Cake 13273"
"Slab Celebration Cake 2556659"
"Grain Bread"
I want to be able to strip everything after the third word. How do I go about doing this with pyspark or spark sql?