I am using this sed command to strip my documents of every character I don't need:
sed 's/[^a-zA-Z]/ /g'
However, after mining my data a bit I realized a pretty basic mistake: not including ' cuts all my "don't"s into "don t"s, which sucks.
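A minimal way to reproduce the problem, assuming a POSIX shell and a made-up sample string (the variable name `result` is only for illustration):

```shell
# The apostrophe is not in the allowed set [a-zA-Z], so sed replaces it with a space.
result=$(printf '%s\n' "don't stop" | sed 's/[^a-zA-Z]/ /g')
printf '%s\n' "$result"
```

This prints `don t stop`, which is the unwanted splitting described above.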
So I want to include ' in my regex. I'm still new to this kind of "coding", if I may call it that, so excuse my newbie mistake or, even better, explain it to me!
sed 's/[^a-zA-Z']/ /g'
This obviously doesn't work.
sed 's/[^a-zA-Z\']/ /g'
However, this doesn't work either. I thought \ escapes the '?
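For context, here is a sketch of what is probably going on, assuming a POSIX-style shell such as bash. Inside single quotes the shell treats a backslash as a literal character, so the ' after it still ends the quoted string and the rest of the script is left unquoted. The two variants below are common workarounds, not the only ones; the sample input and variable names are made up for illustration.

```shell
input="don't stop, it's 4pm"

# Variant 1: double-quote the sed script, so ' is an ordinary character to the shell.
with_double=$(printf '%s\n' "$input" | sed "s/[^a-zA-Z']/ /g")

# Variant 2: stay in single quotes; '\'' closes the string, adds an escaped
# literal ', and reopens the single-quoted string.
with_single=$(printf '%s\n' "$input" | sed 's/[^a-zA-Z'\'']/ /g')

printf '%s\n' "$with_double"
printf '%s\n' "$with_single"
```

Both variants keep the apostrophes and turn the comma and the digit into spaces. Variant 1 is easier to read, but inside double quotes the shell would expand any $ or backticks in the script, which this particular expression does not contain.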