I was wondering if there is an awk/sed regex that I can use to detect any lines that contain non-English characters, for example:

あと1つのスマッシュが見つからへん…もう寝よう
ความหวังดีของคนเรา สุดท้าย
Το θερμόμετρο χτυπάει
LA #MACIF a félicité #DotYou pour le site http://www.roulonspourlavenir.com , un petit compliment dans un monde de brutos... c'est bon ça!!

But since they are Twitter posts, I need to keep lines with links and @ symbols. So I believe what I am really looking for is a way to detect any non-ASCII characters?

Thanks for any help,

Tomek


1 Answer

Does this work?

/[^a-z0-9]/i
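
Note that the pattern above also matches spaces, punctuation, @ and the characters in URLs, so it would flag nearly every tweet. A minimal sketch of the non-ASCII check the question describes, assuming Python rather than awk/sed; the helper names `has_non_ascii` and `keep_ascii_lines` and the sample lines are hypothetical, chosen only for illustration:

```python
import re

# Matches any single character outside the 7-bit ASCII range (U+0000 to U+007F).
NON_ASCII = re.compile(r"[^\x00-\x7F]")


def has_non_ascii(line: str) -> bool:
    """Return True if the line contains at least one non-ASCII character."""
    return NON_ASCII.search(line) is not None


def keep_ascii_lines(lines):
    """Yield only the lines made up entirely of ASCII characters."""
    for line in lines:
        if not has_non_ascii(line):
            yield line


if __name__ == "__main__":
    # Hypothetical sample input: one plain tweet, two lines with non-ASCII text.
    samples = [
        "Check this out @someone http://example.com #tag",
        "Το θερμόμετρο χτυπάει",
        "c'est bon ça!!",
    ]
    for line in keep_ascii_lines(samples):
        print(line)
```

Links, @ symbols, hashtags and ordinary punctuation are all ASCII, so those lines pass through. Accented Latin letters such as "é" or "ç" are not ASCII, so the French example from the question is flagged along with the Japanese, Thai and Greek ones.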
zzzzBov