I need to find abbreviations in text written in many languages. My current regex is:
import regex as re
pattern = re.compile(r'(?:\w\.)+', re.UNICODE | re.MULTILINE | re.DOTALL | re.VERSION1)
pattern.findall("U.S.A. u.s.a.")
I don't want u.s.a. in the result; I need only the uppercase abbreviations. [A-Z] only covers ASCII letters, so it won't work for any language except English.
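
For reference, here is a minimal sketch of one possible direction, not a confirmed solution. It uses only the standard-library `re` module: match any Unicode letter followed by a dot, then keep the matches for which `str.isupper()` is true. The name `find_uppercase_abbreviations` and the Cyrillic sample are made up for illustration. The `regex` module also supports the Unicode property `\p{Lu}` (uppercase letter), which could replace `\w` directly in the pattern.

```python
import re

# Candidate abbreviations: one or more "letter + dot" groups.
# [^\W\d_] is a word character that is neither a digit nor an underscore,
# i.e. any Unicode letter (str patterns are Unicode-aware in Python 3).
CANDIDATE = re.compile(r'(?:[^\W\d_]\.)+')


def find_uppercase_abbreviations(text):
    """Return the candidates whose letters are all uppercase.

    Hypothetical helper for illustration. str.isupper() relies on Unicode
    case data, so it is not limited to A-Z.
    """
    return [m for m in CANDIDATE.findall(text) if m.isupper()]


print(find_uppercase_abbreviations("U.S.A. u.s.a."))
print(find_uppercase_abbreviations("С.Ш.А. с.ш.а."))  # Cyrillic sample
```

Note that a mixed-case candidate such as "U.s.A." is dropped as a whole here, whereas a pattern built on `\p{Lu}` would match its uppercase parts separately.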