I am trying to split a string so that multi-word proper nouns are kept together as a single token. For example, the following code needs to be changed,
import re
s = 'Multi-Criteria Decision Making (MCDM) is increasingly used in RE projects.'
out = re.split(r"\s", s)  # raw string avoids the invalid "\s" escape warning
print(out)
in order to get this desired outcome:
['Multi-Criteria Decision Making', 'MCDM', 'is', 'increasingly', 'used', 'in', 'RE', 'projects']
I have found this, but I am not able to incorporate it into my code correctly.
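To make the goal concrete, here is a rough sketch of the behaviour I am after. It assumes that a "proper noun" can be approximated as a run of consecutive capitalized words separated only by whitespace, which is my own simplification and will not hold in general (for example, a capitalized word at the start of a sentence would be merged with a proper noun that follows it). The names `PROPER_NOUN_OR_WORD` and `split_keep_proper_nouns` are made up for illustration.

```python
import re

# Assumption: a proper noun is a run of capitalized words (hyphens allowed)
# separated only by whitespace. Anything else is matched as a single word.
PROPER_NOUN_OR_WORD = re.compile(
    r"[A-Z][\w-]*(?:\s+[A-Z][\w-]*)*"  # run of capitalized words
    r"|\w[\w-]*"                       # otherwise, one ordinary word
)


def split_keep_proper_nouns(text):
    """Return the tokens of text, keeping capitalized runs together."""
    return PROPER_NOUN_OR_WORD.findall(text)


s = 'Multi-Criteria Decision Making (MCDM) is increasingly used in RE projects.'
print(split_keep_proper_nouns(s))
# ['Multi-Criteria Decision Making', 'MCDM', 'is', 'increasingly', 'used', 'in', 'RE', 'projects']
```

This uses `findall` instead of `split`, so punctuation such as the parentheses and the final period is dropped rather than attached to a token. It works for this sentence only because "RE" happens to be followed by a lowercase word.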
Thanks in advance!