I need to extract the words and phrases within a text. For example, the text is:
Привет, hello, как дела? english word, еще одно русское слово, слово-1224, тест 4456
And script should return the following:
Привет
как
дела
еще
одно
русское
слово
слово-1224
That is, I need to take from the text of all the words that begin with the Russian letters ([а-яА-Яё-]
), and can contain numbers and letters of the Russian alphabet. How is this implemented?