I am working with a PySpark DataFrame that has a column of words (array<string> type). What regex pattern should I use to drop the purely numeric entries and strip the digits out of the remaining words?
Sample input:
+---+----------------------------------------------+
|id |words                                         |
+---+----------------------------------------------+
|564|[fhbgtrj5, 345gjhg, ghth578ghu, 5897, fhrfu44]|
+---+----------------------------------------------+
Expected output:
+---+----------------------------------------------+
|id |words                                         |
+---+----------------------------------------------+
|564|[fhbgtrj, gjhg, ghthghu, fhrfu]               |
+---+----------------------------------------------+
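To make the intended transformation precise, here is a minimal plain-Python sketch of the logic applied to a single row. It is not PySpark code, and the helper name strip_digits is only for illustration: it removes every digit with the pattern \d+ and then drops any word that ends up empty.

```python
import re


def strip_digits(words):
    """Remove all digits from each word and drop words that become empty."""
    # \d+ matches one or more consecutive digits anywhere in the word
    cleaned = (re.sub(r"\d+", "", w) for w in words)
    # purely numeric entries such as "5897" are now empty strings, so filter them out
    return [w for w in cleaned if w]


words = ["fhbgtrj5", "345gjhg", "ghth578ghu", "5897", "fhrfu44"]
print(strip_digits(words))  # ['fhbgtrj', 'gjhg', 'ghthghu', 'fhrfu']
```

Presumably the same pattern could be applied element by element on the array column in PySpark, with a second step to filter out the empty strings, but that is an assumption I have not verified.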
Please help.