How could I use spacy to detect a pattern like:
mygov-24.mygov.ca - last updated: 06/05/21
I want to detect the following patterns:
- mygov-24.mygov.ca - last updated: 06/05/21
- mygov-24.mygov.ca - last updated: 02/04/21
- mygov-24.mygov.ca - last updated: 01/02/21
- ....
As you could see the date changes but everything remains the same. How can I use spacy to create a pattern matcher, that tells if the input string has the same pattern? Also, If the pattern is detected, I want to extract the date. Is that possible with Spacy?
I went through, https://spacy.io/usage/rule-based-matching but not sure where to start.
EDIT Given a group of dynamic phrases as above, is there a way to identify the variables within the phrases?