I have HTML where I need to collect all the content that has a particular format, e.g. get everything that is in the 00.000.000/0000-00
or XX.YYY.IIO/KKKK-LL
formats.
Would use of regular expressions be the best way to accomplish this, or how else can I accomplish this?