I have an array consisting of URLS of the form:
$URLs = @("https://somesite.com/folder1/page/1/"
,"https://somesite.com/folder222/page/1/"
,"https://somesite.com/folder222/page/2/"
,"https://somesite.com/folder444/page/1/"
,"https://somesite.com/folder444/page/3/"
,"https://somesite.com/folderBBB/page/1/"
,"https://somesite.com/folderBBB/page/5/")
They always have /page/1/, I need to add (or reconstruct) all missing URLS from the highest page down to 1 so it ends up like so:
$URLs = @("https://somesite.com/folder1/page/1/"
,"https://somesite.com/folder222/page/1/"
,"https://somesite.com/folder222/page/2/"
,"https://somesite.com/folder444/page/1/"
,"https://somesite.com/folder444/page/2/"
,"https://somesite.com/folder444/page/3/"
,"https://somesite.com/folderBBB/page/1/"
,"https://somesite.com/folderBBB/page/2/"
,"https://somesite.com/folderBBB/page/3/"
,"https://somesite.com/folderBBB/page/4/"
,"https://somesite.com/folderBBB/page/5/")
I'd imagine the Pseudo-Code would be something like:
- For each folder, extract the highest page number:
hxxps://somesite.com/folderBBB/page/5/
Expand this out from (5) to (1)
hxxps://somesite.com/folderBBB/page/1/ hxxps://somesite.com/folderBBB/page/2/ hxxps://somesite.com/folderBBB/page/3/ hxxps://somesite.com/folderBBB/page/4/ hxxps://somesite.com/folderBBB/page/5/
Output this into an array
Any pointers would be welcome!