I have to manually populate a spreadsheet of many webpages. I only have to pick up some details from every page such as its title, description, etc.. Doing this manually is becoming too monotonous and boring so I thought I could semi-automate it using Matlab.
Suppose this is the page as an example: http://www.smythstoys.com/uk/en-gb/video-games-tablets/c-751/xbox-one/p-14141/xbox-one-1tb-console/
I can read this page into matlab using:
page = urlread('..the_webpage..');
This basically copies its source code into a string variable. Viewing the source I can see that the Title is in its <title></title>
tag and so is the description.
Is there any way I can extract these values from the string and into cell arrays. From there I can then move them easily to an excel spreadsheet. I tried using textscan
but it did not work as I cannot tell the delimiter between the values.