Generating URLs from MATLAB

I am trying to write a script that generates URLs from 'aa' to 'zz' and pings the website, returning the <TITLE> of the page if the URL exists.
This is what I have done so far. I have no idea what to do next. Could you please help?
v={'a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'};
subfolder='www';
s=sprintf('GET http://www.%.edu')
s =
'GET http://www.'
s=sprintf('GET http://www.v%.edu')
s =
'GET http://www.v'

Accepted Answer

Image Analyst on 17 Oct 2020
I imagine you'd strip off the "GET " and call webread() with the URL.
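v = {'a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'};
for k = 1 : length(v)
    % Note: v{k} is used for both letters, so this visits only aa, bb, ..., zz.
    url = sprintf('http://www.%s%s.edu', v{k}, v{k});
    fprintf('Now reading %s...\n', url);
    % webread() throws an error if the site does not exist or does not respond.
    pageContents = webread(url);
end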
That's untested. Please look up webread() in the help for the correct way to use it.
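For reference, here is a minimal sketch of one way to call webread() with explicit options. The 10-second timeout and the 'text' content type are my own assumptions, not something the question requires, and www.mathworks.com is only a stand-in URL for illustration.

```matlab
% Assumed settings: give up after 10 seconds and return the page as text.
opts = weboptions('Timeout', 10, 'ContentType', 'text');
url = 'http://www.mathworks.com';   % stand-in URL, for illustration only
try
    pageContents = webread(url, opts);   % HTML source of the page
    fprintf('Read %d characters from %s\n', numel(pageContents), url);
catch ME
    % webread throws if the host cannot be resolved or does not respond in time.
    fprintf('Could not read %s: %s\n', url, ME.message);
end
```

Wrapping the call in try/catch keeps a script running when a URL turns out not to exist, which is likely to be the common case when guessing domain names.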
Comments
Image Analyst on 17 Oct 2020
Edited: Image Analyst on 17 Oct 2020
Try it this way:
v = {'a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'};
z = v;
for k = 1 : length(v)
    for m = 1 : length(z)
        url = sprintf('http://www.%s%s.edu', v{k}, z{m});
        fprintf('Now reading %s...\n', url);
        try
            pageContents = webread(url);
        catch ME
            % Goes here if web site does not exist, then continues with the loop.
            fprintf(' Error : the web site "%s" does not exist or did not respond.\n', url);
        end
    end
end
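The loop above stores the page in pageContents but does not yet pull out the <TITLE> that the question asks for. Here is a minimal sketch of that step, assuming pageContents is a character vector of HTML. The sample HTML string and the variable names tok and pageTitle are made up for illustration, and a regular expression is only a rough way to read HTML, not a full parser.

```matlab
% Sample HTML standing in for the pageContents returned by webread().
pageContents = sprintf('<html><head>\n<TITLE> Example University </TITLE>\n</head><body></body></html>');

% Capture the text between <title ...> and </title>, ignoring letter case.
% (.*?) is a lazy match, so it stops at the first closing tag.
tok = regexpi(pageContents, '<title[^>]*>(.*?)</title>', 'tokens', 'once');

if isempty(tok)
    pageTitle = '';                 % no <title> element found
else
    pageTitle = strtrim(tok{1});    % drop surrounding whitespace
end
fprintf('Title: "%s"\n', pageTitle);
```

In the nested loop, the regexpi() call and the fprintf() would go inside the try block, right after the webread() line, so the title is printed only for URLs that actually responded.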
Ntombikayise Bhengu on 17 Oct 2020
Thank you so much. It works. I really appreciate you taking the time to help me.

Release

R2020a
