extract dates from yahoo finance - please help!
I am trying to "scrape" stock data from yahoo finance. Although I am successful in extracting the stock prices, I am having difficulties in extracting the dates. I am using the following command without success:
dateField=regexp(historicalPriceFile, '<td class="yfnc_tabledata1" nowrap align="right">([\d\w-]+)</td>', 'tokens');
Please help!
Thank you in advance,
Chris
7 Comments
Walter Roberson
on 24 Dec 2012
Could you give an example line to be matched?
Side note: \w includes digits, so [\d\w-] can be simplified to [\w-]
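A quick way to check the side note, as a minimal sketch: the test string below is made up for illustration and is not taken from the Yahoo page.

```matlab
% Hypothetical test string containing letters, digits, an underscore and a dash.
testStr = 'abc-123_x';

% \w already covers letters, digits and underscore, so \d is redundant here.
withDigitClass    = regexp(testStr, '[\d\w-]+', 'match');
withoutDigitClass = regexp(testStr, '[\w-]+',   'match');

% Both patterns should return the same matches.
sameResult = isequal(withDigitClass, withoutDigitClass);
disp(sameResult)
```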
Chris
on 24 Dec 2012
Edited: Walter Roberson
on 24 Dec 2012
Walter Roberson
on 24 Dec 2012
Edited: Walter Roberson
on 24 Dec 2012
The date you show has spaces and commas, which [\d\w-] does not include. The date does not appear to contain any dash. Perhaps try [\w\s,]
Chris
on 24 Dec 2012
Walter Roberson
on 24 Dec 2012
[\w\s,] works for me.
Your sample date has no '-' in it, so no point having the '-' in the []. Your sample date has a space and comma in it, which are not matched by [\w-]
Chris
on 24 Dec 2012
Chris
on 25 Dec 2012
Answers (1)
Walter Roberson
on 25 Dec 2012
dateField=regexp(historicalPriceFile, '<td class="yfnc_tabledata1" nowrap align="right">([\w\s,]+)</td>', 'tokens');
... as discussed above.
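For anyone who wants to see the difference between the two patterns, here is a minimal sketch. The HTML fragment and the date format 'Dec 21, 2012' are assumptions made for illustration (the real page may differ), and the conversion with datenum is an optional extra step that was not part of the original question.

```matlab
% Hypothetical fragment standing in for historicalPriceFile.
% The date text and the second (price) cell are assumed, not copied from Yahoo.
historicalPriceFile = ['<td class="yfnc_tabledata1" nowrap align="right">Dec 21, 2012</td>' ...
                       '<td class="yfnc_tabledata1" align="right">519.33</td>'];

% Original pattern: [\d\w-] allows no space or comma, so the date cell is not matched.
oldPattern = '<td class="yfnc_tabledata1" nowrap align="right">([\d\w-]+)</td>';
oldTokens  = regexp(historicalPriceFile, oldPattern, 'tokens');

% Revised pattern: [\w\s,] allows word characters, whitespace and commas.
newPattern = '<td class="yfnc_tabledata1" nowrap align="right">([\w\s,]+)</td>';
dateField  = regexp(historicalPriceFile, newPattern, 'tokens');

% 'tokens' returns a cell array of 1x1 cells; unwrap them into plain strings.
dateStrings = cellfun(@(c) c{1}, dateField, 'UniformOutput', false);

% Optional: convert to serial date numbers, assuming the 'mmm dd, yyyy' layout.
dateNums = datenum(dateStrings, 'mmm dd, yyyy');
```

If the dates on the actual page use a different layout, the format string passed to datenum would need to change accordingly.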