Regexp to pull numerics only

33 views (last 30 days)
Stephen
Stephen on 3 Jul 2019
Commented: Stephen on 4 Jul 2019
I'm using regexp to pull a version number from a text file:
returned = string(regexp (filetext, '(?<=Version_String )[A-Z]*\d*.\d*.\d*.\d*', 'match'))
This returns a string
"YZ1.1.2.3"
which is good.
When I try and extract just the numbers and decimal point from this string with
version = regexp (returned, '\d*.\d*.\d*.\d*', 'match')
I get:
version = 1x2 string array "YZ1.1" ".2.3"
which is not what I was expecting. I was hoping for 1.1.2.3
Can someone explain why this is returned and possibly how to fix it please?
edit: Apologies to all, I missed the decimal points after the \d, I have corrected. I wrote this on my phone in my lunch break!
  1 Comment
Stephen23
Stephen23 on 3 Jul 2019
Edited: Stephen23 on 3 Jul 2019
I don't see anything in any of your regular expressions that would match the points.
When I test your first example, I do not get the result that you have shown us:
>> S = 'A B C Version_String YZ1.1.2.3 X Y Z';
>> regexp(S, '(?<=Version_String )[A-Z]*\d*\d*\d*\d*', 'match')
ans =
'YZ1'
which is exactly what I would expect (as the matching fails at the first dot character).

Sign in to comment.

Accepted Answer

Guillaume
Guillaume on 3 Jul 2019
  • \d*\d*\d*\d* is exactly the same as \d*, match 0 or more digits. It probably would be better as \d+, match 1 or more digits
  • That regular expression cannot return anything with letters or dots in it, so it's clearly not what you are using.
  1 Comment
Stephen
Stephen on 4 Jul 2019
Yes, thank you, disregarding the errors I made when writing up the question, the problem was using * instead of +

Sign in to comment.

More Answers (0)

Tags

Products


Release

R2019a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!