How do I interpolate 1D-data if I do not have unique values?

Question

1 vote

I have various size distributions in the same format with fixed size classes and sometimes I have multiple cummulative_share value of 0. Than I can not use interp1 to interpolate the size at 50% share. How may I fix that issue? Thanks for any help.

Example:

>> size_classes= [0.1 ; 0.2 ; 0.3 ; 0.4]

cummulative_share= [0 ; 0 ; 10; 100]

d50=interp1(cummulative_share, size_classes, 0.5)

size_classes =

cummulative_share =

Error using griddedInterpolant The grid vectors must contain unique points.

Error in interp1 (line 161) F = griddedInterpolant(X,V,method);

1 Comment
Show -1 older comments Hide -1 older comments

Kien Pham on 15 Jan 2018

Thank you! Your method seems to work well for my case. Cheers.

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Star Strider on 23 Jan 2017

Open in MATLAB Online

2 votes

The usual way to deal with that is to create an increasing vector of eps values and add it to the vector with the duplicates.

This assignment:

cummulative_share = cumsum(ones(size(cummulative_share)))*eps + cummulative_share;

will do exactly that.

Example:

size_classes= [0.1 ; 0.2 ; 0.3 ; 0.4];
cummulative_share= [0 ; 0 ; 10; 100];
cummulative_share = cumsum(ones(size(cummulative_share)))*eps + cummulative_share;
d50=interp1(cummulative_share, size_classes, 0.5)
d50 =
   205.0000e-003

7 Comments
Show 5 older comments Hide 5 older comments

Star Strider on 24 Jan 2017

Open in MATLAB Online

My approach should work the same way with your repeated values of 100, although you may have to increase the offset value.

This appears to work:

v = [0; 0; 0; 100; 100; 100; 100]                       % Initial Repeated Vector
ve = cumsum(ones(size(v))).*v*eps                       % Scaled Offset For Non-Zero Elements
ve = ve + cumsum(ones(size(v))).*(v==0)*eps             % Add Scaled Offset For Zero Elements
vi = v + ve                                             % Interpolation Vector
test1 = vi - v                                          % Check Result (Delete)
test2 = diff(vi)                                        % Check Incremental Increase (Delete)

The idea is to add an ‘infinitesimal’ incremental offset. The constraint is that ‘infinitesimal’ has to be within the range of double-precision floating-point addition, and has to work for both zero and non-zero repeated values. It should not affect the interpolation results, since few (if any) real-world data will have that level of precision. The offset addition could be created as a function file (with ‘v’ and the input and ‘vi’ as the output) if you have this problem frequently with your data.

I kept the two ‘test’ assignments in my posted code to demonstrate the incremental addition. Delete them in the code you use.

Also, the offset values are small enough (by design in my approach here) that it will not be noticeable with most format settings.

Star Strider on 24 Jan 2017

@John — I don’t doubt that you’re correct. I know of no other way to approach this problem.

Grzegorz Lippe on 27 Apr 2021

Open in MATLAB Online

No, I think he's right, one eps is to small and will get round off with bigger values:

>> (10+eps)==10
ans =
  logical
   1

I used 1e-9 in my use case, but that only works with numbers < 1000. The chosen magnitude should not be more than 15 times smaller, because that is roundabout the precision with double floats.

Sign in to comment.

Answer 2

John D'Errico on 24 Jan 2017

6 votes

When you have duplicates, so multiple x's with different values for y, interpolation makes no sense since an interpolant requires that it returns the value of y at the given x. If you have multiple y's there, then what do you return?

Even if all of the y values at that point are the same, interpolation tools will not be easily able to diagnose this fact. So they get upset at you.

If you add eps to the x's to make them distinct, then interpolation will be numerically difficult. There will be nasty things that happen. Worse, if you are using a spline method to interpolate, expect insanity, because the spline interpolant will be wildly oscillatory. Adding eps is a BAD idea. Sorry, but it is.

Instead, recognize that if you have multiple y values for a given x, that interpolation is impossible. Instead, just form the average value of y for any reps. Replace the multiple reps with a SINGLE value, with the average y at that point. This is now fully consistent with interpolation. It is as good as you can do. In fact, it is this scheme that is usually recommended by those who do interpolation. (Spoken as a person who worked with interpolation tools for 29 years, and was the go-to person for this class of problems, tasked with providing those tools to a large corporation that used interpolation extensively.)

I've provided a tool ( consolidator , found on the file exchange) that does exactly what you need, replacing all replicates with a single point.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

How do I interpolate 1D-data if I do not have unique values?

1 Comment
Show -1 older comments Hide -1 older comments

Accepted Answer

7 Comments
Show 5 older comments Hide 5 older comments

More Answers (1)

0 Comments
Show -2 older comments Hide -2 older comments

Categories

Tags

Community Treasure Hunt

How do I interpolate 1D-data if I do not have unique values?

1 Comment Show -1 older comments Hide -1 older comments

Accepted Answer

7 Comments Show 5 older comments Hide 5 older comments

More Answers (1)

0 Comments Show -2 older comments Hide -2 older comments

Categories

Tags

See Also

Community Treasure Hunt

1 Comment
Show -1 older comments Hide -1 older comments

7 Comments
Show 5 older comments Hide 5 older comments

0 Comments
Show -2 older comments Hide -2 older comments