divide the matrix (Rx2) into submatrices based on the values of the second column

Question

Alberto Acri on 20 Sep 2023

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/2023557-divide-the-matrix-rx2-into-submatrices-based-on-the-values-of-the-second-column

Commented: Dyuman Joshi on 22 Sep 2023

Accepted Answer: Dyuman Joshi

matrix_out.mat

Open in MATLAB Online

HI! I tried to split the 'matrix_out' matrix into submatrices with steps of 0.1 and for the most part I succeeded.

load matrix_out
% =======
matrix_out_0 = matrix_out(matrix_out(:,2) < 0.1, :);
tot_percent_matrix_out_0 = sum(matrix_out_0(:,2));
matrix_separation_0 = [{matrix_out_0}, tot_percent_matrix_out_0];
% =======
matrix_separation = {};
j = 0.1:0.1:1.2;
for K = 1:width(j)
    matrix_out_new = matrix_out((matrix_out(:,2) >= j(K) & matrix_out(:,2) < (0.1*K)+0.1), :);
    tot_percent_matrix_out_new = sum(matrix_out_new(:,2));
    matrix_separation = [matrix_separation; {matrix_out_new},tot_percent_matrix_out_new];
end
matrix_separation = [matrix_separation_0 ; matrix_separation]
matrix_separation = 13×2 cell array
    {31×2 double}    {[ 0.5500]}
    { 6×2 double}    {[ 0.9100]}
    {69×2 double}    {[17.9400]}
    {33×2 double}    {[11.3900]}
    {13×2 double}    {[ 5.7800]}
    {10×2 double}    {[ 5.5900]}
    {10×2 double}    {[ 6.4000]}
    { 8×2 double}    {[      6]}
    { 6×2 double}    {[ 5.1300]}
    {11×2 double}    {[10.4500]}
    {11×2 double}    {[11.4000]}
    {14×2 double}    {[15.9400]}
    { 3×2 double}    {[ 3.6900]}

In the code, however, I noticed that the value 423|1.2 is found both in the penultimate and in the last cell inside 'matrix_separation'.

The value 423|1.2 should only appear in the last cell given the range >=1.2 & <1.3! Thanks to whoever solves this doubt...

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Dyuman Joshi on 20 Sep 2023

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/2023557-divide-the-matrix-rx2-into-submatrices-based-on-the-values-of-the-second-column#answer_1314297

Edited: Dyuman Joshi on 20 Sep 2023

Open in MATLAB Online

matrix_out.mat

discretize and splitapply for the win!

load matrix_out
%Mention the bins to group data in
j = [0 0.1:0.1:1.2 Inf];
%Discretize the data
idx = discretize(matrix_out(:,2),j);
%Split the array according to the groups
out1 = splitapply(@(x) {x}, matrix_out, idx)
out1 = 13×1 cell array
    {31×2 double}
    { 6×2 double}
    {69×2 double}
    {33×2 double}
    {13×2 double}
    {10×2 double}
    {10×2 double}
    { 8×2 double}
    { 6×2 double}
    {11×2 double}
    {11×2 double}
    {13×2 double}
    { 3×2 double}

You can see above that the 2nd last group is 13x2 instead of 14x2. The sum obtained will be modified accordingly as well.

%Get the sum of the 2nd column according to the groups
out2 = splitapply(@(x) sum(x), matrix_out(:,2), idx)
out2 = 13×1
    0.5500
    0.9100
   17.9400
   11.3900
    5.7800
    5.5900
    6.4000
    6.0000
    5.1300
   10.4500
%Concatenate to get the final output
out = [out1 num2cell(out2)]
out = 13×2 cell array
    {31×2 double}    {[ 0.5500]}
    { 6×2 double}    {[ 0.9100]}
    {69×2 double}    {[17.9400]}
    {33×2 double}    {[11.3900]}
    {13×2 double}    {[ 5.7800]}
    {10×2 double}    {[ 5.5900]}
    {10×2 double}    {[ 6.4000]}
    { 8×2 double}    {[      6]}
    { 6×2 double}    {[ 5.1300]}
    {11×2 double}    {[10.4500]}
    {11×2 double}    {[11.4000]}
    {13×2 double}    {[14.7400]}
    { 3×2 double}    {[ 3.6900]}

3 Comments
Show 1 older commentHide 1 older comment

Alberto Acri on 21 Sep 2023

Edited: Alberto Acri on 21 Sep 2023

Open in MATLAB Online

Hi @Dyuman Joshi!

I was checking your code.

I noticed that in out{3,1} there are values (in the second column) between 0.2 and 0.3 (0.2 and 0.3 inclusive).

p.s. It didn't happen on the other matrices because there weren't the most extreme values.

I need to have intervals in the following way:

<0.10
>=0.10 & <0.20
>=0.20 & <0.30
>=0.30 & <0.40
...

Can you modify the code you provided me?

Dyuman Joshi on 22 Sep 2023

Open in MATLAB Online

matrix_out.mat

What you are seeing is the limitation of floating point numbers.

load matrix_out
%Mention the bins to group data in
j = [0 0.1:0.1:1.2 Inf];
%% Let's see what the data is stored as
%First the matrix values
%Displayed value
disp(matrix_out(10:15,2))
    0.0100
    0.0100
    0.2200
    0.3000
    0.2600
    0.2400
%Stored value
fprintf('%0.42f\n',matrix_out(10:15,2))
0.010000000000000000208166817117216851329431
0.010000000000000000208166817117216851329431
0.220000000000000001110223024625156540423632
0.299999999999999988897769753748434595763683
0.260000000000000008881784197001252323389053
0.239999999999999991118215802998747676610947
%Now the values of the groups
%Displayed value
disp(j')
         0
    0.1000
    0.2000
    0.3000
    0.4000
    0.5000
    0.6000
    0.7000
    0.8000
    0.9000
    1.0000
    1.1000
    1.2000
       Inf
%Stored values
fprintf('%0.42f\n',j)
0.000000000000000000000000000000000000000000
0.100000000000000005551115123125782702118158
0.200000000000000011102230246251565404236317
0.300000000000000044408920985006261616945267
0.400000000000000022204460492503130808472633
0.500000000000000000000000000000000000000000
0.599999999999999977795539507496869191527367
0.699999999999999955591079014993738383054733
0.799999999999999933386618522490607574582100
0.899999999999999911182158029987476766109467
1.000000000000000000000000000000000000000000
1.099999999999999866773237044981215149164200
1.199999999999999955591079014993738383054733
Inf

You can see that the values are not exactly 0.1, 0.2, 0.3 etc. The only values that are stored exactly as their decimal representation are the powers of 2 (0.5 = 2^-1, 1 = 2^0).

This means there will be some errors while working with floating point numbers.

So, what to do now? There is a workaround - Scale up the data to integers and operate.

As the data in the 2nd column of the matrix_out have values upto the 2nd digit after the decimal, so scale up by a factor of 10^2.

%Scale up by a factor of 100
%Scaling the data
vec = floor(matrix_out(:,2)*100);
%Scaling the bins 
j = [0 10:10:120 Inf]; 
%Discretize the data according to the scaled values
idx = discretize(vec,j);
%Split the array according to the groups
out = splitapply(@(x) {x}, matrix_out, idx)
out = 13×1 cell array
    {31×2 double}
    { 6×2 double}
    {62×2 double}
    {40×2 double}
    {13×2 double}
    {10×2 double}
    {10×2 double}
    { 8×2 double}
    { 6×2 double}
    {11×2 double}
    {11×2 double}
    {13×2 double}
    { 3×2 double}
disp(out{3,1})
0000    0.2200
0000    0.2600
0000    0.2400
0000    0.2400
0000    0.2000
0000    0.2500
0000    0.2600
0000    0.2900
0000    0.2700
0000    0.2200
0000    0.2500
0000    0.2100
0000    0.2700
0000    0.2200
0000    0.2300
0000    0.2600
0000    0.2900
0000    0.2700
0000    0.2200
0000    0.2400
0000    0.2300
0000    0.2500
0000    0.2600
0000    0.2400
0000    0.2500
0000    0.2600
0000    0.2400
0000    0.2500
0000    0.2900
0000    0.2600
0000    0.2500
0000    0.2600
0000    0.2800
0000    0.2600
0000    0.2400
0000    0.2700
0000    0.2500
0000    0.2600
0000    0.2500
0000    0.2800
0000    0.2500
0000    0.2500
0000    0.2500
0000    0.2600
0000    0.2700
0000    0.2600
0000    0.2600
0000    0.2900
0000    0.2100
0000    0.2600
0000    0.2700
0000    0.2900
0000    0.2900
0000    0.2700
0000    0.2600
0000    0.2500
0000    0.2900
0000    0.2900
0000    0.2800
0000    0.2800
0000    0.2600
0000    0.2100

Sign in to comment.

divide the matrix (Rx2) into submatrices based on the values of the second column

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

3 Comments
Show 1 older commentHide 1 older comment

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

divide the matrix (Rx2) into submatrices based on the values ​​of the second column

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

3 Comments Show 1 older commentHide 1 older comment

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

divide the matrix (Rx2) into submatrices based on the values of the second column

0 Comments
Show -2 older commentsHide -2 older comments

3 Comments
Show 1 older commentHide 1 older comment