Faster alternative to mode?

Question

1 vote

Simple as that: in my code I first determine the neighbourhood of a voxel and then find the most frequent value in that neighbourhood. I do this for millions of voxels, so the computation time adds up. I expected this to take quite some time, but my entire code has now been running for about 2.5 days with no end in sight, and I've determined that the bottleneck is this section of my code. According to the profiler, the function "mode" takes up almost 50% of the computation time of this function, and about 25% of my entire code.

My method of determining the neighbourhood is the next slowest part of the code using: neighbourhood = im(i-1:i+1,j-1:j+1,k-1:k+1);

Does anyone have a good alternative to "mode" that I can use in this situation? Any suggestions for a more efficient way of finding the neighbourhood of the voxels would also be appreciated.

Thanks!

for i = 1:size(lcoords,1)
    %Get neighbourhood of voxel i from list of image voxel subscripts
    neighbourhood = im(lcoords(i,1)-1:lcoords(i,1)+1,lcoords(i,2)-1:lcoords(i,2)+1, lcoords(i,3)-1:lcoords(i,3)+1);
    neighbourhood(neighbourhood == imcoords(i,2)) = []; %Ignore parts of the neighbourhood that have same value as current voxel
    if ~isempty(neighbourhood) %If the voxel is surrounded by any values not including its own region, find the mode of the neighbourhood
        imcoords(i,3) = mode(neighbourhood(:)); %record the mode
    end
end

darova on 7 Apr 2020

Example:

tic
for i = 1:1e6
    a = ones(5);
    a(3) = [];    
end
toc
tic
for i = 1:1e6
    a = ones(5);
    a(3) = 0;
end
toc

Because of re-sizing the arrays on each iteration.

Eric Chadwick on 7 Apr 2020

Thanks, I will change those lines, but the real bottleneck is the mode function. That's what I would like to simplify, as it takes up 50% of the time in my function.
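
For a single small neighbourhood, one possible replacement for the built-in is a sort followed by a run-length count. The sketch below is only an illustration: smallMode is a hypothetical helper name that does not appear in the thread, it assumes a non-empty numeric input without NaNs, and whether it actually beats mode depends on the MATLAB release and the data, so it should be profiled before being adopted. Like mode, it returns the smallest value when there is a tie.

```matlab
% Hypothetical helper (not from the thread): mode of a small numeric
% vector via sort and run lengths. Assumes v is non-empty and has no NaNs.
function m = smallMode(v)
    v = sort(v(:));                                 % equal values become adjacent
    isStart = [true; diff(v) ~= 0];                 % marks the first element of each run
    runLen = diff([find(isStart); numel(v) + 1]);   % length of each run
    [~, k] = max(runLen);                           % first (smallest-valued) longest run
    vals = v(isStart);                              % distinct values, ascending
    m = vals(k);
end
```

In the loop from the question this would be called as imcoords(i,3) = smallMode(neighbourhood); in place of the call to mode.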

Answer 1

Matt J on 7 Apr 2020

Edited: Matt J on 7 Apr 2020

1 vote

Yes, actually there are no zeros in this matrix. -1s could be changed to zeros as they represent void space that I am uninterested in. However the point of the code is to identify voxels that touch void space (at least one -1) and determine which value that voxel is touching the most.

Starting another answer for this line of solution. I have a class on the File Exchange that can store 3D arrays in sparse form.

https://www.mathworks.com/matlabcentral/fileexchange/29832-n-dimensional-sparse-arrays

Not only could this save you a lot of memory, but it sounds like at least some of what you're trying to do can be done via sparse 3D convolution with a 3x3x3 kernel of ones. This can be done using the convn method of the class. For example, below I create a 3D binary image A with the same dimensions as yours. It contains about 1.7 million non-zeros and consumes 26 MB. Only one of the 1's is not touching any zeros. Using convolution, the code calculates a separate binary matrix B with all elements not touching any zeros removed. Since there is only one such element, B contains only 1 fewer element than A:

>> A=spfun(@ceil, ndSparse.sprand([2854,2906,2013],1e-4)  );
>> A(1:3,1:3,1:3)=1;
>> 
>> whos A
  Name   Size               Kilobytes     Class      Attributes
                                                               
  A      2854x2906x2013         26102     ndSparse             
                                                               
>> 
>> tic; B=A-(convn(A,ones(3,3,3),'same')>26.999); toc; 
Elapsed time is 12.614820 seconds.
>> 
>> nnz(A)
ans =
     1669474
>> nnz(B)
ans =
     1669473

Eric Chadwick on 9 Apr 2020

Okay great, so that worked for every pixel in the image. Now, to get the mode of the neighbourhood for only the pixels on the border, I combined your two answers (see code). However, I am still not getting the same results as my original method (which I have validated to be correct). I believe the problem is that the first loop, which produces A and should only contain border elements, seems to contain a double layer of elements where the border is between a phase and void space. As a result, the majority of these border elements have a neighbourhood mode equal to the phase they are a part of.

Any idea what would cause this "double layer"?

Also the second loop takes nearly a minute to complete on an image of size 502x445x320 that I am using for testing. This is much longer than my previous method. Do you think this will increase significantly with a larger image like my original code?

[di,dj,dk]=ndgrid(-1:+1);
im(im==-1) = 0;
im = ndSparse(im);
A = ndSparse.spalloc(size(im),4*nnz(coords));
tic
for n = 1:27
    A = A + abs(im-circshift(im,[di(n),dj(n),dk(n)]));
end
toc
tic
Ishifts=repmat(im,1,1,1,27);
for n=1:27
    
   Ishifts(:,:,:,n)=circshift(im,[di(n),dj(n),dk(n)]); 
    
end
result=mode( sparse2d(Ishifts)  , 2);
result=ndSparse(result, size(im));
toc
ccoords(:,1) = find(A);
ccoords(:,2) = im(ccoords(:,1));
ccoords(:,3) = result(ccoords(:,1));

Eric Chadwick on 15 Apr 2020

A finished and had 205,097,786 non-zeros

Matt J on 15 Apr 2020

Ah well. Sadly, it's not nearly as sparse as I'd hoped and not enough for you to get an advantage out of ndSparse representation. The mode calculation is going to allocate at least 130 GB with that many non-zeros.

I would say your best bet is to do the processing in chunks like I recommended in my first answer. What I envision would be to process im(:,:,1:100), then im(:,:,100:199), then im(:,:,199:298) and so on. The overlap is deliberate, so that every 3x3x3 neighborhood is captured in the partitioning.
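
A minimal sketch of such chunked processing is given below, reusing the circshift and mode approach from the thread on one block of slices at a time. The volume, the chunkDepth value and all variable names are assumptions made for illustration. This sketch pads each block with one extra slice on either side (so consecutive blocks share two slices), which lets every slice that is written back see both of its z-neighbours. The outermost slices of the whole volume still depend on how the boundary is handled, because circshift wraps around within each block.

```matlab
% Hypothetical sizes and data, for illustration only.
im = randi(3, 6, 6, 25);
chunkDepth = 10;                 % slices written back per chunk (assumed value)
nz = size(im, 3);
result = zeros(size(im));
[di, dj, dk] = ndgrid(-1:1);

for z0 = 1:chunkDepth:nz
    z1 = min(z0 + chunkDepth - 1, nz);
    lo = max(z0 - 1, 1);         % one-slice halo below, if available
    hi = min(z1 + 1, nz);        % one-slice halo above, if available
    block = im(:, :, lo:hi);

    % Stack the 27 shifted copies of this block only, not of the whole volume.
    shifts = zeros([size(block), 27]);
    for n = 1:27
        shifts(:, :, :, n) = circshift(block, [di(n), dj(n), dk(n)]);
    end
    blockMode = mode(shifts, 4);

    % Keep only slices z0:z1; the halo slices are handled by other chunks.
    result(:, :, z0:z1) = blockMode(:, :, (z0:z1) - lo + 1);
end
```

Peak memory is then governed by 27 copies of one block rather than 27 copies of the full image, at the cost of recomputing the shared slices.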

Answer 2

Matt J on 6 Apr 2020

Edited: Matt J on 6 Apr 2020

0 votes

It would be better not to implement the mode calculation repeatedly in a loop over the voxels. If you have enough RAM to hold 27 copies of your image volume, then this is one way to replace the loop with a vectorized calculation:

[di,dj,dk]=ndgrid(-1:+1);
Ishifts=repmat(im,1,1,1,27);
for n=1:27
    
   Ishifts(:,:,:,n)=circshift(im,[di(n),dj(n),dk(n)]); 
    
end
result=mode( Ishifts  , 4);
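
The loop in the question also discards neighbours that have the same value as the centre voxel. One way this might be folded into the vectorized version is to overwrite those entries with NaN before calling mode, which skips NaN values. The sketch below is an assumption-laden illustration, not the poster's code: it uses a tiny synthetic volume, requires a floating-point image so that NaN can be stored, and inherits the wrap-around of circshift at the image edges.

```matlab
% Small synthetic label volume standing in for the real image (assumption).
im = ones(4, 4, 4);
im(:, :, 3:4) = 2;
im(2, 2, 2) = 7;

[di, dj, dk] = ndgrid(-1:1);
Ishifts = zeros([size(im), 27]);
for n = 1:27
    Ishifts(:, :, :, n) = circshift(im, [di(n), dj(n), dk(n)]);
end

% Blank out neighbours that carry the centre voxel's own label.
% The comparison relies on implicit expansion (R2016b or later).
Ishifts(Ishifts == im) = NaN;

% mode skips NaN entries, so only differing neighbours are counted.
result = mode(Ishifts, 4);
```

A voxel whose entire neighbourhood matches its own value has nothing left to count and is expected to come back as NaN, which plays the role of the isempty check in the original loop.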

Matt J on 7 Apr 2020

The 32-bit image has replaced the formerly binary values of the image with numbers ranging from -1 to greater than 255.

Is there any sparsity that you can take advantage of? Does the image contain mostly zeros?

Eric Chadwick on 7 Apr 2020

Yes, actually there are no zeros in this matrix. -1s could be changed to zeros as they represent void space that I am uninterested in. However the point of the code is to identify voxels that touch void space (at least one -1) and determine which value that voxel is touching the most.

For example, if a voxel of value 1 touches at least one -1, I am interested in it and will then determine whether it is touching mostly other voxels of another positive value or mostly -1s.
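
The selection step described here (non-void voxels with at least one -1 in their 3x3x3 neighbourhood) can be sketched with the built-in convn on a logical void mask. This is a minimal illustration on a made-up volume, and the variable names are assumptions. With the 'same' option the volume is zero-padded, so positions outside the image are treated as non-void.

```matlab
% Tiny synthetic volume (assumption): label 1 everywhere, one void voxel.
im = ones(5, 5, 5);
im(1, 1, 1) = -1;

isVoid = (im == -1);
% Count the void voxels inside each 3x3x3 neighbourhood.
voidCount = convn(double(isVoid), ones(3, 3, 3), 'same');
% Non-void voxels with at least one void neighbour.
touchesVoid = ~isVoid & (voidCount > 0.5);
idx = find(touchesVoid);   % linear indices of the voxels of interest
```

The mode calculation then only needs to run on the voxels listed in idx rather than on the whole image.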

I have not worked with sparse matrices very much. What is your idea?
