Community Profile

Joss Knight

MathWorks

Last seen: 6 days ago | Active since 2013

Although I cannot be contacted directly, if you would like to ask me a question all you have to do is mention "GPU" somewhere in your MATLAB Answers question.

Statistics

  • 36 Month Streak
  • Knowledgeable Level 5
  • Pro
  • Revival Level 2
  • First Answer

Content Feed

Answered
Why is my GPU code faster with the profiler on in RTX GPUs?
This is due to an optimization which is not performing ideally under memory pressure. If you reduce the size of your input you'l...

7 days ago | 0

Answered
Conflicting behaviour of arrayfun() with gpu: example that works and example of error
The function normcdf isn't supported by GPU arrayfun because it accepts varargin. For a list of supported functions see the docu...

23 days ago | 0 | accepted

Answered
How to initialize a string variable, and pass it to the matlab function using GPU coder
MATLAB and Simulink code generation do not currently support string. Edit: Sorry, my bad, it does support scalar strings, but n...

23 days ago | 0 | accepted

Answered
need to plot the accuracy vs epoch graph
Add Plots="training-progress" to your training options. FWIW, you shouldn't use ReadFcn for resizing images, it dramatically sl...

23 days ago | 1

Answered
Update BatchNorm Layer State in Siamese network with custom loop for triplet and contrastive loss
Interesting question! The purpose of batch norm state is to collect statistics about typical inputs. In a normal Siamese workflo...

1 month ago | 0 | accepted

Answered
GPU arrayfun doesn't support linspace or NaN array
You cannot create an array inside a call to GPU arrayfun, only scalars.

1 month ago | 0

Answered
GPU Support for RTX 4090
Forgive me for needing to correct Walter, but the last three versions of MATLAB _will_ natively support the 4000 series because,...

1 month ago | 1

Answered
mexcuda gives unsupported GNU version error
R2022a uses CUDA 11.2, not 11.7. I suspect that the actual compiler that ends up being used is the version of nvcc shipped with ...

3 months ago | 0 | accepted

Answered
GPU speed up for pcg() is disappointing
I'm guessing LL' is extremely dense, which will explain why the solver stalls. On the GPU the preconditioning is (currently) per...

3 months ago | 0 | accepted

Answered
How to implement Siamese network with the two subnetworks not share weights
You can try gathering the weights back from each network after you've used it, as in net = dlupdate(@gather,net). This should sa...

3 months ago | 0

Answered
Speed up inference or/and training of a 3D deep neural network (U-net) for a regression task
Have you tried using dlaccelerate? As well as ensuring any Custom Layers are using the Acceleratable mixin?

3 months ago | 1 | accepted

Answered
Matrix multiplication optimization using GPU parallel computation
The Windows Task Manager lets you track GPU utilization and memory graphically, and the utility nvidia-smi lets you do it in a t...

4 months ago | 1

Answered
How to increase MiniBatchSize
It depends on what you're doing. Some ideas: * Get a new GPU with more memory * Use a smaller model * If your model accepts...

4 months ago | 0

Answered
Matlab trainNetwork CNN training pauses iterating intermittently at random then continues
Is the pause associated with a validation measurement being added to the training plot? With 7 times as much validation data it ...

4 months ago | 0

Answered
problems with @arrayfun on GPU
This is a bug. I have reported it. Thanks for finding it! In the meantime, you can work around the issue by using a local funct...

4 months ago | 0 | accepted

Answered
A problem when using "multi-gpu" as "ExecutionEnvironment" for training a CNN
Most likely this is this issue, which is fixed in the latest update to R2022a. You can also try downgrading your GPU drivers.

4 months ago | 0 | accepted

Answered
Perform mldivide between 3x3 matrix M and every RGB pixel in a image in GPU
I feel like I'm missing something - this is just a single backslash with multiple right-hand sides, or to avoid permutation a si...

5 months ago | 1

Answered
Library not loaded: @rpath/libcudart.10.2.dylib
This problem should now be fixed at Apple, please reboot and report here if you are still experiencing issues.

5 months ago | 0

Answered
Warning: GPU is low on memory
A 3-D U-net is a very large model. Try reducing patchSize, patchPerImage, miniBatchSize and inputSize.

6 months ago | 0 | accepted

Answered
How to run lane detection optimized with GPU coder project on matlab
https://www.mathworks.com/help/gpucoder/ug/lane-detection-optimized-with-gpu-coder.html

6 months ago | 0

Answered
Dedicated GPU Memory Usage - Permanently increases every time code is run
This error means you ran out of GPU memory. I can't reproduce any sort of memory leak in R2022a. It's possible that you are perm...

6 months ago | 1

Answered
minibatchqueue function cannot generate the expected MiniBatchSize
You've asked your arrayDatastore to iterate over the rows because that's the default. So as far as arrayDatastore is concerned, ...

7 months ago | 1 | accepted

Answered
RTX 3090 vs A100 in deep learning.
According to the spec as documented on Wikipedia, the RTX 3090 has about 2x the maximum speed at single precision than the A100,...

7 months ago | 0 | accepted

Answered
GPUCoder does not generate parallelized code
This looks about right to me, because your kernel is too simple and you're transferring data from and to the CPU on every call. ...

7 months ago | 1

Answered
Can I run custom Matlab function or gpuArray on another GPU?
You can use parallel syntax to process other arrays on other GPUs at the same time, or to process some data on the CPU at the sa...

8 months ago | 0 | accepted

Answered
How can run and upload my Deep Learning model in cloud?
Start here: https://uk.mathworks.com/help/deeplearning/ug/deep-learning-in-the-cloud.html

8 months ago | 0 | accepted

Answered
Error in minibatchqueue (line 290) numVariables = numel(getPreviewFromDatastore(originalDatastore));
Make sure the accompanying file augmentDataForLD2HDCT.m that comes with this example is on the path when you run your code.

8 months ago | 0 | accepted

Answered
Reorganizing current data structure in order to take advantage of GPU
Evidently with your nested structure you are not required to have uniform data - every element at each level can have a differen...

8 months ago | 1 | accepted

Answered
Saving images quickly for huge datasets
It's hard to say what will speed things up, since we don't know which part of the process is slow. Is saving slow? Is computing ...

8 months ago | 1
