How to use Matlab trainnet to train a network without an explicit output layer (R2024a)

Question

Michael Solonenko on 8 Aug 2024

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a

Edited: Matt J on 9 Aug 2024

I've attempted to train a CNN with the goal of assigning N numeric values to different input images, depending on image characteristics. It looked like the network's output layer could be a fully-connected layer with N outputs (because I have not found a linear output layer in Deep Network Designer). I am not sure if I can use a non-linear output layer instead, because this is fundamentally a regression task.

However, when using a fully-connected layer in place of an output layer the trainnet gives repeating errors indicating that I must have an output layer.

So basically, I have two questions:

1) Is it possible to use trainnet in a network without an output layer? It is difficult to imagine that a built-in training function has an oversight like this. Do I really need to construct a custom training loop if my network?..

2) Are there any alternatives? In essence, all I am looking for is an output layer that is either a) linear or b) does not change the previous layer's output. Just anything that is compatible with a regression task.

If any clarification is needed on my issue or network construction, I would be happy to provide it.

Thank you so much for your help!

Deep Learning Toolbox Version 24.1 (R2024a) , trainnet function, Matlab 2024.

2 Comments
Show NoneHide None

Matt J on 9 Aug 2024

Open in MATLAB Online

I can't reproduce that. Here is an example of a simple network training where the final layer is a fully connect layer. No error messages:

ds=combine( arrayDatastore(rand(3),IterationDim=3) , ...

arrayDatastore(rand(1),IterationDim=3) );

layers=[imageInputLayer([3,3,1]),fullyConnectedLayer(1)];

trainnet(ds,layers,'mse', trainingOptions('adam',TargetDataFormats="CB"))

Iteration Epoch TimeElapsed LearnRate TrainingLoss _________ _____ ___________ _________ ____________ 1 1 00:00:00 0.001 0.073747 30 30 00:00:01 0.001 0.036938 Training stopped: Max epochs completed

ans =

dlnetwork with properties: Layers: [2x1 nnet.cnn.layer.Layer] Connections: [1x2 table] Learnables: [2x3 table] State: [0x3 table] InputNames: {'imageinput'} OutputNames: {'fc'} Initialized: 1 View summary with summary.

Michael Solonenko on 9 Aug 2024

Use of 'mse' does the trick. I figured that out already, and thank you!

Sign in to comment.

Sign in to answer this question.

Answer 1

Aditya on 8 Aug 2024

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a#answer_1496769

Edited: Aditya on 8 Aug 2024

Hi @Michael Solonenko,

To Address your query:

1) From my knowledge we cannot use "trainnet" function without an explicit output layer. The trainnet function in MATLAB expects a complete network architecture, including an output layer, to properly define the loss function and perform backpropagation during training.

2) Use a fully connected layer with N outputs and set the loss function to "mse" since you are doing regression tasks. I am not sure why you are getting the mentioned error while doing this step. It might be helpful if you could provide the code that you are using (layers architecture & trainingOptions)

Also you could refer to this MATLAB documentation on "Train Convolutional Neural Network for Regression":

https://www.mathworks.com/help/deeplearning/ug/train-a-convolutional-neural-network-for-regression.html

Hope this helps!

3 Comments
Show 1 older commentHide 1 older comment

Michael Solonenko on 8 Aug 2024

Open in MATLAB Online

Hello @Aditya,

Thank you for the response!

However, I do have one question. In your response to 1) you explain that "trainnet" cannot function without an explicit output layer, but the link provided at the end has an example of just this.

Here, you have a network terminating in a fully connected layer:

layers = [
    imageInputLayer([28 28 1])
    convolution2dLayer(3,8,Padding="same")
    batchNormalizationLayer
    reluLayer
    averagePooling2dLayer(2,Stride=2)
    convolution2dLayer(3,16,Padding="same")
    batchNormalizationLayer
    reluLayer
    averagePooling2dLayer(2,Stride=2)
    convolution2dLayer(3,32,Padding="same")
    batchNormalizationLayer
    reluLayer
    convolution2dLayer(3,32,Padding="same")
    batchNormalizationLayer
    reluLayer
    fullyConnectedLayer(numResponses)];

And this network would be trained as such:

miniBatchSize  = 128;
validationFrequency = floor(numel(anglesTrain)/miniBatchSize);
options = trainingOptions("sgdm", ...
    MiniBatchSize=miniBatchSize, ...
    InitialLearnRate=1e-3, ...
    LearnRateSchedule="piecewise", ...
    LearnRateDropFactor=0.1, ...
    LearnRateDropPeriod=20, ...
    Shuffle="every-epoch", ...
    ValidationData={XTest,anglesTest}, ...
    ValidationFrequency=validationFrequency, ...
    Plots="training-progress", ...
    Metrics="rmse", ...
    Verbose=false);
net = trainnet(XTrain,anglesTrain,layers,"mse",options);

Is there something in the options that allows this? How does this work?

Aditya on 9 Aug 2024

Edited: Aditya on 9 Aug 2024

Hi @Michael Solonenko,

Yes, so when we call trainnet with the "mse" loss function, MATLAB automatically understands that the network is intended for regression tasks. The "mse" loss function (mean squared error) is applied to the output of the fully connected layer during training.

You can also look into this MATLAB documentation on regressionLayer: https://in.mathworks.com/help/deeplearning/ref/regressionlayer.html

Here they have mentioned to use "trainnet" with "mse" instead of using regressionLayer.

Hope this clarifies the doubt!

Michael Solonenko on 9 Aug 2024

Thank you!

Sign in to comment.

Answer 2

Matt J on 9 Aug 2024

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a#answer_1497229

Edited: Matt J on 9 Aug 2024

1) Is it possible to use trainnet in a network without an output layer? It is difficult to imagine that a built-in training function has an oversight like this.

trainnet is always to be used without an output layer.. The loss function is specified using the lossFcn input argument,

netTrained = trainnet(images,net,lossFcn,options)

2) Are there any alternatives? In essence, all I am looking for is an output layer that is either a) linear or b) does not change the previous layer's output. Just anything that is compatible with a regression task.

The lossFcn can be a customized loss function supplied by you. From the doc,

Function handle with the syntax loss = f(Y1,...,Yn,T1,...,Tm), where Y1,...,Yn are dlarray objects that correspond to the n network predictions and T1,...,Tm are dlarray objects that correspond to the m targets.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

How to use Matlab trainnet to train a network without an explicit output layer (R2024a)

2 Comments
Show NoneHide None

Answers (2)

3 Comments
Show 1 older commentHide 1 older comment

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

How to use Matlab trainnet to train a network without an explicit output layer (R2024a)

2 Comments Show NoneHide None

Answers (2)

3 Comments Show 1 older commentHide 1 older comment

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

2 Comments
Show NoneHide None

3 Comments
Show 1 older commentHide 1 older comment

0 Comments
Show -2 older commentsHide -2 older comments