Physics-Informed Neural Network - Identify a coefficient of the loss function

Is it possible in MATLAB to train a PINN so that it also finds unknown parameters of the physics loss function?
For instance, consider a PINN for projectile motion where the drag coefficient is defined as a trainable variable, even though it is a coefficient of the loss function.

Answers (1)

Ben on 18 Sep 2023
Yes, this is possible: you can make the coefficient μ a dlarray and train it alongside the dlnetwork or other dlarrays, as in https://uk.mathworks.com/help/deeplearning/ug/solve-partial-differential-equations-using-deep-learning.html
There's also some discussion where I previously gave details on this: https://uk.mathworks.com/matlabcentral/answers/1783690-physics-informed-nn-for-parameter-identification
Here's a simple example that uses an inverse PINN to find μ from solution data.
% Inverse PINN for d2x/dt2 = mu*x
%
% For mu<0 it's known the solution is
% x(t) = a*cos(sqrt(-mu)*t) + b*sin(sqrt(-mu)*t);
%
% where a, b are determined by initial conditions.
%
% Let's fix a = 1, b = 0 and train to learn mu from the solution data x
% Set some value for mu and specify the true solution function to generate data to train on.
% In practice these values are unknown.
muActual = -rand;
x = @(t) cos(sqrt(-muActual)*t);
% Create training data - in practice you might get this data elsewhere, e.g. from sensors on a physical system or model.
% Evaluate x(t) at uniformly spaced t in [0,maxT].
% Choose maxT such that a full wavelength occurs for x(t).
maxT = 2*pi/sqrt(-muActual);
% Number of training points; defined here since t depends on it.
batchSize = 100;
t = dlarray(linspace(0,maxT,batchSize),"CB");
xactual = dlarray(x(t),"CB");
% Specify a network and initial guess for mu as parameters to train
net = [
    featureInputLayer(1)
    fullyConnectedLayer(100)
    tanhLayer
    fullyConnectedLayer(100)
    tanhLayer
    fullyConnectedLayer(1)];
params.net = dlnetwork(net);
params.mu = dlarray(-0.5);
% Specify training configuration
numEpochs = 5000;
avgG = [];
avgSqG = [];
lossFcn = dlaccelerate(@modelLoss);
clearCache(lossFcn);
lr = 1e-3;
% Train
for i = 1:numEpochs
    [loss,grad] = dlfeval(lossFcn,t,xactual,params);
    [params,avgG,avgSqG] = adamupdate(params,grad,avgG,avgSqG,i,lr);
    if mod(i,1000)==0
        fprintf("Epoch: %d, Predicted mu: %.3f, Actual mu: %.3f\n",i,extractdata(params.mu),muActual);
    end
end
function [loss,grad] = modelLoss(t,x,params)
    % Implement the PINN loss by predicting x(t) via params.net and computing the derivatives with dlgradient.
    xpred = forward(params.net,t);
    % Here we sum xpred over the batch dimension to get a scalar.
    % This is required since dlgradient only takes gradients of scalars.
    % This works because xpred(i) depends only on t(i)
    % - i.e. the network's forward pass vectorizes over the batch dimension.
    dxdt = dlgradient(sum(xpred),t,EnableHigherDerivatives=true);
    % EnableHigherDerivatives is needed here too, since the loss gradient
    % with respect to the network parameters flows through d2xdt2.
    d2xdt2 = dlgradient(sum(dxdt),t,EnableHigherDerivatives=true);
    odeResidual = d2xdt2 - params.mu*xpred;
    % Compute the mean square error of the ODE residual.
    odeLoss = mean(odeResidual.^2);
    % Compute the L2 difference between the predicted xpred and the true x.
    dataLoss = l2loss(x,xpred);
    % Sum the losses and take gradients.
    % Note that by creating grad as a struct with fields matching params we can
    % easily pass grad to adamupdate in the training loop.
    loss = odeLoss + dataLoss;
    [grad.net,grad.mu] = dlgradient(loss,params.net.Learnables,params.mu);
end
This script should run reasonably quickly and usually approximates μ well.
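As a quick sanity check after training, you could also compare the network's prediction against the data. A minimal sketch, assuming t, xactual, params, and muActual still exist from the script above:
% Compare the trained network's prediction against the training data.
xpred = predict(params.net,t);
plot(extractdata(t),extractdata(xactual),"o",extractdata(t),extractdata(xpred),"-")
legend("Data","PINN prediction"); xlabel("t"); ylabel("x(t)")
fprintf("Relative error in mu: %.2e\n",abs(extractdata(params.mu)-muActual)/abs(muActual))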
In general you will need to:
  1. Modify the PINN loss in the modelLoss function for your particular ODE or PDE.
  2. Modify the params to include both the dlnetwork and any additional coefficients (see the sketch after this list).
  3. Tweak the net design and training configuration to achieve a good loss.
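For point 2, here is a sketch of what params might look like with two unknown coefficients, using a hypothetical damped-oscillator residual d2x/dt2 + c*dx/dt + k*x = 0 rather than the equation above:
% Hypothetical example: two trainable coefficients c and k for the ODE
% d2x/dt2 + c*dx/dt + k*x = 0. adamupdate picks both up because they
% live in the same params struct as the network.
params.net = dlnetwork(net);
params.c = dlarray(0.1);  % initial guess for the damping coefficient
params.k = dlarray(1.0);  % initial guess for the stiffness coefficient
% Inside modelLoss, the residual and gradient call would then become:
% odeResidual = d2xdt2 + params.c.*dxdt + params.k.*xpred;
% [grad.net,grad.c,grad.k] = dlgradient(loss,params.net.Learnables,params.c,params.k);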
  3 Comments
Ben on 21 Nov 2023
@Giulio I don't get an error when running that script.
The error stack in your comment notes complex values, which aren't supported by adamupdate. I'm not sure how complex values would appear in your script, though. Is this reproducible on every iteration, or on every training run? If so, you could remove the dlaccelerate call and place a breakpoint in modelLoss to try to identify where the values become complex.
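For example, you could temporarily drop assertions like these into modelLoss to catch where a complex value first appears (a debugging sketch, not part of the original script):
% Temporary checks inside modelLoss: fail fast when a value turns complex.
% Run without dlaccelerate so the assertions and breakpoints trigger.
assert(isreal(extractdata(xpred)),"xpred became complex")
assert(isreal(extractdata(dxdt)),"dxdt became complex")
assert(isreal(extractdata(d2xdt2)),"d2xdt2 became complex")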
Giulio on 21 Nov 2023
Thank you for your quick reply.
Could you confirm that a complex gradient is not expected here? I think the issue is related to a wrong setting of the loss function.
