Creating a Q-function with multiple discrete actions

Question

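I want to create a Q-function with multiple discrete actions using rlFiniteSetSpec, but an error is returned because the number of inputs does not match Dimension. My current code is shown below. Is there a way to make Dimension match the number of inputs?

I apologize for the basic question; any help would be appreciated.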

% Excerpt of the action-related code
NA = 5;
actInfo =rlFiniteSetSpec(NA);
actPath = [
    featureInputLayer(NA,'Normalization','none','Name','action')  
    fullyConnectedLayer(50,'Name','CA1')]

Answer 1

Hiro Yoshino on 20 Oct 2020

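The argument of rlFiniteSetSpec is not the number of inputs; it specifies the values the action can actually take.

If there is a single action, pass the discrete values it can take as a vector.

If there are multiple actions, use a cell array and pass each possible combination as a vector.

https://jp.mathworks.com/help/reinforcement-learning/ref/rl.util.rlfinitesetspec.html#mw_68f70adf-d6a9-4cbe-846c-a7d0823c0774_sep_mw_770a16f8-3eaf-4f06-80ca-87296824fb89

The details are described around there.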
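
The two cases described in this answer can be sketched as follows. This is a minimal illustration, assuming Reinforcement Learning Toolbox is installed; the variable names and the example values are hypothetical and not taken from the original question.

```matlab
% Single discrete action: pass the admissible values as a vector.
% Each element is a scalar, so Dimension is [1 1], not the number of values.
singleActInfo = rlFiniteSetSpec([1 2 3 4 5]);
disp(singleActInfo.Dimension)

% Two discrete actions chosen together: pass a cell array in which
% each cell holds one valid combination of the two actions.
pairActInfo = rlFiniteSetSpec({[1 1],[1 2],[2 1],[2 2]});
disp(pairActInfo.Dimension)

% Number of values a network input layer for the action would need.
numActionInputs = prod(pairActInfo.Dimension);
```

The point to note is that Dimension describes the size of one action element, while numel(Elements) counts how many choices exist. An input layer for the action should be sized from the former.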

Y. M on 21 Oct 2020

I have now rewritten the code as shown below.

It runs up to criticOpts = ..., but critic = ... still returns the following error:

Error: rl.representation.rlAbstractRepresentation/validateModelInputDimension (line 557)

Model input sizes must match the dimensions specified in the corresponding observation and action info specifications.

NS=4;
selectable_actions={1,2,3,4,5};
Ts = 0.05;
obsInfo =rlNumericSpec(NS);
obsInfo.Name = 'observation';
obsInfo.Description = 'Temperature, absolute humidity, representative wall surface temperature';    % description of the state information (optional)
actInfo =rlFiniteSetSpec(selectable_actions);
actInfo.Name = 'AirVolume' ;
NA = numel(actInfo.Elements);
    
obsPath = [
    featureInputLayer(NS,'Normalization','none','Name','state')
    fullyConnectedLayer(50,'Name','CS1')];
actPath = [
    featureInputLayer(NA,'Normalization','none','Name','action')
    fullyConnectedLayer(50,'Name','CA1')];
comPath=[   
    additionLayer(2,'Name','add')
    reluLayer('Name','CriticCommonRelu') 
    fullyConnectedLayer(1,'Name','output')];
    
dnn = layerGraph();
dnn = addLayers(dnn,obsPath);
dnn = addLayers(dnn,actPath);
dnn = addLayers(dnn,comPath);
dnn = connectLayers(dnn,'CS1','add/in1');
dnn = connectLayers(dnn,'CA1','add/in2');
figure
plot(layerGraph(dnn))
criticOpts = rlRepresentationOptions('LearnRate',0.001,'Optimizer',"rmsprop");
critic = rlQValueRepresentation(dnn,obsInfo,actInfo,'Observation',{'state'},'Action',{'action'},criticOpts);

Hiro Yoshino on 21 Oct 2020

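Your understanding of cell arrays and similar basics seems shaky, so I recommend taking the introductory MATLAB course:

https://matlabacademy.mathworks.com/jp

selectable_actions=[1,2,3,4,5];

Writing it as above is what the documentation describes (it will probably also work with a cell array).

There appear to be three observations, yet NS = 4? Is that intended?

obsInfo =rlNumericSpec([4 1]);

I think it should be something like this.

In any case, this is covered in the documentation, so I recommend reading it carefully.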

https://jp.mathworks.com/help/reinforcement-learning/ref/rl.util.rlnumericspec.html#mw_dd97f7de-8690-4904-9211-08eb0123352b
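
One likely cause of the reported error, not spelled out in the thread, is that the action input layer is sized with the number of selectable values (5) while actInfo.Dimension for a scalar discrete action is [1 1]. The sketch below is an assumption-based rework of the posted code rather than a confirmed fix: it sizes the action input from prod(actInfo.Dimension) and uses the R2020b-era rlQValueRepresentation API that appears in the question. It assumes Reinforcement Learning Toolbox and Deep Learning Toolbox are installed.

```matlab
NS = 4;                                   % number of observation values, as in the question
obsInfo = rlNumericSpec([NS 1]);
obsInfo.Name = 'observation';

actInfo = rlFiniteSetSpec([1 2 3 4 5]);   % one scalar action with five possible values
actInfo.Name = 'AirVolume';
NA = prod(actInfo.Dimension);             % size of one action, not numel(actInfo.Elements)

obsPath = [
    featureInputLayer(NS,'Normalization','none','Name','state')
    fullyConnectedLayer(50,'Name','CS1')];
actPath = [
    featureInputLayer(NA,'Normalization','none','Name','action')
    fullyConnectedLayer(50,'Name','CA1')];
comPath = [
    additionLayer(2,'Name','add')
    reluLayer('Name','CriticCommonRelu')
    fullyConnectedLayer(1,'Name','output')];

% Assemble the two input branches and the common output branch.
dnn = layerGraph(obsPath);
dnn = addLayers(dnn,actPath);
dnn = addLayers(dnn,comPath);
dnn = connectLayers(dnn,'CS1','add/in1');
dnn = connectLayers(dnn,'CA1','add/in2');

criticOpts = rlRepresentationOptions('LearnRate',0.001,'Optimizer',"rmsprop");
critic = rlQValueRepresentation(dnn,obsInfo,actInfo, ...
    'Observation',{'state'},'Action',{'action'},criticOpts);

% Query the critic for a random observation and the action value 3.
q = getValue(critic,{rand(NS,1)},{3});
```

With this sizing, the network's action input has one element, which agrees with the action specification that the critic constructor validates against.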

Y. M on 21 Oct 2020

Even though I have only just started, I am embarrassed to be stuck on something so basic.

Thank you very much for all the advice.
