<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Define the state of reinforcement learning in FlexSim Forum</title>
    <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574992#M74432</link>
    <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;&lt;A href="https://trello.com/c/RSA1JrPe/23957-reinforcement-learning-training-problem"&gt;New post here.&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hi &lt;A rel="user" href="https://answers.flexsim.com/users/29682/a9080109.html" nodeid="29682"&gt;@mark zhen &lt;/A&gt;, was one of Jason Lightfoot's or Felix Möhlmann's answers helpful? If so, please click the "Accept" button at the bottom of the one that best answers your question. Or if you still have questions, add a comment and we'll continue the conversation.&lt;/P&gt;&lt;P&gt;If we haven't heard back from you within 3 business days we'll auto-accept an answer, but you can always comment back to reopen your question.&lt;/P&gt;&lt;/DIV&gt;</description>
    <pubDate>Mon, 11 Sep 2023 11:24:52 GMT</pubDate>
    <dc:creator>jason_lightfoot_adsk</dc:creator>
    <dc:date>2023-09-11T11:24:52Z</dc:date>
    <item>
      <title>Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574982#M74422</link>
      <description>&lt;P&gt;&lt;I&gt;[ FlexSim 22.0.16 ]&lt;/I&gt;&lt;/P&gt;&lt;DIV class="fr-view clearfix"&gt;
 &lt;P id="isPasted" style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;I think I'm almost done, the state of my model now,&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;I want to define it as the number of deferred order but I'm a bit confused on how to do it?&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;And I define my actions I take six different actions&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;The part about the reward may be to minimize the tardness or to calculate the average of the overall tardness (but I don't know how to calculate the average in flexsim)&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;As for the label part, I have defined four labels in the source&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;ArrivalTime is the arrival time of the goods&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;date is the delivery time&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;total arrival total arrival time&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;mark the order in which goods enter&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;I want to calculate the average tardness in the global table of flexsim. How should I do it?&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;&lt;A rel="user" href="https://answers.flexsim.com/users/35833/kavikaf.html" nodeid="35833"&gt;@Kavika F&lt;/A&gt; &lt;A rel="user" href="https://answers.flexsim.com/users/19365/felixmh.html" nodeid="19365"&gt;@Felix Möhlmann&lt;/A&gt; &lt;A rel="user" href="https://answers.flexsim.com/users/226/jason.l.html" nodeid="226"&gt;@Jason Lightfoot&lt;/A&gt;&lt;/P&gt;
 &lt;P style="margin: 0px 0px 10px; color: rgb(51, 51, 51); font-family: ;"&gt;&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/73347-rule0905-autosave.fsm" target="_blank"&gt;rule0905_autosave.fsm&lt;/A&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 04 Sep 2023 07:23:20 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574982#M74422</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-04T07:23:20Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574983#M74423</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;You could just sum up the tardiness of each entering item in a label on the sink. Then get the average by dividing that value by the input stat.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="1694020679762.png"&gt;&lt;img src="https://forums.autodesk.com/t5/image/serverpage/image-id/1519287i53BF82314116FFB2/image-size/large?v=v2&amp;amp;px=999" role="button" title="1694020679762.png" alt="1694020679762.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/73339-rule0905-autosave.fsm" target="_blank"&gt;rule0905-autosave.fsm&lt;/A&gt;&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 06 Sep 2023 17:18:23 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574983#M74423</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-09-06T17:18:23Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574984#M74424</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;The rolling average  for the label on the sink would be:&lt;/P&gt;&lt;PRE&gt;((N-1)*avgTardiness+item.tardiness)/N&lt;/PRE&gt;&lt;P&gt;Using the global table you can use :&lt;/P&gt;&lt;PRE&gt;Table.query("SELECT AVG(tardiness) FROM [entry time]")[1][1]&lt;/PRE&gt;&lt;P&gt;..if the tardiness field contains the lateness of each item (not the rolling average).&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 06 Sep 2023 17:23:31 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574984#M74424</guid>
      <dc:creator>jason_lightfoot_adsk</dc:creator>
      <dc:date>2023-09-06T17:23:31Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574985#M74425</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;I have another question to ask about my state. I am currently trying to calculate how many orders I have in total that are delayed, but I feel that my approach may not be right.&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 06 Sep 2023 18:30:13 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574985#M74425</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-06T18:30:13Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574986#M74426</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;That can just be a counter on a label that you increment with item.tardiness&amp;gt;0.   &lt;P&gt;&lt;BR /&gt;&lt;/P&gt;Please do the all tutorials if you haven't already or consult your academic institution's training material.&lt;/DIV&gt;</description>
      <pubDate>Wed, 06 Sep 2023 22:14:13 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574986#M74426</guid>
      <dc:creator>jason_lightfoot_adsk</dc:creator>
      <dc:date>2023-09-06T22:14:13Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574987#M74427</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;No, I've done the calculation, but I want to treat it as my state, and my model has this done, but there may be some definitions or details that I haven't dealt with.&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 07 Sep 2023 05:40:08 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574987#M74427</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-07T05:40:08Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574988#M74428</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;These sounds like comments, not questions.&lt;/DIV&gt;</description>
      <pubDate>Thu, 07 Sep 2023 10:44:33 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574988#M74428</guid>
      <dc:creator>jason_lightfoot_adsk</dc:creator>
      <dc:date>2023-09-07T10:44:33Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574989#M74429</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P id="isPasted"&gt;My problem is that I'm not sure if my status is set correctly.&lt;/P&gt;
 &lt;P&gt;(My ideal state is to delay the order)&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 07 Sep 2023 13:48:15 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574989#M74429</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-07T13:48:15Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574990#M74430</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;What do you mean by 'state'? The observations for the RL algorithm?&lt;P&gt;We can't tell you if your model is set up correctly without seeing the current version.&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 08 Sep 2023 06:20:23 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574990#M74430</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-09-08T06:20:23Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574991#M74431</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/73416-rule0905-autosave-autosave.fsm" target="_blank"&gt;rule0905-autosave_autosave.fsm&lt;/A&gt;&lt;/P&gt;
 &lt;P&gt;Yes, in the literature I read on rl, the three elements of state action reward are mentioned. In my understanding, state may be On obeservation?&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Fri, 08 Sep 2023 09:48:15 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574991#M74431</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-08T09:48:15Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574992#M74432</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;&lt;A href="https://trello.com/c/RSA1JrPe/23957-reinforcement-learning-training-problem"&gt;New post here.&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hi &lt;A rel="user" href="https://answers.flexsim.com/users/29682/a9080109.html" nodeid="29682"&gt;@mark zhen &lt;/A&gt;, was one of Jason Lightfoot's or Felix Möhlmann's answers helpful? If so, please click the "Accept" button at the bottom of the one that best answers your question. Or if you still have questions, add a comment and we'll continue the conversation.&lt;/P&gt;&lt;P&gt;If we haven't heard back from you within 3 business days we'll auto-accept an answer, but you can always comment back to reopen your question.&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 11 Sep 2023 11:24:52 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574992#M74432</guid>
      <dc:creator>jason_lightfoot_adsk</dc:creator>
      <dc:date>2023-09-11T11:24:52Z</dc:date>
    </item>
    <item>
      <title>Re: Define the state of reinforcement learning</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574993#M74433</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;I do not understand what you mean!!&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 14 Sep 2023 15:57:10 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/define-the-state-of-reinforcement-learning/m-p/13574993#M74433</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-09-14T15:57:10Z</dc:date>
    </item>
  </channel>
</rss>

