<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Machine random action in FlexSim Forum</title>
    <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572695#M72557</link>
    <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;When I need to make the next item, I can use four scheduling rules to perform actions (such as spt lpt and other scheduling rules), and this action is random. I have read a document about his reward function It is determined that he will perform all four actions and learn the best results (for example, it takes less time, etc.), but what I want to do now is that the actions performed by my agent are based on the actions I gave His scheduling rules, and then learn the situation that each time step will think about which rule will be the best.&lt;/P&gt;
&lt;/DIV&gt;</description>
    <pubDate>Wed, 23 Aug 2023 16:41:34 GMT</pubDate>
    <dc:creator>a9080109</dc:creator>
    <dc:date>2023-08-23T16:41:34Z</dc:date>
    <item>
      <title>Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572693#M72555</link>
      <description>&lt;P&gt;&lt;I&gt;[ FlexSim 22.0.16 ]&lt;/I&gt;&lt;/P&gt;&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;What I want to do now is to make the strategy adopted by my machine pull randomly. For example, I have four options (spt lpt fifo lifo). When I finish processing, I will randomly use one of these four actions. one&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/72884-random-action.fsm" target="_blank"&gt;random action.fsm&lt;/A&gt;&lt;/P&gt;
 &lt;P&gt;What I want to do now is that I want to use reinforcement learning to complete the optimal scheduling, but I don't know how to set up my observation space and action space.&lt;/P&gt;
 &lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="1692723650962.png"&gt;&lt;img src="https://forums.autodesk.com/t5/image/serverpage/image-id/1519149i28079D26032ADABA/image-size/large?v=v2&amp;amp;px=999" role="button" title="1692723650962.png" alt="1692723650962.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Tue, 22 Aug 2023 17:01:19 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572693#M72555</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-22T17:01:19Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572694#M72556</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;Hi &lt;A rel="user" href="https://answers.flexsim.com/users/29682/a9080109.html" nodeid="29682"&gt;@mark zhen&lt;/A&gt;,&lt;/P&gt;
 &lt;P&gt;The tutorial on our site is similar and should be helpful. &lt;/P&gt;
 &lt;P&gt;Can you clarify your goal for this model? Are you saying that you can change methods every single time the processor pulls a flow item from the queue? This would be similar to our tutorial. Our tutorial, however, pulls an item based on type, and there is a clear pattern as to which type is best based on the observation of which type was last pulled. For reinforcement learning to work well, there needs to be a learnable pattern between the observation and the best action.&lt;/P&gt;
 &lt;P&gt;If you simply want to know which of the four sequencing methods is best for your model, perhaps using the experimenter would be better. &lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 15:09:44 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572694#M72556</guid>
      <dc:creator>natalie_white</dc:creator>
      <dc:date>2023-08-23T15:09:44Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572695#M72557</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;When I need to make the next item, I can use four scheduling rules to perform actions (such as spt lpt and other scheduling rules), and this action is random. I have read a document about his reward function It is determined that he will perform all four actions and learn the best results (for example, it takes less time, etc.), but what I want to do now is that the actions performed by my agent are based on the actions I gave His scheduling rules, and then learn the situation that each time step will think about which rule will be the best.&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 16:41:34 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572695#M72557</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-23T16:41:34Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572696#M72558</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;I have read and understood the teaching guide, but what I want to do now is that I want my agent to learn these traditional scheduling methods to explore new possibilities, or the four scheduling methods Integrate for a better scheduling result&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 16:43:06 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572696#M72558</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-23T16:43:06Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572697#M72559</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;This application of reinforcement learning is probably not realistic. Let me explain: &lt;/P&gt;
 &lt;P&gt;First, how will your model learn which rule is best? It needs some metric for "best," so you'd need to determine what that means (likely you'll want to minimize time or maximize throughput) and design a rewards system that will promote your objective. &lt;/P&gt;
 &lt;P&gt;Additionally, "best" is going to depend on the current state of your model. What exactly is it, in your model, that determines which rule is optimal? You need to be able to identify what that is and have your model observe it. This is your main problem. I don't know if there is a clear answer to this question, and if you aren't able to answer this question, then you can't successfully use reinforcement learning for your model. &lt;/P&gt;
 &lt;P&gt;In the tutorial, the best action to take (which type of item to pull next) is directly tied to the observation (which type of item was last pulled). Reinforcement learning requires a connection between the observation and the best action to take. &lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 18:07:23 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572697#M72559</guid>
      <dc:creator>natalie_white</dc:creator>
      <dc:date>2023-08-23T18:07:23Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572698#M72560</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;I think I have a completion rate. For example, I hope that my order can be completed within a certain time. If so, I will give him a +1 reward. If not, I will give him a -1 reward. This is my reward function. Statute. Then if I finish all three within the time, I will compare the total completion time, and give him 1 for the smallest one.&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 18:28:03 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572698#M72560</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-23T18:28:03Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572699#M72561</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 The problem is that you cannot know which "rule" optimizes your completion rate. Your completion rate is affected by these two things: the item's type, and the type of the previous item. That's the point of the example in the tutorial. 
 &lt;P&gt;You can't know which rule is best at each time step. You CAN know which item type is best to pull, but you don't know which rule will have you pull that item. At various points in your model run, a certain rule will pull different items. &lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 23 Aug 2023 19:40:06 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572699#M72561</guid>
      <dc:creator>natalie_white</dc:creator>
      <dc:date>2023-08-23T19:40:06Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572700#M72562</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;But there are similar methods mentioned in the literature I read, but I want to complete it first (how should I write random actions)?&lt;/P&gt;
 &lt;P&gt;&lt;A rel="nofollow" id="isPasted" href="https://www.sciencedirect.com/science/article/pii/S0921889000000877"&gt;https://www.sciencedirect.com/science/article/pii/S0921889000000877&lt;/A&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 06:30:46 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572700#M72562</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-24T06:30:46Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572701#M72563</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;This paper mentioned breakdowns and priority jobs as random events. A schedule depends on intervals of known input and demanded output over time. Even if you have a static sequence of products you can define a window width of time, which varies for each simulation setup. This window defines what your control mechanism knows about to decide on any order of production steps in your model. &lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 07:05:59 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572701#M72563</guid>
      <dc:creator>joerg_vogel_HsH</dc:creator>
      <dc:date>2023-08-24T07:05:59Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572702#M72564</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;Sorry, can you explain it more clearly? Then how do I implement the content in this document&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 07:08:55 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572702#M72564</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-24T07:08:55Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572703#M72565</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;Regarding random execution, my current thoughts are roughly as shown in the model，but it doesnt work&lt;/P&gt;
 &lt;P&gt;&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/72971-random-action.fsm" target="_blank"&gt;random-action.fsm&lt;/A&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 07:45:54 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572703#M72565</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-24T07:45:54Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572704#M72566</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;You cast a continuous value between 1 and 2 as an integer which removes the decimal places. As a result &lt;EM&gt;randomAction&lt;/EM&gt; is always 1. Use the discrete uniform distribution.&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 08:04:48 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572704#M72566</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-08-24T08:04:48Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572705#M72567</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;&lt;A rel="user" href="https://answers.flexsim.com/users/19365/felixmh.html" nodeid="19365"&gt;@Felix Möhlmann&lt;/A&gt;&lt;/P&gt;
 &lt;P&gt;I have a new idea now, I write the method I need in several ways (such as 1= spt, 2 = lpt) and so on, so that my machine is like looking up a dictionary, I randomly look up a number and execute the number inside required content&lt;/P&gt;
 &lt;P&gt;Also, is that what you're talking about?&lt;span class="lia-inline-image-display-wrapper" image-alt="1692865706857.png"&gt;&lt;img src="https://forums.autodesk.com/t5/image/serverpage/image-id/1519145i1244DDF5DE2826FC/image-size/large?v=v2&amp;amp;px=999" role="button" title="1692865706857.png" alt="1692865706857.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 08:22:46 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572705#M72567</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-24T08:22:46Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572706#M72568</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;If you set the type of randomAction to a double it will never be equal to 1 or 2. Leave it as an integer but generate a discrete number random number (duniform(1, 2)).&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 08:57:27 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572706#M72568</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-08-24T08:57:27Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572707#M72569</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;So now I want to use parameters to select a method, what should I do?&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 24 Aug 2023 09:04:07 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572707#M72569</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-24T09:04:07Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572708#M72570</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;You already have code that chooses a logic depending on a numeric value. Instead of randomly generating that value, read it from a parameter.&lt;/P&gt;&lt;P&gt;&lt;A id="isPasted" href="https://docs.flexsim.com/en/21.1/ModelLogic/ModelParameters/ModelParameters.html"&gt;https://docs.flexsim.com/en/21.1/ModelLogic/ModelParameters/ModelParameters.html&lt;/A&gt;&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 25 Aug 2023 05:59:39 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572708#M72570</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-08-25T05:59:39Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572709#M72571</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;I don't quite understand~~~&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/73009-random-action.fsm" target="_blank"&gt;random action.fsm&lt;/A&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Fri, 25 Aug 2023 15:31:26 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572709#M72571</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-25T15:31:26Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572710#M72572</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;Define a parameter and use it to control how the processor pulls the next part.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="1693048321240.png"&gt;&lt;img src="https://forums.autodesk.com/t5/image/serverpage/image-id/1519148iDB24162EBB00463C/image-size/large?v=v2&amp;amp;px=999" role="button" title="1693048321240.png" alt="1693048321240.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Sat, 26 Aug 2023 11:12:32 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572710#M72572</guid>
      <dc:creator>moehlmann_fe</dc:creator>
      <dc:date>2023-08-26T11:12:32Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572711#M72573</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;&lt;P&gt;Hi &lt;A rel="user" href="https://answers.flexsim.com/users/29682/a9080109.html" nodeid="29682"&gt;@mark zhen &lt;/A&gt;, was Natalie White's answer helpful? If so, please click the "Accept" button at the bottom of their answer. Or if you still have questions, add a comment and we'll continue the conversation.&lt;/P&gt;&lt;P&gt;If we haven't heard back from you within 3 business days we'll auto-accept an answer, but you can always comment back to reopen your question.&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 30 Aug 2023 14:23:20 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572711#M72573</guid>
      <dc:creator>Jeanette_Fullmer</dc:creator>
      <dc:date>2023-08-30T14:23:20Z</dc:date>
    </item>
    <item>
      <title>Re: Machine random action</title>
      <link>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572712#M72574</link>
      <description>&lt;DIV class="fr-view clearfix"&gt;
 &lt;P&gt;&lt;A rel="user" href="https://answers.flexsim.com/users/42879/nataliewhite.html" nodeid="42879"&gt;@Natalie White&lt;/A&gt; &lt;A rel="user" href="https://answers.flexsim.com/users/19365/felixmh.html" nodeid="19365"&gt;@Felix Möhlmann&lt;/A&gt; &lt;A rel="user" href="https://answers.flexsim.com/users/35833/kavikaf.html" nodeid="35833"&gt;@Kavika F&lt;/A&gt; &lt;/P&gt;
 &lt;P id="isPasted"&gt;I think I'm almost done, the state of my model now,&lt;/P&gt;
 &lt;P&gt;I want to define it as the number of deferred tickets but I'm a bit confused on how to do it?&lt;/P&gt;
 &lt;P&gt;And I define my actions I take six different actions&lt;/P&gt;
 &lt;P&gt;The part about the reward may be to minimize the tardness or to calculate the average of the overall tardness (but I don't know how to calculate the average in flexsim)&lt;/P&gt;
 &lt;P&gt;As for the label part, I have defined four labels in the source&lt;/P&gt;
 &lt;P&gt;ArrivalTime is the arrival time of the goods&lt;/P&gt;
 &lt;P&gt;date is the delivery time&lt;/P&gt;
 &lt;P&gt;total arrival total arrival time&lt;/P&gt;
 &lt;P&gt;mark the order in which goods enter&lt;/P&gt;
 &lt;P&gt;&lt;A rel="noopener noreferrer" href="https://answers.flexsim.com/storage/attachments/73183-random-action-autosave.fsm" target="_blank"&gt;random-action_autosave.fsm&lt;/A&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Wed, 30 Aug 2023 15:57:14 GMT</pubDate>
      <guid>https://forums.autodesk.com/t5/flexsim-forum/machine-random-action/m-p/13572712#M72574</guid>
      <dc:creator>a9080109</dc:creator>
      <dc:date>2023-08-30T15:57:14Z</dc:date>
    </item>
  </channel>
</rss>

