<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://toohna.ourproject.org/RPG/index.php?action=history&amp;feed=atom&amp;title=AlMeta%3AYTDNM_reinforcement_learning</id>
		<title>AlMeta:YTDNM reinforcement learning - Revision history</title>
		<link rel="self" type="application/atom+xml" href="http://toohna.ourproject.org/RPG/index.php?action=history&amp;feed=atom&amp;title=AlMeta%3AYTDNM_reinforcement_learning"/>
		<link rel="alternate" type="text/html" href="http://toohna.ourproject.org/RPG/index.php?title=AlMeta:YTDNM_reinforcement_learning&amp;action=history"/>
		<updated>2026-05-04T23:23:51Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.30.2</generator>

	<entry>
		<id>http://toohna.ourproject.org/RPG/index.php?title=AlMeta:YTDNM_reinforcement_learning&amp;diff=1781&amp;oldid=prev</id>
		<title>Cmdrtako: Created page with &quot;Your the Dog Now Ma(n|chine)  ---- #if (you got a reward) ## if reward is &gt; 0 ###Treat =true ###add top of @History to @Trick ##else (if reward &lt; 0) ###Treat = False ###clear...&quot;</title>
		<link rel="alternate" type="text/html" href="http://toohna.ourproject.org/RPG/index.php?title=AlMeta:YTDNM_reinforcement_learning&amp;diff=1781&amp;oldid=prev"/>
				<updated>2022-06-28T08:55:51Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot;Your the Dog Now Ma(n|chine)  ---- #if (you got a reward) ## if reward is &amp;gt; 0 ###Treat =true ###add top of @History to @Trick ##else (if reward &amp;lt; 0) ###Treat = False ###clear...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;Your the Dog Now Ma(n|chine)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
#if (you got a reward)&lt;br /&gt;
## if reward is &amp;gt; 0&lt;br /&gt;
###Treat =true&lt;br /&gt;
###add top of @History to @Trick&lt;br /&gt;
##else (if reward &amp;lt; 0)&lt;br /&gt;
###Treat = False&lt;br /&gt;
###clear @Trick&lt;br /&gt;
##for X in @History&lt;br /&gt;
###update X in NN&lt;br /&gt;
###if (X was Conditioned) &lt;br /&gt;
####Reward = Reward minus( 1 minus likelihood of choosing X)&lt;br /&gt;
###if(X was not Conditioned)&lt;br /&gt;
####Reward = Reward times 0.5&lt;br /&gt;
##clear @History&lt;br /&gt;
##if (you are not sure what to do)&lt;br /&gt;
###if (Treat)&lt;br /&gt;
####if (second of @Trick not Conditioned)&lt;br /&gt;
#####add @Trick to @Repertoire&lt;br /&gt;
###add new choice to @History as not Conditioned&lt;br /&gt;
##else (if you are sure what to do)&lt;br /&gt;
###add new choice to @History as Conditioned&lt;br /&gt;
#else (if you didn't get a reward)&lt;br /&gt;
##if (your in not vary sure of what do)&lt;br /&gt;
###if (you last action was Conditioned)&lt;br /&gt;
####reward = -1&lt;br /&gt;
####while(top of @History was Conditioned)&lt;br /&gt;
#####pop top of @History as X&lt;br /&gt;
#####Update X in NN&lt;br /&gt;
#####reward = reward times likelihood of X&lt;br /&gt;
####Clear @History&lt;br /&gt;
###Clear @Trick&lt;br /&gt;
###add new choice to @History as not Conditioned &lt;br /&gt;
##else (if you are sure of what to do)&lt;br /&gt;
###if you last action was Conditioned)&lt;br /&gt;
####add top of @History to @Trick &lt;br /&gt;
###add new choice to  @History as Conditioned.&lt;/div&gt;</summary>
		<author><name>Cmdrtako</name></author>	</entry>

	</feed>