Abstract
We study task sequences that allow for speeding up the learners average reward
intake through appropriate shifts of inductive bias
changes of the learner's policy. To evaluate
long-term effects of bias shifts setting the stage for later bias shifts we use the "success-story
algorithm" (SSA).SSA is occasionally called at times that may depend
... read more