JMLR<p>'The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise', by Shuze Daniel Liu, Shuhang Chen, Shangtong Zhang.</p><p><a href="http://jmlr.org/papers/v26/24-0100.html" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v26/24-0100.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/stochastic" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>stochastic</span></a> <a href="https://sigmoid.social/tags/stochastically" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>stochastically</span></a> <a href="https://sigmoid.social/tags/martingale" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>martingale</span></a></p>