NPTEL Video Course : NOC:Stochastic Approximation: Theory and Applications


Lecture 43 - Best Policy Algorithm for Q-Value Functions: A Stochastic Approximation Formulation


            


DIGIMAT Learning Management Platform