Resumen de A stability criterion for two timescale stochastic approximation schemes - Dialnet

Ir al contenido

Ayuda

Resumen de A stability criterion for two timescale stochastic approximation schemes

Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

Abstract We present the first sufficient conditions that guarantee stability of two-timescale stochastic approximation schemes. Our analysis is based on the ordinary differential equation (ODE) method and is an extension of the results in Borkar and Meyn (2000) for single-timescale schemes. As an application of our result, we show the stability of iterates in a two-timescale stochastic approximation scheme arising in reinforcement learning.

Fundación Dialnet

Acceso de usuarios registrados

Imagen de identificación

¿Olvidó su contraseña?

¿Es nuevo? Regístrese

Ventajas de registrarse

Dialnet Plus

© 2001-2025 Fundación Dialnet · Todos los derechos reservados

Coordinado por: