LeatherSagiKnowledgebase

Tag: off-policy

4 items with this tag.

Apr 13, 2026
Bounded eligibility traces enable off-policy TD with much larger lambda values than prior schemes
Apr 13, 2026
Generalized Bellman Equation
Apr 13, 2026
On Generalized Bellman Equations and Temporal-Difference Learning
Apr 13, 2026
richard-sutton
