LeatherSagiKnowledgebase

Tag: off-policy

4 items with this tag.

  • Apr 13, 2026

    Bounded eligibility traces enable off-policy TD with much larger lambda values than prior schemes

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • eligibility-traces
    • variance-reduction
  • Apr 13, 2026

    Generalized Bellman Equation

    • reinforcement-learning
    • dynamic-programming
    • temporal-difference-learning
    • off-policy
  • Apr 13, 2026

    On Generalized Bellman Equations and Temporal-Difference Learning

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • eligibility-traces
    • generalized-bellman
  • Apr 13, 2026

    richard-sutton

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • dynamic-programming

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community