LeatherSagiKnowledgebase

Tag: reinforcement-learning

8 items with this tag.

  • Apr 13, 2026

    Bounded eligibility traces enable off-policy TD with much larger lambda values than prior schemes

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • eligibility-traces
    • variance-reduction
  • Apr 13, 2026

    Transition-based discounting unifies episodic and continuing RL tasks under a single Bellman contraction

    • reinforcement-learning
    • bellman-operator
    • generalized-discounting
    • options
  • Apr 13, 2026

    Generalized Bellman Equation

    • reinforcement-learning
    • dynamic-programming
    • temporal-difference-learning
    • off-policy
  • Apr 13, 2026

    Transition-Based Discounting

    • reinforcement-learning
    • bellman-operator
    • generalized-discounting
    • options
  • Apr 13, 2026

    Unifying Task Specification in Reinforcement Learning

    • reinforcement-learning
    • bellman-operator
    • generalized-discounting
    • options
  • Apr 13, 2026

    On Generalized Bellman Equations and Temporal-Difference Learning

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • eligibility-traces
    • generalized-bellman
  • Apr 13, 2026

    martha-white

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy-learning
    • general-value-functions
  • Apr 13, 2026

    richard-sutton

    • reinforcement-learning
    • temporal-difference-learning
    • off-policy
    • dynamic-programming
