Reinforcement Learning Theory

Trending

3papers

2.3viability

+100%30d

Papers

1–3 of 3

Research Paper·Mar 22, 2026

The Myhill-Nerode Theorem for Bounded Interaction: Canonical Abstractions via Agent-Bounded Indistinguishability

Any capacity-limited observer induces a canonical quotient on its environment: two situations that no bounded agent can distinguish are, for that agent, the same. We formalise this for finite POMDPs. ...

3.0 viabilityHas code

Research Paper·Mar 25, 2026

Optimal Variance-Dependent Regret Bounds for Infinite-Horizon MDPs

Online reinforcement learning in infinite-horizon Markov decision processes (MDPs) remains less theoretically and algorithmically developed than its episodic counterpart, with many algorithms sufferin...

2.0 viability

Research Paper·Feb 27, 2026

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies

We investigate the theoretical aspects of offline reinforcement learning (RL) under general function approximation. While prior works (e.g., Xie et al., 2021) have established the theoretical foundati...