Tag: bellman

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting arXiv:2512.23805v1 Announce Type: new Abstract: Fitted Q-evaluation (FQE) is a central method for off-policy evaluation in reinforcement learning, but it generally requires Bellman completeness: that the hypothesis class is closed under the evaluation Bellman operator. This requirement is challenging because enlarging the hypothesis class can worsen…

January 1, 2026

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting