Tag: soft
-
Stationary Reweighting Yields Local Convergence of Soft Fitted Q-Iteration
Stationary Reweighting Yields Local Convergence of Soft Fitted Q-Iteration arXiv:2512.23927v1 Announce Type: new Abstract: Fitted Q-iteration (FQI) and its entropy-regularized variant, soft FQI, are central tools for value-based model-free offline reinforcement learning, but can behave poorly under function approximation and distribution shift. In the entropy-regularized setting, we show that the soft Bellman operator is locally…
-
Towards Interpretable Soft Prompts
Towards Interpretable Soft Prompts arXiv:2504.02144v1 Announce Type: cross Abstract: Soft prompts have been popularized as a cheap and easy way to improve task-specific LLM performance beyond few-shot prompts. Despite their origin as an automated prompting method, however, soft prompts and other trainable prompts remain a black-box method with no immediately interpretable connections to prompting. We…