Tag: centric

Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms

Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms arXiv:2504.20877v1 Announce Type: new Abstract: The objective of canonical multi-armed bandits is to identify and repeatedly select an arm with the largest reward, often in the form of the expected value of the arm’s probability distribution. Such a utilitarian perspective and focus on the probability models’ first…

April 30, 2025

Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms