Explore 1 AI terms in Multi-Armed Bandit
A linear bandit is a type of reinforcement learning problem where actions yield rewards based on a linear relationship with features.