搜索结果: 1-1 共查到“理论统计学 Contextual Bandits”相关记录1条 . 查询时间(0.14 秒)
Efficient Optimal Learning for Contextual Bandits
Efficient Optimal Learning Contextual Bandits
2011/7/6
We address the problem of learning in an online setting where the learner repeatedly observes features, selects among a set of actions, and receives reward for the action taken.