Skip to content

Implement LM-DSEE and SW-UCB# algorithms for non-stationary bandits from paper arXiv:1802.08380 #151

@Naereen

Description

@Naereen

For non-stationary bandits, the following reference introduce the awesome LM-DSEE and SW-UCB# algorithms: ["On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems", by Lai Wei, Vaibhav Srivastava, 2018, arXiv:1802.08380].

I need to:

  • Write the LM-DSEE algorithm!
  • Write the SW-UCB# algorithm!
  • Implement it in a very generic way, as children of BaseWrapperPolicy, so any index policy can directly be used and not just UCB!
  • Test it on simple problems, and check what the authors claim in their paper,
  • Compare them with DiscountedUCB, SlindingWindow-UCB or SW-klUCB or SWRestart-UCB.

Metadata

Metadata

Assignees

Labels

new algoI have to implement a new algorithm! Yay!non-stationaryFor non-stationary bandits simulationssingle-playerFor single-player bandits simulations

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions