Comment by LPisGood

Comment by LPisGood 6 days ago

0 replies

A standard formulation of MAB problem assumes that acting will impact the rewards, and this forgetting factor approach is one which allows for that and still attempts to find the currently most exploitable lever.