Summary of Effective Off-policy Evaluation and Learning in Contextual Combinatorial Bandits, by Tatsuhiro Shimizu et al.
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Banditsby Tatsuhiro Shimizu, Koichi Tanaka, Ren Kishimoto,…