Summary of Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning, by Subhojyoti Mukherjee et al.
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning by Subhojyoti Mukherjee, Josiah…