Summary of Improving Multi-domain Task-oriented Dialogue System with Offline Reinforcement Learning, by Dharmendra Prajapat et al.
Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learningby Dharmendra Prajapat, Durga ToshniwalFirst submitted to…