Summary of Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing, by Viet Anh Trinh et al.