Summary of Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models, by Jerry Huang et al.
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models, by Jerry Huang, Prasanna Parthasarathi,…