Summary of Vptq: Extreme Low-bit Vector Post-training Quantization For Large Language Models, by Yifei Liu et al.
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Modelsby Yifei Liu, Jicheng Wen, Yang…
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Modelsby Yifei Liu, Jicheng Wen, Yang…
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQby Marc-Antoine Allard, Matin Ansaripour, Maria Yuffa, Paul…
AXCEL: Automated eXplainable Consistency Evaluation using LLMsby P Aditya Sreekar, Sahil Verma, Suransh Chopra, Sarik…
Dynamic-Width Speculative Beam Decoding for Efficient LLM Inferenceby Zongyue Qin, Zifan He, Neha Prakriya, Jason…
Leveraging Unsupervised Learning for Cost-Effective Visual Anomaly Detectionby Yunbo Long, Zhengyang Ling, Sam Brook, Duncan…
Parse Trees Guided LLM Prompt Compressionby Wenhao Mao, Chengbin Hou, Tianyu Zhang, Xinyu Lin, Ke…
From Text to Treatment Effects: A Meta-Learning Approach to Handling Text-Based Confoundingby Henri Arno, Paloma…
FACET: Fast and Accurate Event-Based Eye Tracking Using Ellipse Modeling for Extended Realityby Junyuan Ding,…
Benchmarking Edge AI Platforms for High-Performance ML Inferenceby Rakshith Jayanth, Neelesh Gupta, Viktor PrasannaFirst submitted…
A-VL: Adaptive Attention for Large Vision-Language Modelsby Junyang Zhang, Mu Yuan, Ruiguang Zhong, Puhan Luo,…