Summary of A Declarative System for Optimizing AI Workloads, by Chunwei Liu et al.
A Declarative System for Optimizing AI Workloads by Chunwei Liu, Matthew Russo, Michael Cafarella, Lei Cao,…
StyleX: A Trainable Metric for X-ray Style Distances by Dominik Eckert, Christopher Syben, Christian Hümmer, Ludwig…
PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers Inference by Jiarui Fang, Jinzhe Pan, Jiannan Wang, Aoyu…
Exploring the use of a Large Language Model for data extraction in systematic reviews: a…
Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation by…
Real Time Deep Learning Weapon Detection Techniques for Mitigating Lone Wolf Attacks by Kambhatla Akhila, Khaled…
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning by Zhenyu Wei, Yujie He, Zhanchuan Cai First…
DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction with Slot Querying by Guanghui Wang,…
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models by Jingwei Xu, Junyu Lai, Yunpeng Huang First submitted…
A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models by Mahsa Khoshnoodi, Vinija Jain,…