Summary of GeckOpt: LLM System Efficiency via Intent-Based Tool Selection, by Michael Fore et al.
GeckOpt: LLM System Efficiency via Intent-Based Tool Selection, by Michael Fore, Simranjit Singh, Dimitrios Stamoulis. First submitted…
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model, by Xu Han,…
3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification, by Shyam Varahagiri, Aryaman Sinha, Shiv Ram Dubey,…
Empowering Interdisciplinary Research with BERT-Based Models: An Approach Through SciBERT-CNN with Topic Modeling, by Darya Likhareva,…
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes, by Asaf…
Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs, by Woomin Song, Seunghyuk Oh, Sangwoo…
Language Model Cascades: Token-level uncertainty and beyond, by Neha Gupta, Harikrishna Narasimhan, Wittawat Jitkrittum, Ankit Singh…
Exploring and Improving Drafts in Blockwise Parallel Decoding, by Taehyeon Kim, Ananda Theertha Suresh, Kishore Papineni,…
On Speculative Decoding for Multimodal Large Language Models, by Mukul Gagrani, Raghavv Goel, Wonseok Jeon, Junyoung…
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models, by Donghyun Lee, Je-Yong Lee, Genghan Zhang,…