Summary of Efficiently Dispatching Flash Attention For Partially Filled Attention Masks, by Agniv Sharma and Jonas Geiping
Efficiently Dispatching Flash Attention For Partially Filled Attention Masksby Agniv Sharma, Jonas GeipingFirst submitted to…
Efficiently Dispatching Flash Attention For Partially Filled Attention Masksby Agniv Sharma, Jonas GeipingFirst submitted to…
DiffFluid: Plain Diffusion Models are Effective Predictors of Flow Dynamicsby Dongyu Luo, Jianyu Wu, Jing…
Machine Translation with Large Language Models: Decoder Only vs. Encoder-Decoderby Abhinav P.M., SujayKumar Reddy M, Oswald…
Enhancing E-commerce Product Title Translation with Retrieval-Augmented Generation and Large Language Modelsby Bryan Zhang, Taichi…
Latent Diffusion Models for Controllable RNA Sequence Generationby Kaixuan Huang, Yukang Yang, Kaidi Fu, Yanyi…
ProcessTBench: An LLM Plan Generation Dataset for Process Miningby Andrei Cosmin Redis, Mohammadreza Fani Sani,…
Current Symmetry Group Equivariant Convolution Frameworks for Representation Learningby Ramzan Basheer, Deepak MishraFirst submitted to…
Chain-of-Translation Prompting (CoTR): A Novel Prompting Technique for Low Resource Languagesby Tejas Deshpande, Nidhi Kowtal,…
A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representationsby Nidhi…
Relative-Translation Invariant Wasserstein Distanceby Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng…