Summary of Transformer Tricks: Removing Weights For Skipless Transformers, by Nils Graef
Transformer tricks: Removing weights for skipless transformersby Nils GraefFirst submitted to arxiv on: 18 Apr…
Transformer tricks: Removing weights for skipless transformersby Nils GraefFirst submitted to arxiv on: 18 Apr…
Sketch-guided Image Inpainting with Partial Discrete Diffusion Processby Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand…
QGen: On the Ability to Generalize in Quantization Aware Trainingby MohammadHossein AskariHemmat, Ahmadreza Jeddi, Reyhane…
GenFighter: A Generative and Evolutive Textual Attack Removalby Md Athikul Islam, Edoardo Serra, Sushil JajodiaFirst…
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memoryby Zicheng Liu, Li Wang, Siyuan…
Function Approximation for Reinforcement Learning Controller for Energy from Spread Wavesby Soumyendu Sarkar, Vineet Gundecha,…
BayesJudge: Bayesian Kernel Language Modelling with Confidence Uncertainty in Legal Judgment Predictionby Ubaid Azam, Imran…
AGHINT: Attribute-Guided Representation Learning on Heterogeneous Information Networks with Transformerby Jinhui Yuan, Shan Lu, Peibo…
SparseDM: Toward Sparse Efficient Diffusion Modelsby Kafeng Wang, Jianfei Chen, He Li, Zhenpeng Mi, Jun…
Advancing Long-Term Multi-Energy Load Forecasting with Patchformer: A Patch and Transformer-Based Approachby Qiuyi Hong, Fanlin…