Summary of Increasing Transformer Token Length with a Maximum Entropy Principle Method, by R. I. Cukier
Increasing transformer token length with a Maximum Entropy Principle Methodby R. I. CukierFirst submitted to…
Increasing transformer token length with a Maximum Entropy Principle Methodby R. I. CukierFirst submitted to…
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inferenceby Shuzhang Zhong, Ling Liang,…
BatGPT-Chem: A Foundation Large Model For Retrosynthesis Predictionby Yifei Yang, Runhan Shi, Zuchao Li, Shu…
Preference-Optimized Pareto Set Learning for Blackbox Optimizationby Zhang Haishan, Diptesh Das, Koji TsudaFirst submitted to…
Uniting contrastive and generative learning for event sequences modelsby Aleksandr Yugay, Alexey ZaytsevFirst submitted to…
The Fairness-Quality Trade-off in Clusteringby Rashida Hakim, Ana-Andreea Stoica, Christos H. Papadimitriou, Mihalis YannakakisFirst submitted…
Unlocking the Power of LSTM for Long Term Time Series Forecastingby Yaxuan Kong, Zepu Wang,…
PinnDE: Physics-Informed Neural Networks for Solving Differential Equationsby Jason Matthews, Alex BihloFirst submitted to arxiv…
Exploiting Fine-Grained Prototype Distribution for Boosting Unsupervised Class Incremental Learningby Jiaming Liu, Hongyuan Liu, Zhili…
Facial Wrinkle Segmentation for Cosmetic Dermatology: Pretraining with Texture Map-Based Weak Supervisionby Junho Moon, Haejun…