Summary of Beyond Efficiency: Scaling AI Sustainably, by Carole-Jean Wu et al.
Beyond Efficiency: Scaling AI Sustainably by Carole-Jean Wu, Bilge Acun, Ramya Raghavendra, Kim Hazelwood. First submitted to…
LoCoCo: Dropping In Convolutions for Long Context Compression by Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi…
CMamba: Channel Correlation Enhanced State Space Models for Multivariate Time Series Forecasting by Chaolv Zeng, Zhanyu…
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation by Nachiket Kotalwar, Alkis Gotovos, Adish Singla. First submitted…
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning by Subhojyoti Mukherjee, Josiah…
Progressive Entropic Optimal Transport Solvers by Parnian Kassraie, Aram-Alexandre Pooladian, Michal Klein, James Thornton, Jonathan Niles-Weed,…
Massively Multiagent Minigames for Training Generalist Agents by Kyoung Whan Choe, Ryan Sullivan, Joseph Suárez. First submitted…
Linearization Turns Neural Operators into Function-Valued Gaussian Processes by Emilia Magnani, Marvin Pförtner, Tobias Weber, Philipp…
Optimizing Time Series Forecasting Architectures: A Hierarchical Neural Architecture Search Approach by Difan Deng, Marius Lindauer. First…
SUMIE: A Synthetic Benchmark for Incremental Entity Summarization by Eunjeong Hwang, Yichao Zhou, Beliz Gunel, James…