Summary of Exploring the Benefit Of Activation Sparsity in Pre-training, by Zhengyan Zhang et al.
Exploring the Benefit of Activation Sparsity in Pre-trainingby Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai…
Exploring the Benefit of Activation Sparsity in Pre-trainingby Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai…
Variational Bayes Gaussian Splattingby Toon Van de Maele, Ozan Catal, Alexander Tschantz, Christopher L. Buckley,…
AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularityby Zhibin Lan, Liqiang Niu, Fandong Meng,…
Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inferenceby Pedro Colon-Hernandez, Nanxi…
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1by…
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Modelsby Pouyan Navard, Amin Karimi Monsefi,…
Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modelingby Xiang Hu, Zhihao Teng, Jun…
Auto-Demo Prompting: Leveraging Generated Outputs as Demonstrations for Enhanced Batch Promptingby Longyu Feng, Mengze Hong,…
GERA: Geometric Embedding for Efficient Point Registration Analysisby Geng Li, Haozhi Cao, Mingyang Liu, Shenghai…
Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learningby Junlin Lu, Patrick Mannion, Karl MasonFirst submitted…