Summary of iVideoGPT: Interactive VideoGPTs are Scalable World Models, by Jialong Wu et al.
iVideoGPT: Interactive VideoGPTs are Scalable World Models by Jialong Wu, Shaofeng Yin, Ningya Feng, Xu He,…
Defining error accumulation in ML atmospheric simulators by Raghul Parthipan, Mohit Anand, Hannah M. Christensen, J.…
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models by Akide Liu, Jing Liu,…
Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI by Tian…
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention by William Brandon, Mayank Mishra, Aniruddha Nrusimha, Rameswar…
Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale by Shriram Chennakesavalu, Frank…
MambaOut: Do We Really Need Mamba for Vision? by Weihao Yu, Xinchao Wang. First submitted to arXiv…
Multi-Scale Dilated Convolution Network for Long-Term Time Series Forecasting by Feifei Li, Suhan Guo, Feng Han,…
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training by Zexuan Zhong, Mengzhou Xia, Danqi Chen,…
Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs by Sanmitra Ghosh, Paul J. Birrell, Daniela De…