Summary of Unified Text-to-image Generation and Retrieval, by Leigang Qu et al.
Unified Text-to-Image Generation and Retrievalby Leigang Qu, Haochuan Li, Tan Wang, Wenjie Wang, Yongqi Li,…
Unified Text-to-Image Generation and Retrievalby Leigang Qu, Haochuan Li, Tan Wang, Wenjie Wang, Yongqi Li,…
Lifelong Learning of Video Diffusion Models From a Single Video Streamby Jason Yoo, Yingchen He,…
Multifidelity digital twin for real-time monitoring of structural dynamics in aquaculture net cagesby Eirini Katsidoniotaki,…
Simplified and Generalized Masked Diffusion for Discrete Databy Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud…
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Databy Jingyang Ou, Shen…
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributionsby Liyi Zhang, Michael Y. Li,…
Block Transformer: Global-to-Local Language Modeling for Fast Inferenceby Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik…
Pretrained Mobility Transformer: A Foundation Model for Human Mobilityby Xinhua Wu, Haoyu He, Yanchao Wang,…
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgettingby Suraj Anand, Michael…
Arbitrary-Length Generalization for Addition in a Tiny Transformerby Alexandre Galvao PatriotaFirst submitted to arxiv on:…