Summary of Hart: Efficient Visual Generation with Hybrid Autoregressive Transformer, by Haotian Tang et al.
HART: Efficient Visual Generation with Hybrid Autoregressive Transformerby Haotian Tang, Yecheng Wu, Shang Yang, Enze…
HART: Efficient Visual Generation with Hybrid Autoregressive Transformerby Haotian Tang, Yecheng Wu, Shang Yang, Enze…
LVD-2M: A Long-take Video Dataset with Temporally Dense Captionsby Tianwei Xiong, Yuqing Wang, Daquan Zhou,…
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Freeby Ziyue Li, Tianyi ZhouFirst submitted…
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processesby Juan Sebastian…
Regularized Robustly Reliable Learners and Instance Targeted Attacksby Avrim Blum, Donya SalessFirst submitted to arxiv…
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBackby Naman Gupta, Shashank Kirtania, Priyanshu Gupta,…
TopoFR: A Closer Look at Topology Alignment on Face Recognitionby Jun Dan, Yang Liu, Jiankang…
TRESTLE: A Model of Concept Formation in Structured Domainsby Christopher J. MacLellan, Erik Harpstead, Vincent…
Neural networks that overcome classic challenges through practiceby Kazuki Irie, Brenden M. LakeFirst submitted to…
Multi-modal Vision Pre-training for Medical Image Analysisby Shaohao Rui, Lingzhi Chen, Zhenyu Tang, Lilong Wang,…