Summary of Tokenunify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction, by Yinda Chen et al.
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Predictionby Yinda Chen, Haoyuan Shi, Xiaoyu Liu,…
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Predictionby Yinda Chen, Haoyuan Shi, Xiaoyu Liu,…
Exploring the LLM Journey from Cognition to Expression with Linear Representationsby Yuzi Yan, Jialian Li,…
Position: Foundation Agents as the Paradigm Shift for Decision Makingby Xiaoqian Liu, Xingzhou Lou, Jianbin…
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Modelsby Young Kyun Jang, Ser-nam LimFirst submitted to…
360Zhinao Technical Reportby 360Zhinao TeamFirst submitted to arxiv on: 22 May 2024CategoriesMain: Computation and Language…
A survey on fairness of large language models in e-commerce: progress, application, and challengeby Qingyang…
Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systemsby Shengxiang Sun, Shenzhe ZhuFirst submitted to arxiv…
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretrainingby Dawei Feng, Yihai Zhang, Zhixuan XuFirst…
Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysisby Alexandre Englebert, Anne-Sophie Collin,…
LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Languageby Cagri ToramanFirst submitted to arxiv…