Summary of A Multi-task Role-playing Agent Capable Of Imitating Character Linguistic Styles, by Siyuan Chen et al.
A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Stylesby Siyuan Chen, Qingyi Si, Chenxu…
A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Stylesby Siyuan Chen, Qingyi Si, Chenxu…
RS-MoE: A Vision-Language Model with Mixture of Experts for Remote Sensing Image Captioning and Visual…
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Modelsby Minki Kang, Sung Ju…
Nearest Neighbor Normalization Improves Multimodal Retrievalby Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah…
BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inferenceby Junqi Zhao,…
LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Modelsby Hieu…
RealCQA-V2 : Visual Premise Proving A Manual COT Dataset for Chartsby Saleem Ahmed, Ranga Setlur,…
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chartby Bowen Zhao, Tianhao Cheng, Yuejie…
Enhancing Financial Question Answering with a Multi-Agent Reflection Frameworkby Sorouralsadat Fatemi, Yuheng HuFirst submitted to…
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?by Han Bao, Yue Huang, Yanbo Wang, Jiayi Ye,…