Summary of On the Token Distance Modeling Ability of Higher RoPE Attention Dimension, by Xiangyu Hong et al.
On the token distance modeling ability of higher RoPE attention dimension by Xiangyu Hong, Che Jiang,…
MKGL: Mastery of a Three-Word Language by Lingbing Guo, Zhongpu Bo, Zhuo Chen, Yichi Zhang, Jiaoyan…
Diversified and Adaptive Negative Sampling on Knowledge Graphs by Ran Liu, Zhongzhou Liu, Xiaoli Li, Hao…
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks by Ziyan Jiang, Rui Meng, Xinyi Yang,…
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension by Ning Wang, Zekun…
Constructing Cloze Questions Generatively by Yicheng Sun, Jie Wang. First submitted to arXiv on: 5 Oct 2024. Categories: Main:…
A Pluggable Common Sense-Enhanced Framework for Knowledge Graph Completion by Guanglin Niu, Bo Li, Siling Feng. First…
Intrinsic Evaluation of RAG Systems for Deep-Logic Questions by Junyi Hu, You Zhou, Jie Wang. First submitted…
PixelBytes: Catching Unified Representation for Multimodal Generation by Fabien Furfaro. First submitted to arXiv on: 16 Sep…
Emotion-Aware Embedding Fusion in LLMs (Flan-T5, LLAMA 2, DeepSeek-R1, and ChatGPT 4) for Intelligent Response…