Summary of On the Token Distance Modeling Ability Of Higher Rope Attention Dimension, by Xiangyu Hong et al.
On the token distance modeling ability of higher RoPE attention dimensionby Xiangyu Hong, Che Jiang,…
On the token distance modeling ability of higher RoPE attention dimensionby Xiangyu Hong, Che Jiang,…
Diversified and Adaptive Negative Sampling on Knowledge Graphsby Ran Liu, Zhongzhou Liu, Xiaoli Li, Hao…
MKGL: Mastery of a Three-Word Languageby Lingbing Guo, Zhongpu Bo, Zhuo Chen, Yichi Zhang, Jiaoyan…
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasksby Ziyan Jiang, Rui Meng, Xinyi Yang,…
A Pluggable Common Sense-Enhanced Framework for Knowledge Graph Completionby Guanglin Niu, Bo Li, Siling FengFirst…
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extensionby Ning Wang, Zekun…
Constructing Cloze Questions Generativelyby Yicheng Sun, Jie WangFirst submitted to arxiv on: 5 Oct 2024CategoriesMain:…
Intrinsic Evaluation of RAG Systems for Deep-Logic Questionsby Junyi Hu, You Zhou, Jie WangFirst submitted…
PixelBytes: Catching Unified Representation for Multimodal Generationby Fabien FurfaroFirst submitted to arxiv on: 16 Sep…
Emotion-Aware Embedding Fusion in LLMs (Flan-T5, LLAMA 2, DeepSeek-R1, and ChatGPT 4) for Intelligent Response…