Summary of R-judge: Benchmarking Safety Risk Awareness For Llm Agents, by Tongxin Yuan et al.
R-Judge: Benchmarking Safety Risk Awareness for LLM Agentsby Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming…
R-Judge: Benchmarking Safety Risk Awareness for LLM Agentsby Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming…
xCoT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoningby Linzheng Chai, Jian Yang, Tao Sun, Hongcheng…
An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistantby Mohit Tomar, Abhisek Tiwari,…
PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas Hold’em via Large Language Modelby Chenghao Huang,…
Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learningby Shuai Zhao, Meihuizi Jia,…
Chain of History: Learning and Forecasting with LLMs for Temporal Knowledge Graph Completionby Ruilin Luo,…
POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translationby Shilong Pan, Zhiliang…
Designing Heterogeneous LLM Agents for Financial Sentiment Analysisby Frank XingFirst submitted to arxiv on: 11…
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languagesby Zhuoyuan Mao,…
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talkby Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Justin Sun,…