Summary of Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era Of Large Language Models, by Shubham Kumar Nigam et al.
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Modelsby…
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Modelsby…
FormalAlign: Automated Alignment Evaluation for Autoformalizationby Jianqiao Lu, Yingjia Wan, Yinya Huang, Jing Xiong, Zhengying…
Can In-context Learning Really Generalize to Out-of-distribution Tasks?by Qixun Wang, Yifei Wang, Yisen Wang, Xianghua…
Diagnosing Robotics Systems Issues with Large Language Modelsby Jordis Emilia Herrmann, Aswath Mandakath Gopinath, Mikael…
Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT…
Benchmarking Agentic Workflow Generationby Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong…
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiencyby Preferred Elements, Kenshin Abe, Kaizaburo Chubachi,…
SAGE: Scalable Ground Truth Evaluations for Large Sparse Autoencodersby Constantin Venhoff, Anisoara Calinescu, Philip Torr,…
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Timeby Yi Ding, Bolian…
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple…