Paper List

We recommend you use the search box as this list is very long.

Summary of Giebench: Towards Holistic Evaluation Of Group Identity-based Empathy For Large Language Models, by Leyan Wang et al.
Summary of Towards Retrieval Augmented Generation Over Large Video Libraries, by Yannis Tevissen et al.
Summary of Ceasefire: An Ai-powered System For Combatting Illicit Firearms Trafficking, by Jorgen Cani et al.
Summary of Artificial Leviathan: Exploring Social Evolution Of Llm Agents Through the Lens Of Hobbesian Social Contract Theory, by Gordon Dai et al.
Summary of Control When Confidence Is Costly, by Itzel Olivos-castillo et al.
Summary of Apeer: Automatic Prompt Engineering Enhances Large Language Model Reranking, by Can Jin et al.
Summary of Futurenet-lof: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding, by Mingkun Wang et al.
Summary of Rewarding What Matters: Step-by-step Reinforcement Learning For Task-oriented Dialogue, by Huifang Du et al.
Summary of Learning Telic-controllable State Representations, by Nadav Amir et al.
Summary of Safesora: Towards Safety Alignment Of Text2video Generation Via a Human Preference Dataset, by Josef Dai et al.
Summary of Proceedings Of the Second International Workshop on Explainable Ai For the Arts (xaixarts), by Nick Bryan-kinns et al.
Summary of Evidence Of a Log Scaling Law For Political Persuasion with Large Language Models, by Kobi Hackenburg et al.
Summary of V-lasik: Consistent Glasses-removal From Videos Using Synthetic Data, by Rotem Shalev-arkushin et al.
Summary of Solving a Stackelberg Game on Transportation Networks in a Dynamic Crime Scenario: a Mixed Approach on Multi-layer Networks, by Sukanya Samanta et al.
Summary of Graphreader: Building Graph-based Agent to Enhance Long-context Abilities Of Large Language Models, by Shilong Li et al.
Summary of Whiteboard-of-thought: Thinking Step-by-step Across Modalities, by Sachit Menon and Richard Zemel and Carl Vondrick
Summary of Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal, by Tinghao Xie et al.
Summary of Holistic Evaluation For Interleaved Text-and-image Generation, by Minqian Liu et al.
Summary of Can Llms Learn by Teaching For Better Reasoning? a Preliminary Study, By Xuefei Ning et al.
Summary of Speech Prefix-tuning with Rnnt Loss For Improving Llm Predictions, by Murali Karthick Baskar et al.
Summary of Multiagent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations Via Debate, by Alfonso Amayuelas et al.
Summary of Do Llms Have Distinct and Consistent Personality? Trait: Personality Testset Designed For Llms with Psychometrics, by Seungbeen Lee et al.
Summary of Does Gpt Really Get It? a Hierarchical Scale to Quantify Human Vs Ai’s Understanding Of Algorithms, by Mirabel Reid et al.
Summary of Raising the Bar: Investigating the Values Of Large Language Models Via Generative Evolving Testing, by Han Jiang et al.
Summary of Citynav: Language-goal Aerial Navigation Dataset with Geographic Information, by Jungdae Lee et al.
Summary of The Impact Of Ai on Perceived Job Decency and Meaningfulness: a Case Study, by Kuntal Ghosh et al.
Summary of On the Evaluation Practices in Multilingual Nlp: Can Machine Translation Offer An Alternative to Human Translations?, by Rochelle Choenni et al.
Summary of Step-back Profiling: Distilling User History For Personalized Scientific Writing, by Xiangru Tang et al.
Summary of Qpaug: Question and Passage Augmentation For Open-domain Question Answering Of Llms, by Minsang Kim et al.
Summary of Q*: Improving Multi-step Reasoning For Llms with Deliberative Planning, by Chaojie Wang et al.
Summary of Learning to Plan For Retrieval-augmented Large Language Models From Knowledge Graphs, by Junjie Wang et al.
Summary of Vaiyakarana : a Benchmark For Automatic Grammar Correction in Bangla, by Pramit Bhattacharyya and Arnab Bhattacharya
Summary of Ai in Space For Scientific Missions: Strategies For Minimizing Neural-network Model Upload, by Jonah Ekelund et al.
Summary of Cross-level Requirement Traceability: a Novel Approach Integrating Bag-of-words and Word Embedding For Enhanced Similarity Functionality, by Baher Mohammad et al.
Summary of Infusing Clinical Knowledge Into Tokenisers For Language Models, by Abul Hasan et al.
Summary of Iterative Repair with Weak Verifiers For Few-shot Transfer in Kbqa with Unanswerability, by Riya Sawhney et al.
Summary of Livemind: Low-latency Large Language Models with Simultaneous Inference, by Chuangtao Chen and Grace Li Zhang and Xunzhao Yin and Cheng Zhuo and Ulf Schlichtmann and Bing Li
Summary of Self-supervised Interpretable Concept-based Models For Text Classification, by Francesco De Santis et al.
Summary of Identifying User Goals From Ui Trajectories, by Omri Berkovitch et al.
Summary of Exploring Spatial Representations in the Historical Lake District Texts with Llm-based Relation Extraction, by Erum Haris et al.
Summary of Iwisdm: Assessing Instruction Following in Multimodal Models at Scale, by Xiaoxuan Lei and Lucas Gomez and Hao Yuan Bai and Pouya Bashivan
Summary of Robustness Analysis Of Ai Models in Critical Energy Systems, by Pantelis Dogoulis et al.
Summary of Posebench: Benchmarking the Robustness Of Pose Estimation Models Under Corruptions, by Sihan Ma et al.
Summary of Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective, By Yuchen Wen et al.
Summary of How to Design a Dataset Compliant with An Ml-based System Odd?, by Cyril Cappi et al.
Summary of Cryptogpt: a 7b Model Rivaling Gpt-4 in the Task Of Analyzing and Classifying Real-time Financial News, by Ying Zhang et al.
Summary of Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models, by Sherzod Hakimov and Yerkezhan Abdullayeva and Kushal Koshti and Antonia Schmidt and Yan Weiser and Anne Beyer and David Schlangen
Summary of How Many Parameters Does It Take to Change a Light Bulb? Evaluating Performance in Self-play Of Conversational Games As a Function Of Model Characteristics, by Nidhir Bhavsar and Jonathan Jordan and Sherzod Hakimov and David Schlangen
Summary of Optimizing Speculative Decoding For Serving Large Language Models Using Goodput, by Xiaoxuan Liu et al.
Summary of Emotion-aware Personalized Music Recommendation with a Heterogeneity-aware Deep Bayesian Network, by Erkang Jing et al.
Summary of Two-stage Depth Enhanced Learning with Obstacle Map For Object Navigation, by Yanwei Zheng et al.
Summary of Eduqate: Generating Adaptive Curricula Through Rmabs in Education Settings, by Sidney Tio et al.
Summary of Easyecr: a Library For Easy Implementation and Evaluation Of Event Coreference Resolution Models, by Yuncong Li et al.
Summary of Enhancing Monotonic Modeling with Spatio-temporal Adaptive Awareness in Diverse Marketing, by Bin Li et al.
Summary of A Data-driven Guided Decoding Mechanism For Diagnostic Captioning, by Panagiotis Kaliosis et al.
Summary of Ranking Llms by Compression, By Peijia Guo et al.
Summary of Simulseamless: Fbk at Iwslt 2024 Simultaneous Speech Translation, by Sara Papi and Marco Gaido and Matteo Negri and Luisa Bentivogli
Summary of Timo: Towards Better Temporal Reasoning For Language Models, by Zhaochen Su et al.
Summary of Vlbiasbench: a Comprehensive Benchmark For Evaluating Bias in Large Vision-language Model, by Sibo Wang et al.
Summary of Secokd: Aligning Large Language Models For In-context Learning with Fewer Shots, by Weixing Wang et al.
Summary of Reveal-it: Reinforcement Learning with Visibility Of Evolving Agent Policy For Interpretability, by Shuang Ao et al.
Summary of Evoagent: Towards Automatic Multi-agent Generation Via Evolutionary Algorithms, by Siyu Yuan et al.
Summary of Proving Olympiad Algebraic Inequalities Without Human Demonstrations, by Chenrui Wei et al.
Summary of Through the Theory Of Mind’s Eye: Reading Minds with Multimodal Video Large Language Models, by Zhawnen Chen et al.
Summary of Alanavlm: a Multimodal Embodied Ai Foundation Model For Egocentric Video Understanding, by Alessandro Suglia et al.
Summary of Heterogeneous Graph Neural Networks with Post-hoc Explanations For Multi-modal and Explainable Land Use Inference, by Xuehao Zhai et al.
Summary of Stackrag Agent: Improving Developer Answers with Retrieval-augmented Generation, by Davit Abrahamyan et al.
Summary of Generative Ai Misuse: a Taxonomy Of Tactics and Insights From Real-world Data, by Nahema Marchal et al.
Summary of A Large-scale Multicenter Breast Cancer Dce-mri Benchmark Dataset with Expert Segmentations, by Lidia Garrucho et al.
Summary of A Pure Transformer Pretraining Framework on Text-attributed Graphs, by Yu Song et al.
Summary of Knowledge Graph-enhanced Large Language Models Via Path Selection, by Haochen Liu et al.
Summary of Clinicallab: Aligning Agents For Multi-departmental Clinical Diagnostics in the Real World, by Weixiang Yan et al.
Summary of Spl: a Socratic Playground For Learning Powered by Large Language Model, By Liang Zhang et al.
Summary of Knowledge Tagging System on Math Questions Via Llms with Flexible Demonstration Retriever, by Hang Li et al.
Summary of Dpo: Dual-perturbation Optimization For Test-time Adaptation in 3d Object Detection, by Zhuoxiao Chen et al.
Summary of Pin: a Knowledge-intensive Dataset For Paired and Interleaved Multimodal Documents, by Junjie Wang et al.
Summary of Reasoning Like a Doctor: Improving Medical Dialogue Systems Via Diagnostic Reasoning Process Alignment, by Kaishuai Xu et al.
Summary of Aspirinsum: An Aspect-based Utility-preserved De-identification Summarization Framework, by Ya-lun Li
Summary of Genderalign: An Alignment Dataset For Mitigating Gender Bias in Large Language Models, by Tao Zhang et al.
Summary of Research on Flight Accidents Prediction Based Back Propagation Neural Network, by Haoxing Liu et al.
Summary of Autopal: Autonomous Adaptation to Users For Personal Ai Companionship, by Yi Cheng et al.
Summary of Mr-ben: a Meta-reasoning Benchmark For Evaluating System-2 Thinking in Llms, by Zhongshen Zeng et al.
Summary of Seeing Through Ai’s Lens: Enhancing Human Skepticism Towards Llm-generated Fake News, by Navid Ayoobi et al.
Summary of Mitigating Social Biases in Language Models Through Unlearning, by Omkar Dige et al.
Summary of Bild: Bi-directional Logits Difference Loss For Large Language Model Distillation, by Minchong Li et al.
Summary of Enhancing Travel Choice Modeling with Large Language Models: a Prompt-learning Approach, by Xuehao Zhai et al.
Summary of Is Ai Fun? Humordb: a Curated Dataset and Benchmark to Investigate Graphical Humor, by Veedant Jain and Felipe Dos Santos Alves Feitosa and Gabriel Kreiman
Summary of Trapezoidal Gradient Descent For Effective Reinforcement Learning in Spiking Networks, by Yuhao Pan et al.
Summary of Codreamer: Communication-based Decentralised World Models, by Edan Toledo et al.
Summary of Optimizing Psychological Counseling with Instruction-tuned Large Language Models, by Wenjie Li et al.
Summary of Enhance the Image: Super Resolution Using Artificial Intelligence in Mri, by Ziyu Li et al.
Summary of Fine-tuning Gemma-7b For Enhanced Sentiment Analysis Of Financial News Headlines, by Kangtong Mo et al.
Summary of Stability and Generalizability in Sde Diffusion Models with Measure-preserving Dynamics, by Weitong Zhang et al.
Summary of Towards Minimal Targeted Updates Of Language Models with Targeted Negative Training, by Lily H. Zhang and Rajesh Ranganath and Arya Tafvizi
Summary of Leveraging Large Language Models For Patient Engagement: the Power Of Conversational Ai in Digital Health, by Bo Wen et al.
Summary of Root-kgd: a Novel Framework For Root Cause Diagnosis Based on Knowledge Graph and Industrial Data, by Jiyu Chen et al.
Summary of Intcoop: Interpretability-aware Vision-language Prompt Tuning, by Soumya Suvra Ghosal et al.
Summary of Development Of a Dual-input Neural Model For Detecting Ai-generated Imagery, by Jonathan Gallagher and William Pugsley
Summary of From Single Agent to Multi-agent: Improving Traffic Signal Control, by Maksim Tislenko and Dmitrii Kisilev
Summary of Multilingual De-duplication Strategies: Applying Scalable Similarity Search with Monolingual & Multilingual Embedding Models, by Stefan Pasch et al.