Paper List
We recommend you use the search box as this list is very long.
-
Summary of Benchmarking Vision Language Models For Cultural Understanding, by Shravan Nayak et al.
-
Summary of Spider2-v: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?, by Ruisheng Cao et al.
-
Summary of Ref-avs: Refer and Segment Objects in Audio-visual Scenes, by Yaoting Wang et al.
-
Summary of Make-an-agent: a Generalizable Policy Network Generator with Behavior-prompted Diffusion, by Yongyuan Liang et al.
-
Summary of Building Artificial Intelligence with Creative Agency and Self-hood, by Liane Gabora and Joscha Bach
-
Summary of Do Large Language Models Understand Verbal Indicators Of Romantic Attraction?, by Sandra C. Matz et al.
-
Summary of Medbench: a Comprehensive, Standardized, and Reliable Benchmarking System For Evaluating Chinese Medical Large Language Models, by Mianxin Liu et al.
-
Summary of Lionguard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content, by Jessica Foo and Shaun Khoo
-
Summary of Talec: Teach Your Llm to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot, By Kaiqi Zhang et al.
-
Summary of Visualization Literacy Of Multimodal Large Language Models: a Comparative Study, by Zhimin Li et al.
-
Summary of Autonomous Prompt Engineering in Large Language Models, by Daan Kepel et al.
-
Summary of Ragbench: Explainable Benchmark For Retrieval-augmented Generation Systems, by Robert Friel et al.
-
Summary of The Good, the Bad, and the Greedy: Evaluation Of Llms Should Not Ignore Non-determinism, by Yifan Song et al.
-
Summary of Kinetic Typography Diffusion Model, by Seonmi Park et al.
-
Summary of Ideal: Leveraging Infinite and Dynamic Characterizations Of Large Language Models For Query-focused Summarization, by Jie Cao et al.
-
Summary of How and Where Does Clip Process Negation?, by Vincent Quantmeyer and Pablo Mosteiro and Albert Gatt
-
Summary of Tcm-ftp: Fine-tuning Large Language Models For Herbal Prescription Prediction, by Xingzhi Zhou et al.
-
Summary of An Experimental Evaluation Of Siamese Neural Networks For Robot Localization Using Omnidirectional Imaging in Indoor Environments, by J.j.cabrera et al.
-
Summary of 3d Geometric Shape Assembly Via Efficient Point Cloud Matching, by Nahyuk Lee et al.
-
Summary of Understanding the Dependence Of Perception Model Competency on Regions in An Image, by Sara Pohland and Claire Tomlin
-
Summary of Leveraging Hybrid Intelligence Towards Sustainable and Energy-efficient Machine Learning, by Daniel Geissler et al.
-
Summary of Boosting Zero-shot Crosslingual Performance Using Llm-based Augmentations with Effective Data Selection, by Barah Fazili et al.
-
Summary of An Evaluation Of Cnn Models and Data Augmentation Techniques in Hierarchical Localization Of Mobile Robots, by J.j. Cabrera et al.
-
Summary of Prompt Selection Matters: Enhancing Text Annotations For Social Sciences with Large Language Models, by Louis Abraham et al.
-
Summary of An Empirical Study Of Validating Synthetic Data For Formula Generation, by Usneek Singh et al.
-
Summary of Xeq Scale For Evaluating Xai Experience Quality, by Anjana Wijekoon et al.
-
Summary of Enhancing Retrieval and Managing Retrieval: a Four-module Synergy For Improved Quality and Efficiency in Rag Systems, by Yunxiao Shi et al.
-
Summary of Qwen2 Technical Report, by An Yang et al.
-
Summary of Addressing Image Hallucination in Text-to-image Generation Through Factual Image Retrieval, by Youngsun Lim and Hyunjung Shim
-
Summary of Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning, by Yulong Wang et al.
-
Summary of When Synthetic Traces Hide Real Content: Analysis Of Stable Diffusion Image Laundering, by Sara Mandelli et al.
-
Summary of Clave: An Adaptive Framework For Evaluating Values Of Llm Generated Responses, by Jing Yao et al.
-
Summary of Autograms: Autonomous Graphical Agent Modeling Software, by Ben Krause et al.
-
Summary of Learning to Refuse: Towards Mitigating Privacy Risks in Llms, by Zhenhua Liu et al.
-
Summary of Rapid Biomedical Research Classification: the Pandemic Pact Advanced Categorisation Engine, by Omid Rohanian et al.
-
Summary of Hierarchical Multi-modal Transformer For Cross-modal Long Document Classification, by Tengfei Liu et al.
-
Summary of Look Within, Why Llms Hallucinate: a Causal Perspective, by He Li and Haoang Chi and Mingyu Liu and Wenjing Yang
-
Summary of Chatlogic: Integrating Logic Programming with Large Language Models For Multi-step Reasoning, by Zhongsheng Wang et al.
-
Summary of Key-point-driven Mathematical Reasoning Distillation Of Large Language Model, by Xunyu Zhu et al.
-
Summary of Shape2scene: 3d Scene Representation Learning Through Pre-training on Shape Data, by Tuo Feng et al.
-
Summary of Visual Prompt Selection For In-context Learning Segmentation, by Wei Suo et al.
-
Summary of Cross-lingual Multi-hop Knowledge Editing, by Aditi Khandelwal et al.
-
Summary of Alphadou: High-performance End-to-end Doudizhu Ai Integrating Bidding, by Chang Lei et al.
-
Summary of Lab-bench: Measuring Capabilities Of Language Models For Biology Research, by Jon M. Laurent et al.
-
Summary of An Empirical Study Of Mamba-based Pedestrian Attribute Recognition, by Xiao Wang et al.
-
Summary of Ntsebench: Cognitive Reasoning Benchmark For Vision Language Models, by Pranshu Pandya et al.
-
Summary of Cooperative Reward Shaping For Multi-agent Pathfinding, by Zhenyu Song et al.
-
Summary of Melon Fruit Detection and Quality Assessment Using Generative Ai-based Image Data Augmentation, by Seungri Yoon et al.
-
Summary of Expanding the Scope: Inductive Knowledge Graph Reasoning with Multi-starting Progressive Propagation, by Zhoutian Shao and Yuanning Cui and Wei Hu
-
Summary of A Multi-stage Framework For 3d Individual Tooth Segmentation in Dental Cbct, by Chunshi Wang et al.
-
Summary of Backdoor Attacks Against Image-to-image Networks, by Wenbo Jiang and Hongwei Li and Jiaming He and Rui Zhang and Guowen Xu and Tianwei Zhang and Rongxing Lu
-
Summary of Explainable Image Captioning Using Cnn- Cnn Architecture and Hierarchical Attention, by Rishi Kesav Mohan et al.
-
Summary of Don’t Fear Peculiar Activation Functions: Euaf and Beyond, by Qianchao Wang (1) et al.
-
Summary of Diagnosing and Re-learning For Balanced Multimodal Learning, by Yake Wei and Siwei Li and Ruoxuan Feng and Di Hu
-
Summary of Iccv23 Visual-dialog Emotion Explanation Challenge: Seu_309 Team Technical Report, by Yixiao Yuan and Yingzhe Peng
-
Summary of Layout-and-retouch: a Dual-stage Framework For Improving Diversity in Personalized Image Generation, by Kangyeol Kim et al.
-
Summary of Preserving the Privacy Of Reward Functions in Mdps Through Deception, by Shashank Reddy Chirra et al.
-
Summary of Building Pre-train Llm Dataset For the Indic Languages: a Case Study on Hindi, by Shantipriya Parida and Shakshi Panwar and Kusum Lata and Sanskruti Mishra and Sambit Sekhar
-
Summary of Cellagent: An Llm-driven Multi-agent Framework For Automated Single-cell Data Analysis, by Yihang Xiao et al.
-
Summary of Nativqa: Multilingual Culturally-aligned Natural Query For Llms, by Md. Arid Hasan et al.
-
Summary of Towards Systematic Monolingual Nlp Surveys: Gena Of Greek Nlp, by Juli Bakagianni et al.
-
Summary of Sefi-cd: a Semantic First Change Detection Paradigm That Can Detect Any Change You Want, by Ling Zhao et al.
-
Summary of Farfetched: Entity-centric Reasoning and Claim Validation For the Greek Language Based on Textually Represented Environments, by Dimitris Papadopoulos et al.
-
Summary of Wojoodner 2024: the Second Arabic Named Entity Recognition Shared Task, by Mustafa Jarrar et al.
-
Summary of Learning Online Scale Transformation For Talking Head Video Generation, by Fa-ting Hong et al.
-
Summary of Characterizing Disparity Between Edge Models and High-accuracy Base Models For Vision Tasks, by Zhenyu Wang et al.
-
Summary of Causality Extraction From Medical Text Using Large Language Models (llms), by Seethalakshmi Gopalakrishnan et al.
-
Summary of Lean-star: Learning to Interleave Thinking and Proving, by Haohan Lin et al.
-
Summary of Document-level Clinical Entity and Relation Extraction Via Knowledge Base-guided Generation, by Kriti Bhattarai et al.
-
Summary of Atomagents: Alloy Design and Discovery Through Physics-aware Multi-modal Multi-agent Artificial Intelligence, by Alireza Ghafarollahi and Markus J. Buehler
-
Summary of Constrained Intrinsic Motivation For Reinforcement Learning, by Xiang Zheng et al.
-
Summary of Predicting and Understanding Human Action Decisions: Insights From Large Language Models and Cognitive Instance-based Learning, by Thuy Ngoc Nguyen et al.
-
Summary of Instruction Following with Goal-conditioned Reinforcement Learning in Virtual Environments, by Zoya Volovikova et al.
-
Summary of Dahrs: Divergence-aware Hallucination-remediated Srl Projection, by Sangpil Youm et al.
-
Summary of Sina at Fignews 2024: Multilingual Datasets Annotated with Bias and Propaganda, by Lina Duaibes et al.
-
Summary of Is Contrasting All You Need? Contrastive Learning For the Detection and Attribution Of Ai-generated Text, by Lucio La Cava et al.
-
Summary of Gavel: Generating Games Via Evolution and Language Models, by Graham Todd et al.
-
Summary of Spiqa: a Dataset For Multimodal Question Answering on Scientific Papers, by Shraman Pramanick et al.
-
Summary of Fairylandai: Personalized Fairy Tales Utilizing Chatgpt and Dalle-3, by Georgios Makridis et al.
-
Summary of Muscle: a Model Update Strategy For Compatible Llm Evolution, by Jessica Echterhoff et al.
-
Summary of Is Gpt-4 Conscious?, by Izak Tait et al.
-
Summary of 1-lipschitz Neural Distance Fields, by Guillaume Coiffier and Louis Bethune
-
Summary of Putting Gpt-4o to the Sword: a Comprehensive Evaluation Of Language, Vision, Speech, and Multimodal Proficiency, by Sakib Shahriar et al.
-
Summary of Towards Llm-powered Ambient Sensor Based Multi-person Human Activity Recognition, by Xi Chen (m-psi) et al.
-
Summary of Optimization Of Autonomous Driving Image Detection Based on Rfaconv and Triplet Attention, by Zhipeng Ling et al.
-
Summary of Video Occupancy Models, by Manan Tomar et al.
-
Summary of Dpec: Dual-path Error Compensation Method For Enhanced Low-light Image Clarity, by Shuang Wang et al.
-
Summary of Towards Temporal Change Explanations From Bi-temporal Satellite Images, by Ryo Tsujimoto et al.
-
Summary of Empowering Few-shot Relation Extraction with the Integration Of Traditional Re Methods and Large Language Models, by Ye Liu et al.
-
Summary of Robustness Of Llms to Perturbations in Text, by Ayush Singh et al.
-
Summary of Emotion Talk: Emotional Support Via Audio Messages For Psychological Assistance, by Fabrycio Leite Nakano Almada et al.
-
Summary of Enhancing Few-shot Stock Trend Prediction with Large Language Models, by Yiqi Deng et al.
-
Summary of Introducing Vada: Novel Image Segmentation Model For Maritime Object Segmentation Using New Dataset, by Yongjin Kim et al.
-
Summary of Tcan: Animating Human Images with Temporally Consistent Pose Guidance Using Diffusion Models, by Jeongho Kim et al.
-
Summary of Spreadsheetllm: Encoding Spreadsheets For Large Language Models, by Yuzhang Tian et al.
-
Summary of Vision Language Model Is Not All You Need: Augmentation Strategies For Molecule Language Models, by Namkyeong Lee et al.