Paper List

We recommend you use the search box as this list is very long.

Summary of Automated Essay Scoring in Arabic: a Dataset and Analysis Of a Bert-based System, by Rayed Ghazawi et al.
Summary of Making New Connections: Llms As Puzzle Generators For the New York Times’ Connections Word Game, by Tim Merino et al.
Summary of Comet: “cone Of Experience” Enhanced Large Multimodal Model For Mathematical Problem Generation, by Sannyuya Liu et al.
Summary of Xeq Scale For Evaluating Xai Experience Quality, by Anjana Wijekoon et al.
Summary of An Empirical Study Of Validating Synthetic Data For Formula Generation, by Usneek Singh et al.
Summary of Enhancing Retrieval and Managing Retrieval: a Four-module Synergy For Improved Quality and Efficiency in Rag Systems, by Yunxiao Shi et al.
Summary of Qwen2 Technical Report, by An Yang et al.
Summary of Addressing Image Hallucination in Text-to-image Generation Through Factual Image Retrieval, by Youngsun Lim and Hyunjung Shim
Summary of Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning, by Yulong Wang et al.
Summary of Aligning Neuronal Coding Of Dynamic Visual Scenes with Foundation Vision Models, by Rining Wu et al.
Summary of When Synthetic Traces Hide Real Content: Analysis Of Stable Diffusion Image Laundering, by Sara Mandelli et al.
Summary of Accdiffusion: An Accurate Method For Higher-resolution Image Generation, by Zhihang Lin et al.
Summary of Clave: An Adaptive Framework For Evaluating Values Of Llm Generated Responses, by Jing Yao et al.
Summary of Graphusion: Leveraging Large Language Models For Scientific Knowledge Graph Fusion and Construction in Nlp Education, by Rui Yang et al.
Summary of Think-on-graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation, by Shengjie Ma et al.
Summary of Biasscanner: Automatic Detection and Classification Of News Bias to Strengthen Democracy, by Tim Menzner and Jochen L. Leidner
Summary of Enabling Mcts Explainability For Sequential Planning Through Computation Tree Logic, by Ziyan An et al.
Summary of An Actionable Framework For Assessing Bias and Fairness in Large Language Model Use Cases, by Dylan Bouchard
Summary of Weighted Grouped Query Attention in Transformers, by Sai Sena Chinnakonduru et al.
Summary of Spider2-v: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?, by Ruisheng Cao et al.
Summary of Benchmarking Vision Language Models For Cultural Understanding, by Shravan Nayak et al.
Summary of Ref-avs: Refer and Segment Objects in Audio-visual Scenes, by Yaoting Wang et al.
Summary of Make-an-agent: a Generalizable Policy Network Generator with Behavior-prompted Diffusion, by Yongyuan Liang et al.
Summary of Lab-bench: Measuring Capabilities Of Language Models For Biology Research, by Jon M. Laurent et al.
Summary of An Empirical Study Of Mamba-based Pedestrian Attribute Recognition, by Xiao Wang et al.
Summary of Ntsebench: Cognitive Reasoning Benchmark For Vision Language Models, by Pranshu Pandya et al.
Summary of Cooperative Reward Shaping For Multi-agent Pathfinding, by Zhenyu Song et al.
Summary of Expanding the Scope: Inductive Knowledge Graph Reasoning with Multi-starting Progressive Propagation, by Zhoutian Shao and Yuanning Cui and Wei Hu
Summary of Melon Fruit Detection and Quality Assessment Using Generative Ai-based Image Data Augmentation, by Seungri Yoon et al.
Summary of A Multi-stage Framework For 3d Individual Tooth Segmentation in Dental Cbct, by Chunshi Wang et al.
Summary of Backdoor Attacks Against Image-to-image Networks, by Wenbo Jiang and Hongwei Li and Jiaming He and Rui Zhang and Guowen Xu and Tianwei Zhang and Rongxing Lu
Summary of The Good, the Bad, and the Greedy: Evaluation Of Llms Should Not Ignore Non-determinism, by Yifan Song et al.
Summary of Kinetic Typography Diffusion Model, by Seonmi Park et al.
Summary of Ideal: Leveraging Infinite and Dynamic Characterizations Of Large Language Models For Query-focused Summarization, by Jie Cao et al.
Summary of How and Where Does Clip Process Negation?, by Vincent Quantmeyer and Pablo Mosteiro and Albert Gatt
Summary of Tcm-ftp: Fine-tuning Large Language Models For Herbal Prescription Prediction, by Xingzhi Zhou et al.
Summary of An Experimental Evaluation Of Siamese Neural Networks For Robot Localization Using Omnidirectional Imaging in Indoor Environments, by J.j.cabrera et al.
Summary of Understanding the Dependence Of Perception Model Competency on Regions in An Image, by Sara Pohland and Claire Tomlin
Summary of Boosting Zero-shot Crosslingual Performance Using Llm-based Augmentations with Effective Data Selection, by Barah Fazili et al.
Summary of 3d Geometric Shape Assembly Via Efficient Point Cloud Matching, by Nahyuk Lee et al.
Summary of Leveraging Hybrid Intelligence Towards Sustainable and Energy-efficient Machine Learning, by Daniel Geissler et al.
Summary of An Evaluation Of Cnn Models and Data Augmentation Techniques in Hierarchical Localization Of Mobile Robots, by J.j. Cabrera et al.
Summary of Prompt Selection Matters: Enhancing Text Annotations For Social Sciences with Large Language Models, by Louis Abraham et al.
Summary of Farfetched: Entity-centric Reasoning and Claim Validation For the Greek Language Based on Textually Represented Environments, by Dimitris Papadopoulos et al.
Summary of Wojoodner 2024: the Second Arabic Named Entity Recognition Shared Task, by Mustafa Jarrar et al.
Summary of Learning Online Scale Transformation For Talking Head Video Generation, by Fa-ting Hong et al.
Summary of Characterizing Disparity Between Edge Models and High-accuracy Base Models For Vision Tasks, by Zhenyu Wang et al.
Summary of Atomagents: Alloy Design and Discovery Through Physics-aware Multi-modal Multi-agent Artificial Intelligence, by Alireza Ghafarollahi and Markus J. Buehler
Summary of Causality Extraction From Medical Text Using Large Language Models (llms), by Seethalakshmi Gopalakrishnan et al.
Summary of Document-level Clinical Entity and Relation Extraction Via Knowledge Base-guided Generation, by Kriti Bhattarai et al.
Summary of Lean-star: Learning to Interleave Thinking and Proving, by Haohan Lin et al.
Summary of Autograms: Autonomous Graphical Agent Modeling Software, by Ben Krause et al.
Summary of Learning to Refuse: Towards Mitigating Privacy Risks in Llms, by Zhenhua Liu et al.
Summary of Rapid Biomedical Research Classification: the Pandemic Pact Advanced Categorisation Engine, by Omid Rohanian et al.
Summary of Hierarchical Multi-modal Transformer For Cross-modal Long Document Classification, by Tengfei Liu et al.
Summary of Look Within, Why Llms Hallucinate: a Causal Perspective, by He Li and Haoang Chi and Mingyu Liu and Wenjing Yang
Summary of Chatlogic: Integrating Logic Programming with Large Language Models For Multi-step Reasoning, by Zhongsheng Wang et al.
Summary of Key-point-driven Mathematical Reasoning Distillation Of Large Language Model, by Xunyu Zhu et al.
Summary of Shape2scene: 3d Scene Representation Learning Through Pre-training on Shape Data, by Tuo Feng et al.
Summary of Visual Prompt Selection For In-context Learning Segmentation, by Wei Suo et al.
Summary of Alphadou: High-performance End-to-end Doudizhu Ai Integrating Bidding, by Chang Lei et al.
Summary of Cross-lingual Multi-hop Knowledge Editing, by Aditi Khandelwal et al.
Summary of Sora and V-jepa Have Not Learned the Complete Real World Model — a Philosophical Analysis Of Video Ais Through the Theory Of Productive Imagination, by Jianqiu Zhang
Summary of Putting Gpt-4o to the Sword: a Comprehensive Evaluation Of Language, Vision, Speech, and Multimodal Proficiency, by Sakib Shahriar et al.
Summary of Towards Llm-powered Ambient Sensor Based Multi-person Human Activity Recognition, by Xi Chen (m-psi) et al.
Summary of Optimization Of Autonomous Driving Image Detection Based on Rfaconv and Triplet Attention, by Zhipeng Ling et al.
Summary of Mate: Meet at the Embedding — Connecting Images with Long Texts, by Young Kyun Jang et al.
Summary of Video Occupancy Models, by Manan Tomar et al.
Summary of A Transformer-based Multi-stream Approach For Isolated Iranian Sign Language Recognition, by Ali Ghadami et al.
Summary of Towards Temporal Change Explanations From Bi-temporal Satellite Images, by Ryo Tsujimoto et al.
Summary of Explainable Image Captioning Using Cnn- Cnn Architecture and Hierarchical Attention, by Rishi Kesav Mohan et al.
Summary of Dpec: Dual-path Error Compensation Method For Enhanced Low-light Image Clarity, by Shuang Wang et al.
Summary of Don’t Fear Peculiar Activation Functions: Euaf and Beyond, by Qianchao Wang (1) et al.
Summary of Diagnosing and Re-learning For Balanced Multimodal Learning, by Yake Wei and Siwei Li and Ruoxuan Feng and Di Hu
Summary of Iccv23 Visual-dialog Emotion Explanation Challenge: Seu_309 Team Technical Report, by Yixiao Yuan and Yingzhe Peng
Summary of Contextualstory: Consistent Visual Storytelling with Spatially-enhanced and Storyline Context, by Sixiao Zheng et al.
Summary of Layout-and-retouch: a Dual-stage Framework For Improving Diversity in Personalized Image Generation, by Kangyeol Kim et al.
Summary of Nativqa: Multilingual Culturally-aligned Natural Query For Llms, by Md. Arid Hasan et al.
Summary of Building Pre-train Llm Dataset For the Indic Languages: a Case Study on Hindi, by Shantipriya Parida and Shakshi Panwar and Kusum Lata and Sanskruti Mishra and Sambit Sekhar
Summary of Preserving the Privacy Of Reward Functions in Mdps Through Deception, by Shashank Reddy Chirra et al.
Summary of Towards Systematic Monolingual Nlp Surveys: Gena Of Greek Nlp, by Juli Bakagianni et al.
Summary of Cellagent: An Llm-driven Multi-agent Framework For Automated Single-cell Data Analysis, by Yihang Xiao et al.
Summary of Sefi-cd: a Semantic First Change Detection Paradigm That Can Detect Any Change You Want, by Ling Zhao et al.
Summary of Machine Apophenia: the Kaleidoscopic Generation Of Architectural Images, by Alexey Tikhonov and Dmitry Sinyavin
Summary of The Two Sides Of the Coin: Hallucination Generation and Detection with Llms As Evaluators For Llms, by Anh Thu Maria Bui et al.
Summary of Dart: An Automated End-to-end Object Detection Pipeline with Data Diversification, Open-vocabulary Bounding Box Annotation, Pseudo-label Review, and Model Training, by Chen Xin et al.
Summary of Enhancing Depressive Post Detection in Bangla: a Comparative Study Of Tf-idf, Bert and Fasttext Embeddings, by Saad Ahmed Sazan et al.
Summary of A Chatbot For Asylum-seeking Migrants in Europe, by Bettina Fazzinga et al.
Summary of From Easy to Hard: Learning Curricular Shape-aware Features For Robust Panoptic Scene Graph Generation, by Hanrong Shi and Lin Li and Jun Xiao and Yueting Zhuang and Long Chen
Summary of Evaluating Ai Evaluation: Perils and Prospects, by John Burden
Summary of Fedvae: Trajectory Privacy Preserving Based on Federated Variational Autoencoder, by Yuchen Jiang et al.
Summary of Predicting and Understanding Human Action Decisions: Insights From Large Language Models and Cognitive Instance-based Learning, by Thuy Ngoc Nguyen et al.
Summary of Constrained Intrinsic Motivation For Reinforcement Learning, by Xiang Zheng et al.
Summary of Dahrs: Divergence-aware Hallucination-remediated Srl Projection, by Sangpil Youm et al.
Summary of Instruction Following with Goal-conditioned Reinforcement Learning in Virtual Environments, by Zoya Volovikova et al.
Summary of Sina at Fignews 2024: Multilingual Datasets Annotated with Bias and Propaganda, by Lina Duaibes et al.
Summary of Is Contrasting All You Need? Contrastive Learning For the Detection and Attribution Of Ai-generated Text, by Lucio La Cava et al.
Summary of Gavel: Generating Games Via Evolution and Language Models, by Graham Todd et al.
Summary of Spiqa: a Dataset For Multimodal Question Answering on Scientific Papers, by Shraman Pramanick et al.
Summary of Muscle: a Model Update Strategy For Compatible Llm Evolution, by Jessica Echterhoff et al.