Paper List
We recommend you use the search box as this list is very long.
-
Summary of S3pt: Scene Semantics and Structure Guided Clustering to Boost Self-supervised Pre-training For Autonomous Driving, by Maciej K. Wozniak et al.
-
Summary of Teaching a Language Model to Distinguish Between Similar Details Using a Small Adversarial Training Set, by Chris Achard
-
Summary of Diamond: Dementia Diagnosis with Multi-modal Vision Transformers Using Mri and Pet, by Yitong Li et al.
-
Summary of Public Domain 12m: a Highly Aesthetic Image-text Dataset with Novel Governance Mechanisms, by Jordan Meyer et al.
-
Summary of A Little Less Conversation, a Little More Action, Please: Investigating the Physical Common-sense Of Llms in a 3d Embodied Environment, by Matteo G. Mecattaf et al.
-
Summary of Tomato: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models, by Ziyao Shangguan et al.
-
Summary of Vl-cache: Sparsity and Modality-aware Kv Cache Compression For Vision-language Model Inference Acceleration, by Dezhan Tu et al.
-
Summary of Tpp-gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes, by Alessandro D’amelio et al.
-
Summary of Leaf: Learning and Evaluation Augmented by Fact-checking to Improve Factualness in Large Language Models, By Hieu Tran et al.
-
Summary of Graph-augmented Relation Extraction Model with Llms-generated Support Document, by Vicky Dong and Hao Yu and Yao Chen
-
Summary of From Context to Action: Analysis Of the Impact Of State Representation and Context on the Generalization Of Multi-turn Web Navigation Agents, by Nalin Tiwary et al.
-
Summary of Simulating User Agents For Embodied Conversational-ai, by Daniel Philipov et al.
-
Summary of Kernel Looping: Eliminating Synchronization Boundaries For Peak Inference Performance, by David Koeplinger et al.
-
Summary of Effective Guidance For Model Attention with Simple Yes-no Annotations, by Seongmin Lee et al.
-
Summary of Addressing Issues with Working Memory in Video Object Segmentation, by Clayton Bromley et al.
-
Summary of Do Large Language Models Align with Core Mental Health Counseling Competencies?, by Viet Cuong Nguyen et al.
-
Summary of Image2struct: Benchmarking Structure Extraction For Vision-language Models, by Josselin Somerville Roberts et al.
-
Summary of Predicting Future Actions Of Reinforcement Learning Agents, by Stephen Chung et al.
-
Summary of Scaling Llm Inference with Optimized Sample Compute Allocation, by Kexun Zhang et al.
-
Summary of Realcqa-v2 : Visual Premise Proving a Manual Cot Dataset For Charts, by Saleem Ahmed et al.
-
Summary of From Silos to Systems: Process-oriented Hazard Analysis For Ai Systems, by Shalaleh Rismani et al.
-
Summary of Ml Research Benchmark, by Matthew Kenney
-
Summary of Cogs: Model Agnostic Causality Constrained Counterfactual Explanations Using Goal-directed Asp, by Sopam Dasgupta et al.
-
Summary of Prove Your Point!: Bringing Proof-enhancement Principles to Argumentative Essay Generation, by Ruiyu Xiao et al.
-
Summary of Backdoor Attack Against Vision Transformers Via Attention Gradient-based Image Erosion, by Ji Guo et al.
-
Summary of Self-driving Car Racing: Application Of Deep Reinforcement Learning, by Florentiana Yuwono et al.
-
Summary of Beyond Ontology in Dialogue State Tracking For Goal-oriented Chatbot, by Sejin Lee and Dongha Kim and Min Song
-
Summary of Injecguard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models, by Hao Li et al.
-
Summary of Reliability Assessment Of Information Sources Based on Random Permutation Set, by Juntao Xu et al.
-
Summary of Eliciting Critical Reasoning in Retrieval-augmented Language Models Via Contrastive Explanations, by Leonardo Ranaldi et al.
-
Summary of Sfa-unet: More Attention to Multi-scale Contrast and Contextual Information in Infrared Small Object Segmentation, by Imad Ali Shah et al.
-
Summary of Less Is More: Pre-training Cross-lingual Small-scale Language Models with Cognitively-plausible Curriculum Learning Strategies, by Suchir Salhan et al.
-
Summary of Bis: Nl2sql Service Evaluation Benchmark For Business Intelligence Scenarios, by Bora Caglayan et al.
-
Summary of Diffusion As Reasoning: Enhancing Object Goal Navigation with Llm-biased Diffusion Model, by Yiming Ji et al.
-
Summary of A Fresh Look at Generalized Category Discovery Through Non-negative Matrix Factorization, by Zhong Ji et al.
-
Summary of Advancing Efficient Brain Tumor Multi-class Classification — New Insights From the Vision Mamba Model in Transfer Learning, by Yinyi Lai et al.
-
Summary of Building Altruistic and Moral Ai Agent with Brain-inspired Affective Empathy Mechanisms, by Feifei Zhao et al.
-
Summary of Beyond Text: Optimizing Rag with Multimodal Inputs For Industrial Applications, by Monica Riedler et al.
-
Summary of From Explicit Rules to Implicit Reasoning in An Interpretable Violence Monitoring System, by Wen-dong Jiang et al.
-
Summary of Path-based Summary Explanations For Graph Recommenders (extended Version), by Danae Pla Karidi and Evaggelia Pitoura
-
Summary of Sing It, Narrate It: Quality Musical Lyrics Translation, by Zhuorui Ye et al.
-
Summary of Tractshapenet: Efficient Multi-shape Learning with 3d Tractography Point Clouds, by Yui Lo et al.
-
Summary of Mapping the Neuro-symbolic Ai Landscape by Architectures: a Handbook on Augmenting Deep Learning Through Symbolic Reasoning, By Jonathan Feldstein et al.
-
Summary of Hyperspectral Imaging-based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models, by Imad Ali Shah et al.
-
Summary of Protecting Privacy in Multimodal Large Language Models with Mllmu-bench, by Zheyuan Liu et al.
-
Summary of Solving Epistemic Logic Programs Using Generate-and-test with Propagation, by Jorge Fandinno and Lute Lillo
-
Summary of Natural Language Processing For Analyzing Electronic Health Records and Clinical Notes in Cancer Research: a Review, by Muhammad Bilal et al.
-
Summary of Lightweight Frequency Masker For Cross-domain Few-shot Semantic Segmentation, by Jintao Tong et al.
-
Summary of Adam: An Embodied Causal Agent in Open-world Environments, by Shu Yu et al.
-
Summary of A Methodology For Incompleteness-tolerant and Modular Gradual Semantics For Argumentative Statement Graphs, by Antonio Rago et al.
-
Summary of Democratizing Reward Design For Personal and Representative Value-alignment, by Carter Blair et al.
-
Summary of From Melodic Note Sequences to Pitches Using Word2vec, by Daniel Defays
-
Summary of Contextiq: a Multimodal Expert-based Video Retrieval System For Contextual Advertising, by Ashutosh Chaubey et al.
-
Summary of Llmcbench: Benchmarking Large Language Model Compression For Efficient Deployment, by Ge Yang et al.
-
Summary of Causal Interventions on Causal Paths: Mapping Gpt-2’s Reasoning From Syntax to Semantics, by Isabelle Lee et al.
-
Summary of Ct2c-qa: Multimodal Question Answering Over Chinese Text, Table and Chart, by Bowen Zhao et al.
-
Summary of Advi2i: Adversarial Image Attack on Image-to-image Diffusion Models, by Yaopei Zeng et al.
-
Summary of Estimating Causal Effects Of Text Interventions Leveraging Llms, by Siyi Guo et al.
-
Summary of Large Language Models For Manufacturing, by Yiwei Li et al.
-
Summary of Can Large Language Models Act As Symbolic Reasoners?, by Rob Sullivan et al.
-
Summary of Efficient Training Of Sparse Autoencoders For Large Language Models Via Layer Groups, by Davide Ghilardi et al.
-
Summary of Going Beyond H&e and Oncology: How Do Histopathology Foundation Models Perform For Multi-stain Ihc and Immunology?, by Amaya Gallagher-syed et al.
-
Summary of Thank You, Stingray: Multilingual Large Language Models Can Not (yet) Disambiguate Cross-lingual Word Sense, by Samuel Cahyawijaya and Ruochen Zhang and Holy Lovenia and Jan Christian Blaise Cruz and Elisa Gilbert and Hiroki Nomoto and Alham Fikri Aji
-
Summary of Can Large Language Models Replace Data Scientists in Clinical Research?, by Zifeng Wang et al.
-
Summary of Imagenet-rib Benchmark: Large Pre-training Datasets Don’t Always Guarantee Robustness After Fine-tuning, by Jaedong Hwang et al.
-
Summary of Asynchronous Tool Usage For Real-time Agents, by Antonio A. Ginart et al.
-
Summary of Adaptgcd: Multi-expert Adapter Tuning For Generalized Category Discovery, by Yuxun Qu et al.
-
Summary of A Bayesian Approach to Harnessing the Power Of Llms in Authorship Attribution, by Zhengmian Hu et al.
-
Summary of Mcpdial: a Minecraft Persona-driven Dialogue Dataset, by Seyed Hossein Alavi et al.
-
Summary of Enhancing Financial Question Answering with a Multi-agent Reflection Framework, by Sorouralsadat Fatemi et al.
-
Summary of Learning and Unlearning Of Fabricated Knowledge in Language Models, by Chen Sun et al.
-
Summary of Text-guided Attention Is All You Need For Zero-shot Robustness in Vision-language Models, by Lu Yu et al.
-
Summary of Inverse Attention Agent For Multi-agent System, by Qian Long et al.
-
Summary of Informed Deep Abstaining Classifier: Investigating Noise-robust Training For Diagnostic Decision Support Systems, by Helen Schneider et al.
-
Summary of Kandinsky 3: Text-to-image Synthesis For Multifunctional Generative Framework, by Vladimir Arkhipkin et al.
-
Summary of Stealthy Jailbreak Attacks on Large Language Models Via Benign Data Mirroring, by Honglin Mu et al.
-
Summary of Efficient Mixture-of-expert For Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving, by Jiyao Wang et al.
-
Summary of Towards Unifying Evaluation Of Counterfactual Explanations: Leveraging Large Language Models For Human-centric Assessments, by Marharyta Domnich et al.
-
Summary of Retrieval-enhanced Mutation Mastery: Augmenting Zero-shot Prediction Of Protein Language Model, by Yang Tan et al.
-
Summary of Palisade — Prompt Injection Detection Framework, by Sahasra Kokkula et al.
-
Summary of Deep Learning-based Fatigue Cracks Detection in Bridge Girders Using Feature Pyramid Networks, by Jiawei Zhang et al.
-
Summary of Belief in the Machine: Investigating Epistemological Blind Spots Of Language Models, by Mirac Suzgun et al.
-
Summary of Hierarchical Knowledge Graph Construction From Images For Scalable E-commerce, by Zhantao Yang et al.
-
Summary of Multi-modal Ai For Comprehensive Breast Cancer Prognostication, by Jan Witowski et al.
-
Summary of Autobench-v: Can Large Vision-language Models Benchmark Themselves?, by Han Bao et al.
-
Summary of Larp: Tokenizing Videos with a Learned Autoregressive Generative Prior, by Hanyu Wang et al.
-
Summary of Natural Language Processing For the Legal Domain: a Survey Of Tasks, Datasets, Models, and Challenges, by Farid Ariai and Gianluca Demartini
-
Summary of Eora: Training-free Compensation For Compressed Llm with Eigenspace Low-rank Approximation, by Shih-yang Liu et al.
-
Summary of Mmdocbench: Benchmarking Large Vision-language Models For Fine-grained Visual Document Understanding, by Fengbin Zhu et al.
-
Summary of Multi-path Exploration and Feedback Adjustment For Text-to-image Person Retrieval, by Bin Kang et al.
-
Summary of Llm Robustness Against Misinformation in Biomedical Question Answering, by Alexander Bondarenko et al.
-
Summary of Fine-tuned Large Language Models (llms): Improved Prompt Injection Attacks Detection, by Md Abdur Rahman et al.
-
Summary of Large Language Model Benchmarks in Medical Tasks, by Lawrence K.q. Yan et al.
-
Summary of Rethinking Data Synthesis: a Teacher Model Training Recipe with Interpretation, by Yifang Chen et al.
-
Summary of Addressing the Pitfalls Of Image-based Structural Health Monitoring: a Focus on False Positives, False Negatives, and Base Rate Bias, by Vagelis Plevris
-
Summary of Autokaggle: a Multi-agent Framework For Autonomous Data Science Competitions, by Ziming Li et al.
-
Summary of Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns, by Ronghui Li et al.
-
Summary of Medgo: a Chinese Medical Large Language Model, by Haitao Zhang and Bo An
-
Summary of Nt-vot211: a Large-scale Benchmark For Night-time Visual Object Tracking, by Yu Liu et al.