Paper List
We recommend you use the search box as this list is very long.
-
Summary of New Rules For Causal Identification with Background Knowledge, by Tian-zuo Wang et al.
-
Summary of Assessing Brittleness Of Image-text Retrieval Benchmarks From Vision-language Models Perspective, by Mariya Hendriksen et al.
-
Summary of Fmdnn: a Fuzzy-guided Multi-granular Deep Neural Network For Histopathological Image Classification, by Weiping Ding et al.
-
Summary of Emocam: Toward Understanding What Drives Cnn-based Emotion Recognition, by Youssef Doulfoukar and Laurent Mertens and Joost Vennekens
-
Summary of How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading, by Peng Cui et al.
-
Summary of Panoptic Segmentation Of Mammograms with Text-to-image Diffusion Model, by Kun Zhao et al.
-
Summary of Llms Left, Right, and Center: Assessing Gpt’s Capabilities to Label Political Bias From Web Domains, by Raphael Hernandes and Giulio Corsi
-
Summary of The Vision Of Autonomic Computing: Can Llms Make It a Reality?, by Zhiyang Zhang et al.
-
Summary of Check-eval: a Checklist-based Approach For Evaluating Text Quality, by Jayr Pereira and Andre Assumpcao and Roberto Lotufo
-
Summary of On Pre-training Of Multimodal Language Models Customized For Chart Understanding, by Wan-cyuan Fan et al.
-
Summary of Depict: Diffusion-enabled Permutation Importance For Image Classification Tasks, by Sarah Jabbour et al.
-
Summary of Thought-like-pro: Enhancing Reasoning Of Large Language Models Through Self-driven Prolog-based Chain-of-thought, by Xiaoyu Tan (1) et al.
-
Summary of Escape: Energy-based Selective Adaptive Correction For Out-of-distribution 3d Human Pose Estimation, by Luke Bidulka et al.
-
Summary of Sqlfuse: Enhancing Text-to-sql Performance Through Comprehensive Llm Synergy, by Tingkai Zhang et al.
-
Summary of Towards Automated Functional Equation Proving: a Benchmark Dataset and a Domain-specific In-context Agent, by Mahdi Buali et al.
-
Summary of Cve-llm : Automatic Vulnerability Evaluation in Medical Device Industry Using Large Language Models, by Rikhiya Ghosh et al.
-
Summary of A New Lightweight Hybrid Graph Convolutional Neural Network — Cnn Scheme For Scene Classification Using Object Detection Inference, by Ayman Beghdadi et al.
-
Summary of Human-interpretable Adversarial Prompt Attack on Large Language Models with Situational Context, by Nilanjana Das et al.
-
Summary of I Need Help! Evaluating Llm’s Ability to Ask For Users’ Support: a Case Study on Text-to-sql Generation, by Cheng-kuang Wu et al.
-
Summary of Intelligent Artistic Typography: a Comprehensive Review Of Artistic Text Design and Generation, by Yuhang Bai et al.
-
Summary of Crowdmac: Masked Crowd Density Completion For Robust Crowd Density Forecasting, by Ryo Fujii et al.
-
Summary of Percore: a Deep Learning-based Framework For Persian Spelling Correction with Phonetic Analysis, by Seyed Mohammad Sadegh Dashti et al.
-
Summary of Passion: Towards Effective Incomplete Multi-modal Medical Image Segmentation with Imbalanced Missing Rates, by Junjie Shi et al.
-
Summary of Phi-3 Safety Post-training: Aligning Language Models with a “break-fix” Cycle, by Emman Haider et al.
-
Summary of Linsatnet: the Positive Linear Satisfiability Neural Networks, by Runzhong Wang et al.
-
Summary of Werewolf Arena: a Case Study in Llm Evaluation Via Social Deduction, by Suma Bailis et al.
-
Summary of High Risk Of Political Bias in Black Box Emotion Inference Models, by Hubert Plisiecki et al.
-
Summary of Rt-pose: a 4d Radar Tensor-based 3d Human Pose Estimation and Localization Benchmark, by Yuan-hao Ho et al.
-
Summary of Duoformer: Leveraging Hierarchical Visual Representations by Local and Global Attention, By Xiaoya Tang et al.
-
Summary of Assurance Of Ai Systems From a Dependability Perspective, by Robin Bloomfield and John Rushby
-
Summary of Rag-qa Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering, by Rujun Han et al.
-
Summary of Optimizing Agricultural Order Fulfillment Systems: a Hybrid Tree Search Approach, by Pranay Thangeda et al.
-
Summary of Multi-modal Relation Distillation For Unified 3d Representation Learning, by Huiqun Wang et al.
-
Summary of Tta-ood: Test-time Augmentation For Improving Out-of-distribution Detection in Gastrointestinal Vision, by Sandesh Pokhrel et al.
-
Summary of Ecco: Can We Improve Model-generated Code Efficiency Without Sacrificing Functional Correctness?, by Siddhant Waghjale et al.
-
Summary of Octrack: Benchmarking the Open-corpus Multi-object Tracking, by Zekun Qian et al.
-
Summary of The Cardinality Of Identifying Code Sets For Soccer Ball Graph with Application to Remote Sensing, by Anna L.d. Latour et al.
-
Summary of Lekube: a Legal Knowledge Update Benchmark, by Changyue Wang et al.
-
Summary of Koma: Knowledge-driven Multi-agent Framework For Autonomous Driving with Large Language Models, by Kemou Jiang et al.
-
Summary of Covoswitch: Machine Translation Of Synthetic Code-switched Text Based on Intonation Units, by Yeeun Kang
-
Summary of Predictive Simultaneous Interpretation: Harnessing Large Language Models For Democratizing Real-time Multilingual Communication, by Kurando Iida et al.
-
Summary of How to Blend Concepts in Diffusion Models, by Lorenzo Olearo et al.
-
Summary of Llm-empowered State Representation For Reinforcement Learning, by Boyuan Wang et al.
-
Summary of Linear-complexity Self-supervised Learning For Speech Processing, by Shucong Zhang et al.
-
Summary of Sortability Of Time Series Data, by Christopher Lohse et al.
-
Summary of End-to-end Clinical Trial Matching with Large Language Models, by Dyke Ferber et al.
-
Summary of Enhancing Biomedical Knowledge Discovery For Diseases: An Open-source Framework Applied on Rett Syndrome and Alzheimer’s Disease, by Christos Theodoropoulos et al.
-
Summary of Enhancing Source-free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation, by Ilhoon Yoon et al.
-
Summary of How Reliable Are Llms As Knowledge Bases? Re-thinking Facutality and Consistency, by Danna Zheng et al.
-
Summary of Qalam : a Multimodal Llm For Arabic Optical Character and Handwriting Recognition, by Gagan Bhatia et al.
-
Summary of Plants: a Novel Problem and Dataset For Summarization Of Planning-like (pl) Tasks, by Vishal Pallagani et al.
-
Summary of Training-free Composite Scene Generation For Layout-to-image Synthesis, by Jiaqi Liu et al.
-
Summary of Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies, by Chaofan Tao et al.
-
Summary of A Comparative Study on Automatic Coding Of Medical Letters with Explainability, by Jamie Glen et al.
-
Summary of Weak-to-strong Reasoning, by Yuqing Yang et al.
-
Summary of Hpix: Generating Vector Maps From Satellite Images, by Aditya Taparia and Keshab Nath
-
Summary of Dart-math: Difficulty-aware Rejection Tuning For Mathematical Problem-solving, by Yuxuan Tong et al.
-
Summary of Cross-task Attack: a Self-supervision Generative Framework Based on Attention Shift, by Qingyuan Zeng et al.
-
Summary of Llms As Function Approximators: Terminology, Taxonomy, and Questions For Evaluation, by David Schlangen
-
Summary of Latent Causal Probing: a Formal Perspective on Probing with Causal Models Of Data, by Charles Jin et al.
-
Summary of Black-box Opinion Manipulation Attacks to Retrieval-augmented Generation Of Large Language Models, by Zhuo Chen et al.
-
Summary of Do Llms Have Consistent Values?, by Naama Rozen et al.
-
Summary of Explainable Biomedical Hypothesis Generation Via Retrieval Augmented Generation Enabled Large Language Models, by Alexander R. Pelletier et al.
-
Summary of Bright: a Realistic and Challenging Benchmark For Reasoning-intensive Retrieval, by Hongjin Su et al.
-
Summary of Dreamstory: Open-domain Story Visualization by Llm-guided Multi-subject Consistent Diffusion, By Huiguo He et al.
-
Summary of Halu-j: Critique-based Hallucination Judge, by Binjie Wang et al.
-
Summary of Temporal Label Hierachical Network For Compound Emotion Recognition, by Sunan Li and Hailun Lian and Cheng Lu and Yan Zhao and Tianhua Qi and Hao Yang and Yuan Zong and Wenming Zheng
-
Summary of A Three-stage Algorithm For the Closest String Problem on Artificial and Real Gene Sequences, by Alireza Abdi et al.
-
Summary of A Survey Of Prompt Engineering Methods in Large Language Models For Different Nlp Tasks, by Shubham Vatsal and Harsh Dubey
-
Summary of Agent-e: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems, by Tamer Abuelsaad and Deepak Akkil and Prasenjit Dey and Ashish Jagmohan and Aditya Vempaty and Ravi Kokku
-
Summary of Comprehensive Review and Empirical Evaluation Of Causal Discovery Algorithms For Numerical Data, by Wenjin Niu et al.
-
Summary of Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism, by Sangyoun Lee et al.
-
Summary of Metasumperceiver: Multimodal Multi-document Evidence Summarization For Fact-checking, by Ting-chih Chen et al.
-
Summary of On Causally Disentangled State Representation Learning For Reinforcement Learning Based Recommender Systems, by Siyu Wang and Xiaocong Chen and Lina Yao
-
Summary of Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with An Iterative Approach, by Zhouyu Jiang et al.
-
Summary of Translate-and-revise: Boosting Large Language Models For Constrained Translation, by Pengcheng Huang and Yongyu Mu and Yuzhang Wu and Bei Li and Chunyang Xiao and Tong Xiao and Jingbo Zhu
-
Summary of Learning Camouflaged Object Detection From Noisy Pseudo Label, by Jin Zhang and Ruiheng Zhang and Yanjiao Shi and Zhe Cao and Nian Liu and Fahad Shahbaz Khan
-
Summary of Scicode: a Research Coding Benchmark Curated by Scientists, By Minyang Tian et al.
-
Summary of Unified-egformer: Exposure Guided Lightweight Transformer For Mixed-exposure Image Enhancement, by Eashan Adhikarla et al.
-
Summary of Noder: Image Sequence Regression Based on Neural Ordinary Differential Equations, by Hao Bai et al.
-
Summary of Wtu-eval: a Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models, by Kangyun Ning et al.
-
Summary of Whispering Experts: Neural Interventions For Toxicity Mitigation in Language Models, by Xavier Suau et al.
-
Summary of Assessing the Effectiveness Of Gpt-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights, by Elphin Tom Joe and Sai Dileep Koneru and Christine J Kirchhoff
-
Summary of Regurgitative Training: the Value Of Real Data in Training Large Language Models, by Jinghui Zhang et al.
-
Summary of Why Does New Knowledge Create Messy Ripple Effects in Llms?, by Jiaxin Qin et al.
-
Summary of Black-box Model Ensembling For Textual and Visual Question Answering Via Information Fusion, by Yuxi Xia et al.
-
Summary of Nutribench: a Dataset For Evaluating Large Language Models on Nutrition Estimation From Meal Descriptions, by Andong Hua et al.
-
Summary of Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments, by Roland Daynauth et al.
-
Summary of Applicability Of Large Language Models and Generative Models For Legal Case Judgement Summarization, by Aniket Deroy et al.
-
Summary of Automated Question Generation on Tabular Data For Conversational Data Exploration, by Ritwik Chaudhuri et al.
-
Summary of Citeme: Can Language Models Accurately Cite Scientific Claims?, by Ori Press et al.
-
Summary of Analyzing Large Language Models Chatbots: An Experimental Approach Using a Probability Test, by Melise Peruchini et al.
-
Summary of Token-supervised Value Models For Enhancing Mathematical Problem-solving Capabilities Of Large Language Models, by Jung Hyun Lee et al.
-
Summary of Grad-sum: Leveraging Gradient Summarization For Optimal Prompt Engineering, by Derek Austin et al.