Paper List
We recommend you use the search box as this list is very long.
-
Summary of Dynamic Universal Approximation Theory: the Basic Theory For Deep Learning-based Computer Vision Models, by Wei Wang et al.
-
Summary of I Could’ve Asked That: Reformulating Unanswerable Questions, by Wenting Zhao et al.
-
Summary of Model Collapse in the Self-consuming Chain Of Diffusion Finetuning: a Novel Perspective From Quantitative Trait Modeling, by Youngseok Yoon et al.
-
Summary of Mapping the Technological Future: a Topic, Sentiment, and Emotion Analysis in Social Media Discourse, by Alina Landowska et al.
-
Summary of Streamtinynet: Video Streaming Analysis with Spatial-temporal Tinyml, by Hazem Hesham Yousef Shalby et al.
-
Summary of A Process Algebraic Framework For Multi-agent Dynamic Epistemic Systems, by Alessandro Aldini
-
Summary of Crasar-u-droids: a Large Scale Benchmark Dataset For Building Alignment and Damage Assessment in Georectified Suas Imagery, by Thomas Manzini et al.
-
Summary of Cityx: Controllable Procedural Content Generation For Unbounded 3d Cities, by Shougao Zhang et al.
-
Summary of Comoto: Unpaired Cross-modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis, by Muhammad Alberb et al.
-
Summary of Examining the Influence Of Political Bias on Large Language Model Performance in Stance Classification, by Lynnette Hui Xian Ng et al.
-
Summary of Enhancing Agent Learning Through World Dynamics Modeling, by Zhiyuan Sun et al.
-
Summary of Cost-effective Instruction Learning For Pathology Vision and Language Analysis, by Kaitao Chen et al.
-
Summary of Infinite Ends From Finite Samples: Open-ended Goal Inference As Top-down Bayesian Filtering Of Bottom-up Proposals, by Tan Zhi-xuan et al.
-
Summary of Networks Of Networks: Complexity Class Principles Applied to Compound Ai Systems Design, by Jared Quincy Davis et al.
-
Summary of Ai-enhanced 7-point Checklist For Melanoma Detection Using Clinical Knowledge Graphs and Data-driven Quantification, by Yuheng Wang et al.
-
Summary of Mllm-compbench: a Comparative Reasoning Benchmark For Multimodal Llms, by Jihyung Kil et al.
-
Summary of Early Screening Of Potential Breakthrough Technologies with Enhanced Interpretability: a Patent-specific Hierarchical Attention Network Model, by Jaewoong Choi et al.
-
Summary of Toward An Integrated Decision Making Framework For Optimized Stroke Diagnosis with Dsa and Treatment Under Uncertainty, by Nur Ahmad Khatim et al.
-
Summary of Case-enhanced Vision Transformer: Improving Explanations Of Image Similarity with a Vit-based Similarity Metric, by Ziwei Zhao et al.
-
Summary of High Efficiency Image Compression For Large Visual-language Models, by Binzhe Li et al.
-
Summary of Enhancing Environmental Monitoring Through Multispectral Imaging: the Wastems Dataset For Semantic Segmentation Of Lakeside Waste, by Qinfeng Zhu et al.
-
Summary of Diffree: Text-guided Shape Free Object Inpainting with Diffusion Model, by Lirui Zhao et al.
-
Summary of A Survey Forest Diagram : Gain a Divergent Insight View on a Specific Research Topic, by Jinghong Li et al.
-
Summary of Pipa++: Towards Unification Of Domain Adaptive Semantic Segmentation Via Self-supervised Learning, by Mu Chen and Zhedong Zheng and Yi Yang
-
Summary of When Text and Images Don’t Mix: Bias-correcting Language-image Similarity Scores For Anomaly Detection, by Adam Goodge et al.
-
Summary of Sdoh-gpt: Using Large Language Models to Extract Social Determinants Of Health (sdoh), by Bernardo Consoli et al.
-
Summary of Xmecap: Meme Caption Generation with Sub-image Adaptability, by Yuyan Chen et al.
-
Summary of Testing Large Language Models on Driving Theory Knowledge and Skills For Connected Autonomous Vehicles, by Zuoyin Tang et al.
-
Summary of Alpi: Auto-labeller with Proxy Injection For 3d Object Detection Using 2d Labels Only, by Saad Lahlali et al.
-
Summary of Lean-github: Compiling Github Lean Repositories For a Versatile Lean Prover, by Zijian Wu et al.
-
Summary of Scisegv2: a Universal Tool For Segmentation Of Intramedullary Lesions in Spinal Cord Injury, by Enamundram Naga Karthik et al.
-
Summary of Improving Icd Coding Using Chapter Based Named Entities and Attentional Models, by Abhijith R. Beeravolu et al.
-
Summary of Catvton: Concatenation Is All You Need For Virtual Try-on with Diffusion Models, by Zheng Chong et al.
-
Summary of Analyzing Polysemy Evolution Using Semantic Cells, by Yukio Ohsawa et al.
-
Summary of Finetuning Generative Large Language Models with Discrimination Instructions For Knowledge Graph Completion, by Yang Liu and Xiaobin Tian and Zequn Sun and Wei Hu
-
Summary of Advancing Brain Imaging Analysis Step-by-step Via Progressive Self-paced Learning, by Yanwu Yang et al.
-
Summary of Fora: Low-rank Adaptation Model Beyond Multimodal Siamese Network, by Weiying Xie and Yusi Zhang and Tianlin Hui and Jiaqing Zhang and Jie Lei and Yunsong Li
-
Summary of Learning Trimodal Relation For Audio-visual Question Answering with Missing Modality, by Kyu Ri Park et al.
-
Summary of Inf-llava: Dual-perspective Perception For High-resolution Multimodal Large Language Model, by Yiwei Ma et al.
-
Summary of Mcts Based Dispatch Of Autonomous Vehicles Under Operational Constraints For Continuous Transportation, by Milan Tomy et al.
-
Summary of Hsvlt: Hierarchical Scale-aware Vision-language Transformer For Multi-label Image Classification, by Shuyi Ouyang et al.
-
Summary of Lawluo: a Multi-agent Collaborative Framework For Multi-round Chinese Legal Consultation, by Jingyun Sun et al.
-
Summary of Primeguard: Safe and Helpful Llms Through Tuning-free Routing, by Blazej Manczak et al.
-
Summary of Soap: Enhancing Spatio-temporal Relation and Motion Information Capturing For Few-shot Action Recognition, by Wenbo Huang et al.
-
Summary of Virtue Ethics For Ethically Tunable Robotic Assistants, by Rajitha Ramanayake et al.
-
Summary of Machine Translation Hallucination Detection For Low and High Resource Languages Using Large Language Models, by Kenza Benkirane et al.
-
Summary of Psychomatics — a Multidisciplinary Framework For Understanding Artificial Minds, by Giuseppe Riva et al.
-
Summary of Is 3d Convolution with 5d Tensors Really Necessary For Video Analysis?, by Habib Hajimolahoseini et al.
-
Summary of Hapfi: History-aware Planning Based on Fused Information, by Sujin Jeon et al.
-
Summary of A Comparative Study on Patient Language Across Therapeutic Domains For Effective Patient Voice Classification in Online Health Discussions, by Giorgos Lysandrou et al.
-
Summary of A Framework For Pupil Tracking with Event Cameras, by Khadija Iddrisu et al.
-
Summary of Yolo-pdd: a Novel Multi-scale Pcb Defect Detection Method Using Deep Representations with Sequential Images, by Bowen Liu et al.
-
Summary of Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners, by Yifei Gao et al.
-
Summary of In-context Learning Improves Compositional Understanding Of Vision-language Models, by Matteo Nulli et al.
-
Summary of Unsupervised Robust Cross-lingual Entity Alignment Via Neighbor Triple Matching with Entity and Relation Texts, by Soojin Yoon et al.
-
Summary of Can Gpt-4 Learn to Analyse Moves in Research Article Abstracts?, by Danni Yu et al.
-
Summary of Norface: Improving Facial Expression Analysis by Identity Normalization, By Hanwei Liu et al.
-
Summary of Psychometric Alignment: Capturing Human Knowledge Distributions Via Language Models, by Joy He-yueya et al.
-
Summary of Problems in Ai, Their Roots in Philosophy, and Implications For Science and Society, by Max Velthoven et al.
-
Summary of Swinsf: Image Reconstruction From Spatial-temporal Spike Streams, by Liangyan Jiang et al.
-
Summary of Predicting the Best Of N Visual Trackers, by Basit Alawode et al.
-
Summary of Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps, by Rabbia Asghar et al.
-
Summary of Dstruct2design: Data and Benchmarks For Data Structure Driven Generative Floor Plan Design, by Zhi Hao Luo et al.
-
Summary of Mamba Meets Crack Segmentation, by Zhili He et al.
-
Summary of Gfe-mamba: Mamba-based Ad Multi-modal Progression Assessment Via Generative Feature Extraction From Mci, by Zhaojie Fang et al.
-
Summary of On Shallow Planning Under Partial Observability, by Randy Lefebvre et al.
-
Summary of Taskgen: a Task-based, Memory-infused Agentic Framework Using Strictjson, by John Chong Min Tan et al.
-
Summary of Explaining Decisions in Ml Models: a Parameterized Complexity Analysis, by Sebastian Ordyniak et al.
-
Summary of Towards Latent Masked Image Modeling For Self-supervised Visual Representation Learning, by Yibing Wei et al.
-
Summary of Dmel: Speech Tokenization Made Simple, by He Bai et al.
-
Summary of Leveraging Large Language Models to Geolocate Linguistic Variations in Social Media Posts, by Davide Savarro et al.
-
Summary of Self-training Room Layout Estimation Via Geometry-aware Ray-casting, by Bolivar Solarte et al.
-
Summary of Medsaga: Few-shot Memory Efficient Medical Image Segmentation Using Gradient Low-rank Projection in Sam, by Navyansh Mahla et al.
-
Summary of Multi-agent Causal Discovery Using Large Language Models, by Hao Duong Le et al.
-
Summary of Rethinking Feature Backbone Fine-tuning For Remote Sensing Object Detection, by Yechan Kim and Jonghyun Park and Sooyeon Kim and Moongu Jeon
-
Summary of Dopra: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer, by Jinfeng Wei et al.
-
Summary of Distilling Vision-language Foundation Models: a Data-free Approach Via Prompt Diversification, by Yunyi Xuan et al.
-
Summary of Reattention: Training-free Infinite Context with Finite Attention Scope, by Xiaoran Liu et al.
-
Summary of Assessing Brittleness Of Image-text Retrieval Benchmarks From Vision-language Models Perspective, by Mariya Hendriksen et al.
-
Summary of The Hitchhiker’s Guide to Human Alignment with *po, by Kian Ahrabian et al.
-
Summary of Explaining Decisions Of Agents in Mixed-motive Games, by Maayan Orner et al.
-
Summary of New Rules For Causal Identification with Background Knowledge, by Tian-zuo Wang et al.
-
Summary of Fmdnn: a Fuzzy-guided Multi-granular Deep Neural Network For Histopathological Image Classification, by Weiping Ding et al.
-
Summary of X-recon: Learning-based Patient-specific High-resolution Ct Reconstruction From Orthogonal X-ray Images, by Yunpeng Wang et al.
-
Summary of Odyssey: Empowering Minecraft Agents with Open-world Skills, by Shunyu Liu et al.
-
Summary of A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model, by Yingxue Xu et al.
-
Summary of Walking in Others’ Shoes: How Perspective-taking Guides Large Language Models in Reducing Toxicity and Bias, by Rongwu Xu et al.
-
Summary of Towards Robust Vision Transformer Via Masked Adaptive Ensemble, by Fudong Lin et al.
-
Summary of Allam: Large Language Models For Arabic and English, by M Saiful Bari et al.
-
Summary of Imposter.ai: Adversarial Attacks with Hidden Intentions Towards Aligned Large Language Models, by Xiao Liu et al.
-
Summary of Semantic Diversity-aware Prototype-based Learning For Unbiased Scene Graph Generation, by Jaehyeong Jeon et al.
-
Summary of Towards Automated Functional Equation Proving: a Benchmark Dataset and a Domain-specific In-context Agent, by Mahdi Buali et al.
-
Summary of Thought-like-pro: Enhancing Reasoning Of Large Language Models Through Self-driven Prolog-based Chain-of-thought, by Xiaoyu Tan (1) et al.
-
Summary of Sqlfuse: Enhancing Text-to-sql Performance Through Comprehensive Llm Synergy, by Tingkai Zhang et al.
-
Summary of Escape: Energy-based Selective Adaptive Correction For Out-of-distribution 3d Human Pose Estimation, by Luke Bidulka et al.
-
Summary of Cve-llm : Automatic Vulnerability Evaluation in Medical Device Industry Using Large Language Models, by Rikhiya Ghosh et al.
-
Summary of Human-interpretable Adversarial Prompt Attack on Large Language Models with Situational Context, by Nilanjana Das et al.
-
Summary of A New Lightweight Hybrid Graph Convolutional Neural Network — Cnn Scheme For Scene Classification Using Object Detection Inference, by Ayman Beghdadi et al.
-
Summary of Crowdmac: Masked Crowd Density Completion For Robust Crowd Density Forecasting, by Ryo Fujii et al.