Paper List
We recommend you use the search box as this list is very long.
-
Summary of Neuralood: Improving Out-of-distribution Generalization Performance with Brain-machine Fusion Learning Framework, by Shuangchen Zhao et al.
-
Summary of Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress, by Ayomide Odumakinde et al.
-
Summary of Cvpt: Cross-attention Help Visual Prompt Tuning Adapt Visual Task, by Lingyun Huang et al.
-
Summary of Sequence-aware Pre-training For Echocardiography Probe Guidance, by Haojun Jiang et al.
-
Summary of Flexible Categorization Using Formal Concept Analysis and Dempster-shafer Theory, by Marcel Boersma et al.
-
Summary of Mamba2mil: State Space Duality Based Multiple Instance Learning For Computational Pathology, by Yuqi Zhang et al.
-
Summary of Evidence-enhanced Triplet Generation Framework For Hallucination Alleviation in Generative Question Answering, by Haowei Du et al.
-
Summary of Baichuanseed: Sharing the Potential Of Extensive Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline, By Guosheng Dong et al.
-
Summary of Mtmamba++: Enhancing Multi-task Dense Scene Understanding Via Mamba-based Decoders, by Baijiong Lin et al.
-
Summary of Evaluating Stability Of Unreflective Alignment, by James Lucassen et al.
-
Summary of Aligning Xai with Eu Regulations For Smart Biomedical Devices: a Methodology For Compliance Analysis, by Francesco Sovrano et al.
-
Summary of Human-centric Video Anomaly Detection Through Spatio-temporal Pose Tokenization and Transformer, by Ghazal Alinezhad Noghre et al.
-
Summary of A Permuted Autoregressive Approach to Word-level Recognition For Urdu Digital Text, by Ahmed Mustafa et al.
-
Summary of Into the Unknown Unknowns: Engaged Human Learning Through Participation in Language Model Agent Conversations, by Yucheng Jiang et al.
-
Summary of Probing Causality Manipulation Of Large Language Models, by Chenyang Zhang et al.
-
Summary of Uncovering Knowledge Gaps in Radiology Report Generation Models Through Knowledge Graphs, by Xiaoman Zhang et al.
-
Summary of Chartom: a Visual Theory-of-mind Benchmark For Multimodal Large Language Models, by Shubham Bharti et al.
-
Summary of Medsage: Enhancing Robustness Of Medical Dialogue Summarization to Asr Errors with Llm-generated Synthetic Dialogues, by Kuluhan Binici et al.
-
Summary of Attend-fusion: Efficient Audio-visual Fusion For Video Classification, by Mahrukh Awan et al.
-
Summary of K-sort Arena: Efficient and Reliable Benchmarking For Generative Models Via K-wise Human Preferences, by Zhikai Li et al.
-
Summary of Revisiting Image Captioning Training Paradigm Via Direct Clip-based Optimization, by Nicholas Moratelli et al.
-
Summary of A Survey Of Camouflaged Object Detection and Beyond, by Fengyang Xiao et al.
-
Summary of Improving Clinical Note Generation From Complex Doctor-patient Conversation, by Yizhan Li et al.
-
Summary of Diagen: Diverse Image Augmentation with Generative Models, by Tobias Lingenberg et al.
-
Summary of Evince: Optimizing Multi-llm Dialogues Using Conditional Statistics and Information Theory, by Edward Y. Chang
-
Summary of On Centralized Critics in Multi-agent Reinforcement Learning, by Xueguang Lyu et al.
-
Summary of Effect Of Adaptation Rate and Cost Display in a Human-ai Interaction Game, by Jason T. Isa et al.
-
Summary of Bidirectional Emergent Language in Situated Environments, by Cornelius Wolff et al.
-
Summary of Training-free Activation Sparsity in Large Language Models, by James Liu et al.
-
Summary of Artificial Intelligence in Landscape Architecture: a Survey, by Yue Xing et al.
-
Summary of Rsteller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics From Openly Available Data and Large Language Models, by Junyao Ge et al.
-
Summary of A Global Ai Community Requires Language-diverse Publishing, by Haley Lepp et al.
-
Summary of Mrovseg: Breaking the Resolution Curse Of Vision-language Models in Open-vocabulary Image Segmentation, by Yuanbing Zhu et al.
-
Summary of Optimizing Structured Data Processing Through Robotic Process Automation, by Vivek Bhardwaj et al.
-
Summary of Pam: a Propagation-based Model For Segmenting Any 3d Objects Across Multi-modal Medical Images, by Zifan Chen et al.
-
Summary of Tangram: Benchmark For Evaluating Geometric Element Recognition in Large Multimodal Models, by Chao Zhang and Jiamin Tang and Jing Xiao
-
Summary of Codegraph: Enhancing Graph Reasoning Of Llms with Code, by Qiaolong Cai et al.
-
Summary of Llms Are Superior Feedback Providers: Bootstrapping Reasoning For Lie Detection with Self-generated Feedback, by Tanushree Banerjee et al.
-
Summary of Focused Large Language Models Are Stable Many-shot Learners, by Peiwen Yuan et al.
-
Summary of Geo-llama: Leveraging Llms For Human Mobility Trajectory Generation with Spatiotemporal Constraints, by Siyu Li et al.
-
Summary of Automatic Medical Report Generation: Methods and Applications, by Li Guo et al.
-
Summary of Lmm-vqa: Advancing Video Quality Assessment with Large Multimodal Models, by Qihang Ge et al.
-
Summary of Video-ccam: Enhancing Video-language Understanding with Causal Cross-attention Masks For Short and Long Videos, by Jiajun Fei et al.
-
Summary of Pixel-aligned Multi-view Generation with Depth Guided Decoder, by Zhenggang Tang et al.
-
Summary of Swiftbrush V2: Make Your One-step Diffusion Model Better Than Its Teacher, by Trung Dao et al.
-
Summary of Contrastive Learning Subspace For Text Clustering, by Qian Yong et al.
-
Summary of Dynamicroutegpt: a Real-time Multi-vehicle Dynamic Navigation Framework Based on Large Language Models, by Ziai Zhou et al.
-
Summary of I2ebench: a Comprehensive Benchmark For Instruction-based Image Editing, by Yiwei Ma et al.
-
Summary of Magicman: Generative Novel View Synthesis Of Humans with 3d-aware Diffusion and Iterative Refinement, by Xu He and Xiaoyu Li and Di Kang and Jiangnan Ye and Chaopeng Zhang and Liyang Chen and Xiangjun Gao and Han Zhang and Zhiyong Wu and Haolin Zhuang
-
Summary of Fact Probability Vector Based Goal Recognition, by Nils Wilken et al.
-
Summary of Beyond Few-shot Object Detection: a Detailed Survey, by Vishal Chudasama et al.
-
Summary of Text3daug — Prompted Instance Augmentation For Lidar Perception, by Laurenz Reichardt et al.
-
Summary of Claim Verification in the Age Of Large Language Models: a Survey, by Alphaeus Dmonte et al.
-
Summary of Towards Adaptive Human-centric Video Anomaly Detection: a Comprehensive Framework and a New Benchmark, by Armin Danesh Pazho et al.
-
Summary of Temporal Fairness in Decision Making Problems, by Manuel R. Torres et al.
-
Summary of Ensemble Modeling Of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder, by Marie Huynh (1) et al.
-
Summary of Enhancing Few-shot Transfer Learning with Optimized Multi-task Prompt Tuning Through Modular Prompt Composition, by Ahmad Pouramini et al.
-
Summary of Sin-nerf2nerf: Editing 3d Scenes with Instructions Through Segmentation and Inpainting, by Jiseung Hong et al.
-
Summary of N-drivermotion: Driver Motion Learning and Prediction Using An Event-based Camera and Directly Trained Spiking Neural Networks on Loihi 2, by Hyo Jong Chung et al.
-
Summary of Optimizing Collaboration Of Llm Based Agents For Finite Element Analysis, by Chuan Tian and Yilei Zhang
-
Summary of Probing the Robustness Of Vision-language Pretrained Models: a Multimodal Adversarial Attack Approach, by Jiwei Guan et al.
-
Summary of Make Every Penny Count: Difficulty-adaptive Self-consistency For Cost-efficient Reasoning, by Xinglin Wang et al.
-
Summary of Balancing Diversity and Risk in Llm Sampling: How to Select Your Method and Parameter For Open-ended Text Generation, by Yuxuan Zhou et al.
-
Summary of Anople: Few-shot Anomaly Detection Via Bi-directional Prompt Learning with Only Normal Samples, by Yujin Lee et al.
-
Summary of Evaluating Alternative Training Interventions Using Personalized Computational Models Of Learning, by Christopher James Maclellan et al.
-
Summary of Dhp Benchmark: Are Llms Good Nlg Evaluators?, by Yicheng Wang et al.
-
Summary of Gpt-4 Emulates Average-human Emotional Cognition From a Third-person Perspective, by Ala N. Tak and Jonathan Gratch
-
Summary of Count-based Novelty Exploration in Classical Planning, by Giacomo Rosa and Nir Lipovetzky
-
Summary of Doce: Finding the Sweet Spot For Execution-based Code Generation, by Haau-sing Li et al.
-
Summary of Multi-agent Target Assignment and Path Finding For Intelligent Warehouse: a Cooperative Multi-agent Deep Reinforcement Learning Perspective, by Qi Liu et al.
-
Summary of Multimodal Ensemble with Conditional Feature Fusion For Dysgraphia Diagnosis in Children From Handwriting Samples, by Jayakanth Kunhoth et al.
-
Summary of Guardians Of the Machine Translation Meta-evaluation: Sentinel Metrics Fall In!, by Stefano Perrella et al.
-
Summary of Localization Of Synthetic Manipulations in Western Blot Images, by Anmol Manjunath et al.
-
Summary of Cllmfs: a Contrastive Learning Enhanced Large Language Model Framework For Few-shot Named Entity Recognition, by Yafeng Zhang et al.
-
Summary of Can Ai Assistance Aid in the Grading Of Handwritten Answer Sheets?, by Pritam Sil et al.
-
Summary of Deepdiveai: Identifying Ai Related Documents in Large Scale Literature Data, by Zhou Xiaochen et al.
-
Summary of Frequency-aware Feature Fusion For Dense Image Prediction, by Linwei Chen et al.
-
Summary of Has Multimodal Learning Delivered Universal Intelligence in Healthcare? a Comprehensive Survey, by Qika Lin et al.
-
Summary of Spatio-temporal Road Traffic Prediction Using Real-time Regional Knowledge, by Sumin Han et al.
-
Summary of Multiple Areal Feature Aware Transportation Demand Prediction, by Sumin Han et al.
-
Summary of What Do You Want? User-centric Prompt Generation For Text-to-image Synthesis Via Multi-turn Guidance, by Yilun Liu et al.
-
Summary of Trustworthy, Responsible, and Safe Ai: a Comprehensive Architectural Framework For Ai Safety with Challenges and Mitigations, by Chen Chen et al.
-
Summary of Isee: Advancing Multi-shot Explainable Ai Using Case-based Recommendations, by Anjana Wijekoon et al.
-
Summary of Causal-guided Active Learning For Debiasing Large Language Models, by Li Du et al.
-
Summary of Multimodal Contrastive In-context Learning, by Yosuke Miyanishi et al.
-
Summary of Qd-vmr: Query Debiasing with Contextual Understanding Enhancement For Video Moment Retrieval, by Chenghua Gao et al.
-
Summary of Cruxeval-x: a Benchmark For Multilingual Code Reasoning, Understanding and Execution, by Ruiyang Xu et al.
-
Summary of Vfm-det: Towards High-performance Vehicle Detection Via Large Foundation Models, by Wentao Wu et al.
-
Summary of Map-free Visual Relocalization Enhanced by Instance Knowledge and Depth Knowledge, By Mingyu Xiao et al.
-
Summary of Shapeicp: Iterative Category-level Object Pose and Shape Estimation From Depth, by Yihao Zhang and John J. Leonard
-
Summary of Say No to Freeloader: Protecting Intellectual Property Of Your Deep Model, by Lianyu Wang et al.
-
Summary of Instruct-deberta: a Hybrid Approach For Aspect-based Sentiment Analysis on Textual Reviews, by Dineth Jayakody et al.
-
Summary of Domaineval: An Auto-constructed Benchmark For Multi-domain Code Generation, by Qiming Zhu et al.
-
Summary of Continual Gesture Learning Without Data Via Synthetic Feature Sampling, by Zhenyu Lu et al.
-
Summary of Enhancing Transferability Of Adversarial Attacks with Ge-advgan+: a Comprehensive Framework For Gradient Editing, by Zhibo Jin et al.
-
Summary of Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework and Methods From Neuroscience, by Zhonghao He et al.
-
Summary of Can Llms Understand Social Norms in Autonomous Driving Games?, by Boxuan Wang et al.
-
Summary of Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-supervised Learning, by Swann Bessa et al.
-
Summary of Unlocking Intrinsic Fairness in Stable Diffusion, by Eunji Kim et al.