Paper List
We recommend you use the search box as this list is very long.
-
Summary of What Machine Learning Tells Us About the Mathematical Structure Of Concepts, by Jun Otsuka
-
Summary of Continual-learning-based Framework For Structural Damage Recognition, by Jiangpeng Shu et al.
-
Summary of Lrp4rag: Detecting Hallucinations in Retrieval-augmented Generation Via Layer-wise Relevance Propagation, by Haichuan Hu et al.
-
Summary of On Centralized Critics in Multi-agent Reinforcement Learning, by Xueguang Lyu et al.
-
Summary of Training-free Activation Sparsity in Large Language Models, by James Liu et al.
-
Summary of Bidirectional Emergent Language in Situated Environments, by Cornelius Wolff et al.
-
Summary of Artificial Intelligence in Landscape Architecture: a Survey, by Yue Xing et al.
-
Summary of Rsteller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics From Openly Available Data and Large Language Models, by Junyao Ge et al.
-
Summary of Mrovseg: Breaking the Resolution Curse Of Vision-language Models in Open-vocabulary Image Segmentation, by Yuanbing Zhu et al.
-
Summary of A Global Ai Community Requires Language-diverse Publishing, by Haley Lepp et al.
-
Summary of Optimizing Structured Data Processing Through Robotic Process Automation, by Vivek Bhardwaj et al.
-
Summary of Diffusion Based Semantic Outlier Generation Via Nuisance Awareness For Out-of-distribution Detection, by Suhee Yoon et al.
-
Summary of Brain-inspired Artificial Intelligence: a Comprehensive Review, by Jing Ren and Feng Xia
-
Summary of Project Shadow: Symbolic Higher-order Associative Deductive Reasoning on Wikidata Using Lm Probing, by Hanna Abi Akl
-
Summary of Vhakg: a Multi-modal Knowledge Graph Based on Synchronized Multi-view Videos Of Daily Activities, by Shusaku Egami et al.
-
Summary of Atoxia: Red-teaming Large Language Models with Target Toxic Answers, by Yuhao Du et al.
-
Summary of Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus Via Model-based Rl, by Jihwan Lee et al.
-
Summary of Neuralood: Improving Out-of-distribution Generalization Performance with Brain-machine Fusion Learning Framework, by Shuangchen Zhao et al.
-
Summary of Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress, by Ayomide Odumakinde et al.
-
Summary of Cvpt: Cross-attention Help Visual Prompt Tuning Adapt Visual Task, by Lingyun Huang et al.
-
Summary of Flexible Categorization Using Formal Concept Analysis and Dempster-shafer Theory, by Marcel Boersma et al.
-
Summary of Sequence-aware Pre-training For Echocardiography Probe Guidance, by Haojun Jiang et al.
-
Summary of Mamba2mil: State Space Duality Based Multiple Instance Learning For Computational Pathology, by Yuqi Zhang et al.
-
Summary of I2ebench: a Comprehensive Benchmark For Instruction-based Image Editing, by Yiwei Ma et al.
-
Summary of Magicman: Generative Novel View Synthesis Of Humans with 3d-aware Diffusion and Iterative Refinement, by Xu He and Xiaoyu Li and Di Kang and Jiangnan Ye and Chaopeng Zhang and Liyang Chen and Xiangjun Gao and Han Zhang and Zhiyong Wu and Haolin Zhuang
-
Summary of Dynamicroutegpt: a Real-time Multi-vehicle Dynamic Navigation Framework Based on Large Language Models, by Ziai Zhou et al.
-
Summary of Beyond Few-shot Object Detection: a Detailed Survey, by Vishal Chudasama et al.
-
Summary of Fact Probability Vector Based Goal Recognition, by Nils Wilken et al.
-
Summary of Text3daug — Prompted Instance Augmentation For Lidar Perception, by Laurenz Reichardt et al.
-
Summary of Claim Verification in the Age Of Large Language Models: a Survey, by Alphaeus Dmonte et al.
-
Summary of Towards Adaptive Human-centric Video Anomaly Detection: a Comprehensive Framework and a New Benchmark, by Armin Danesh Pazho et al.
-
Summary of Probing Causality Manipulation Of Large Language Models, by Chenyang Zhang et al.
-
Summary of Uncovering Knowledge Gaps in Radiology Report Generation Models Through Knowledge Graphs, by Xiaoman Zhang et al.
-
Summary of Medsage: Enhancing Robustness Of Medical Dialogue Summarization to Asr Errors with Llm-generated Synthetic Dialogues, by Kuluhan Binici et al.
-
Summary of Chartom: a Visual Theory-of-mind Benchmark For Multimodal Large Language Models, by Shubham Bharti et al.
-
Summary of Attend-fusion: Efficient Audio-visual Fusion For Video Classification, by Mahrukh Awan et al.
-
Summary of K-sort Arena: Efficient and Reliable Benchmarking For Generative Models Via K-wise Human Preferences, by Zhikai Li et al.
-
Summary of Revisiting Image Captioning Training Paradigm Via Direct Clip-based Optimization, by Nicholas Moratelli et al.
-
Summary of Improving Clinical Note Generation From Complex Doctor-patient Conversation, by Yizhan Li et al.
-
Summary of A Survey Of Camouflaged Object Detection and Beyond, by Fengyang Xiao et al.
-
Summary of Evince: Optimizing Multi-llm Dialogues Using Conditional Statistics and Information Theory, by Edward Y. Chang
-
Summary of Diagen: Diverse Image Augmentation with Generative Models, by Tobias Lingenberg et al.
-
Summary of Effect Of Adaptation Rate and Cost Display in a Human-ai Interaction Game, by Jason T. Isa et al.
-
Summary of Dhp Benchmark: Are Llms Good Nlg Evaluators?, by Yicheng Wang et al.
-
Summary of Count-based Novelty Exploration in Classical Planning, by Giacomo Rosa and Nir Lipovetzky
-
Summary of Gpt-4 Emulates Average-human Emotional Cognition From a Third-person Perspective, by Ala N. Tak and Jonathan Gratch
-
Summary of Doce: Finding the Sweet Spot For Execution-based Code Generation, by Haau-sing Li et al.
-
Summary of Multi-agent Target Assignment and Path Finding For Intelligent Warehouse: a Cooperative Multi-agent Deep Reinforcement Learning Perspective, by Qi Liu et al.
-
Summary of Multimodal Ensemble with Conditional Feature Fusion For Dysgraphia Diagnosis in Children From Handwriting Samples, by Jayakanth Kunhoth et al.
-
Summary of Localization Of Synthetic Manipulations in Western Blot Images, by Anmol Manjunath et al.
-
Summary of Guardians Of the Machine Translation Meta-evaluation: Sentinel Metrics Fall In!, by Stefano Perrella et al.
-
Summary of Pam: a Propagation-based Model For Segmenting Any 3d Objects Across Multi-modal Medical Images, by Zifan Chen et al.
-
Summary of Tangram: Benchmark For Evaluating Geometric Element Recognition in Large Multimodal Models, by Chao Zhang and Jiamin Tang and Jing Xiao
-
Summary of Codegraph: Enhancing Graph Reasoning Of Llms with Code, by Qiaolong Cai et al.
-
Summary of Llms Are Superior Feedback Providers: Bootstrapping Reasoning For Lie Detection with Self-generated Feedback, by Tanushree Banerjee et al.
-
Summary of Geo-llama: Leveraging Llms For Human Mobility Trajectory Generation with Spatiotemporal Constraints, by Siyu Li et al.
-
Summary of Focused Large Language Models Are Stable Many-shot Learners, by Peiwen Yuan et al.
-
Summary of Automatic Medical Report Generation: Methods and Applications, by Li Guo et al.
-
Summary of Lmm-vqa: Advancing Video Quality Assessment with Large Multimodal Models, by Qihang Ge et al.
-
Summary of Pixel-aligned Multi-view Generation with Depth Guided Decoder, by Zhenggang Tang et al.
-
Summary of Video-ccam: Enhancing Video-language Understanding with Causal Cross-attention Masks For Short and Long Videos, by Jiajun Fei et al.
-
Summary of Contrastive Learning Subspace For Text Clustering, by Qian Yong et al.
-
Summary of Swiftbrush V2: Make Your One-step Diffusion Model Better Than Its Teacher, by Trung Dao et al.
-
Summary of Cruxeval-x: a Benchmark For Multilingual Code Reasoning, Understanding and Execution, by Ruiyang Xu et al.
-
Summary of Vfm-det: Towards High-performance Vehicle Detection Via Large Foundation Models, by Wentao Wu et al.
-
Summary of Qd-vmr: Query Debiasing with Contextual Understanding Enhancement For Video Moment Retrieval, by Chenghua Gao et al.
-
Summary of Shapeicp: Iterative Category-level Object Pose and Shape Estimation From Depth, by Yihao Zhang and John J. Leonard
-
Summary of Map-free Visual Relocalization Enhanced by Instance Knowledge and Depth Knowledge, By Mingyu Xiao et al.
-
Summary of Say No to Freeloader: Protecting Intellectual Property Of Your Deep Model, by Lianyu Wang et al.
-
Summary of Instruct-deberta: a Hybrid Approach For Aspect-based Sentiment Analysis on Textual Reviews, by Dineth Jayakody et al.
-
Summary of Domaineval: An Auto-constructed Benchmark For Multi-domain Code Generation, by Qiming Zhu et al.
-
Summary of Temporal Fairness in Decision Making Problems, by Manuel R. Torres et al.
-
Summary of Enhancing Few-shot Transfer Learning with Optimized Multi-task Prompt Tuning Through Modular Prompt Composition, by Ahmad Pouramini et al.
-
Summary of Ensemble Modeling Of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder, by Marie Huynh (1) et al.
-
Summary of Sin-nerf2nerf: Editing 3d Scenes with Instructions Through Segmentation and Inpainting, by Jiseung Hong et al.
-
Summary of N-drivermotion: Driver Motion Learning and Prediction Using An Event-based Camera and Directly Trained Spiking Neural Networks on Loihi 2, by Hyo Jong Chung et al.
-
Summary of Optimizing Collaboration Of Llm Based Agents For Finite Element Analysis, by Chuan Tian and Yilei Zhang
-
Summary of Make Every Penny Count: Difficulty-adaptive Self-consistency For Cost-efficient Reasoning, by Xinglin Wang et al.
-
Summary of Anople: Few-shot Anomaly Detection Via Bi-directional Prompt Learning with Only Normal Samples, by Yujin Lee et al.
-
Summary of Balancing Diversity and Risk in Llm Sampling: How to Select Your Method and Parameter For Open-ended Text Generation, by Yuxuan Zhou et al.
-
Summary of Probing the Robustness Of Vision-language Pretrained Models: a Multimodal Adversarial Attack Approach, by Jiwei Guan et al.
-
Summary of Evaluating Alternative Training Interventions Using Personalized Computational Models Of Learning, by Christopher James Maclellan et al.
-
Summary of Real-time Posture Monitoring and Risk Assessment For Manual Lifting Tasks Using Mediapipe and Lstm, by Ereena Bagga and Ang Yang
-
Summary of Backdoorllm: a Comprehensive Benchmark For Backdoor Attacks on Large Language Models, by Yige Li et al.
-
Summary of Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-curation Of Training Corpora, by Joonho Lee et al.
-
Summary of A Safe Self-evolution Algorithm For Autonomous Driving Based on Data-driven Risk Quantification Model, by Shuo Yang et al.
-
Summary of Dutytte: Deciphering Uncertainty in Origin-destination Travel Time Estimation, by Xiaowei Mao et al.
-
Summary of Examining the Commitments and Difficulties Inherent in Multimodal Foundation Models For Street View Imagery, by Zhenyuan Yang et al.
-
Summary of Staircase Cascaded Fusion Of Lightweight Local Pattern Recognition and Long-range Dependencies For Structural Crack Segmentation, by Hui Liu et al.
-
Summary of Cllmfs: a Contrastive Learning Enhanced Large Language Model Framework For Few-shot Named Entity Recognition, by Yafeng Zhang et al.
-
Summary of Can Ai Assistance Aid in the Grading Of Handwritten Answer Sheets?, by Pritam Sil et al.
-
Summary of Exploring Machine Learning Models For Lung Cancer Level Classification: a Comparative Ml Approach, by Mohsen Asghari Ilani et al.
-
Summary of Deepdiveai: Identifying Ai Related Documents in Large Scale Literature Data, by Zhou Xiaochen et al.
-
Summary of Frequency-aware Feature Fusion For Dense Image Prediction, by Linwei Chen et al.
-
Summary of Spatio-temporal Road Traffic Prediction Using Real-time Regional Knowledge, by Sumin Han et al.
-
Summary of Has Multimodal Learning Delivered Universal Intelligence in Healthcare? a Comprehensive Survey, by Qika Lin et al.
-
Summary of Multiple Areal Feature Aware Transportation Demand Prediction, by Sumin Han et al.
-
Summary of What Do You Want? User-centric Prompt Generation For Text-to-image Synthesis Via Multi-turn Guidance, by Yilun Liu et al.
-
Summary of Trustworthy, Responsible, and Safe Ai: a Comprehensive Architectural Framework For Ai Safety with Challenges and Mitigations, by Chen Chen et al.