Paper List

We recommend you use the search box as this list is very long.

Summary of Image-guided Outdoor Lidar Perception Quality Assessment For Autonomous Driving, by Ce Zhang et al.
Summary of Predicting the Big Five Personality Traits in Chinese Counselling Dialogues Using Large Language Models, by Yang Yan et al.
Summary of Towards Open-set Camera 3d Object Detection, by Zhuolin He et al.
Summary of Combining Supervised Learning and Reinforcement Learning For Multi-label Classification Tasks with Partial Labels, by Zixia Jia et al.
Summary of Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other, by Yifei Gao et al.
Summary of Ubiss: a Unified Framework For Bimodal Semantic Summarization Of Videos, by Yuting Mei et al.
Summary of Pruning Via Merging: Compressing Llms Via Manifold Alignment Based Layer Merging, by Deyuan Liu et al.
Summary of Prompt-consistency Image Generation (pcig): a Unified Framework Integrating Llms, Knowledge Graphs, and Controllable Diffusion Models, by Yichen Sun et al.
Summary of Directed Domain Fine-tuning: Tailoring Separate Modalities For Specific Training Tasks, by Daniel Wen and Nafisa Hussain
Summary of Exploring Cross-domain Few-shot Classification Via Frequency-aware Prompting, by Tiange Zhang et al.
Summary of On the Transformations Across Reward Model, Parameter Update, and In-context Prompt, by Deng Cai and Huayang Li and Tingchen Fu and Siheng Li and Weiwen Xu and Shuaiyi Li and Bowen Cao and Zhisong Zhang and Xinting Huang and Leyang Cui and Yan Wang and Lemao Liu and Taro Watanabe and Shuming Shi
Summary of Guardrails For Avoiding Harmful Medical Product Recommendations and Off-label Promotion in Generative Ai Models, by Daniel Lopez-martinez
Summary of Dynamic Pseudo Label Optimization in Point-supervised Nuclei Segmentation, by Ziyue Wang et al.
Summary of D2sp: Dynamic Dual-stage Purification Framework For Dual Noise Mitigation in Vision-based Affective Recognition, by Haoran Wang et al.
Summary of Interclip-mep: Interactive Clip and Memory-enhanced Predictor For Multi-modal Sarcasm Detection, by Junjie Chen et al.
Summary of Towards Comprehensive Preference Data Collection For Reward Modeling, by Yulan Hu et al.
Summary of Otce: Hybrid Ssm and Attention with Cross Domain Mixture Of Experts to Construct Observer-thinker-conceiver-expresser, by Jingze Shi et al.
Summary of Carrot and Stick: Inducing Self-motivation with Positive & Negative Feedback, by Jimin Sohn et al.
Summary of Character-adapter: Prompt-guided Region Control For High-fidelity Character Customization, by Yuhang Ma et al.
Summary of Evaluation Of Language Models in the Medical Context Under Resource-constrained Settings, by Andrea Posada et al.
Summary of Hacking a Surrogate Model Approach to Xai, by Alexander Wilhelm and Katharina A. Zweig
Summary of Vision-language Consistency Guided Multi-modal Prompt Learning For Blind Ai Generated Image Quality Assessment, by Jun Fu et al.
Summary of Feature Fusion For Human Activity Recognition Using Parameter-optimized Multi-stage Graph Convolutional Network and Transformer Models, by Mohammad Belal (1) et al.
Summary of Objectnlq @ Ego4d Episodic Memory Challenge 2024, by Yisen Feng et al.
Summary of Data Issues in Industrial Ai System: a Meta-review and Research Strategy, by Xuejiao Li et al.
Summary of Simsmoe: Solving Representational Collapse Via Similarity Measure, by Giang Do et al.
Summary of Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts, by Louis Give et al.
Summary of Sedmamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-coarse Temporal Fusion For Efficient Error Detection in Robot-assisted Surgery, by Jialang Xu et al.
Summary of Beyond the Doors Of Perception: Vision Transformers Represent Relations Between Objects, by Michael A. Lepori et al.
Summary of Enhancing Cross-document Event Coreference Resolution by Discourse Structure and Semantic Information, By Qiang Gao et al.
Summary of Evaluating the Effectiveness Of the Foundational Models For Q&a Classification in Mental Health Care, by Hassan Alhuzali and Ashwag Alasmari
Summary of Memorizing Documents with Guidance in Large Language Models, by Bumjin Park and Jaesik Choi
Summary of Database-augmented Query Representation For Information Retrieval, by Soyeong Jeong et al.
Summary of Harvesting Events From Multiple Sources: Towards a Cross-document Event Extraction Paradigm, by Qiang Gao et al.
Summary of Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models, by Junyi Zhu et al.
Summary of Zero-shot Cross-lingual Ner Using Phonemic Representations For Low-resource Languages, by Jimin Sohn et al.
Summary of Eerpd: Leveraging Emotion and Emotion Regulation For Improving Personality Detection, by Zheng Li et al.
Summary of Continuous Output Personality Detection Models Via Mixed Strategy Training, by Rong Wang et al.
Summary of Multi-scale Temporal Difference Transformer For Video-text Retrieval, by Ni Wang et al.
Summary of One Thousand and One Pairs: a “novel” Challenge For Long-context Language Models, by Marzena Karpinska et al.
Summary of Video-infinity: Distributed Long Video Generation, by Zhenxiong Tan et al.
Summary of Repairing Catastrophic-neglect in Text-to-image Diffusion Models Via Attention-guided Feature Enhancement, by Zhiyuan Chang et al.
Summary of Langsuite: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments, by Zixia Jia et al.
Summary of Enhancing Large Language Model Performance with Gradient-based Parameter Selection, by Haoling Li et al.
Summary of Image Conductor: Precision Control For Interactive Video Synthesis, by Yaowei Li et al.
Summary of An Exploratory Study on Human-centric Video Anomaly Detection Through Variational Autoencoders and Trajectory Prediction, by Ghazal Alinezhad Noghre et al.
Summary of Exu: Ai Models For Examining Multilingual Disinformation Narratives and Understanding Their Spread, by Jake Vasilakes et al.
Summary of Radex: a Framework For Structured Information Extraction From Radiology Reports Based on Large Language Models, by Daniel Reichenpfader et al.
Summary of Mental Disorder Classification Via Temporal Representation Of Text, by Raja Kumar et al.
Summary of Intertwining Cp and Nlp: the Generation Of Unreasonably Constrained Sentences, by Alexandre Bonlarron et al.
Summary of Wundtgpt: Shaping Large Language Models to Be An Empathetic, Proactive Psychologist, by Chenyu Ren et al.
Summary of Code-switching Red-teaming: Llm Evaluation For Safety and Multilingual Understanding, by Haneul Yoo et al.
Summary of Crisissense-llm: Instruction Fine-tuned Large Language Model For Multi-label Social Media Text Classification in Disaster Informatics, by Kai Yin et al.
Summary of Pku-saferlhf: Towards Multi-level Safety Alignment For Llms with Human Preference, by Jiaming Ji et al.
Summary of Jobfair: a Framework For Benchmarking Gender Hiring Bias in Large Language Models, by Ze Wang et al.
Summary of Torchspatial: a Location Encoding Framework and Benchmark For Spatial Representation Learning, by Nemin Wu et al.
Summary of Acoustic Feature Mixup For Balanced Multi-aspect Pronunciation Assessment, by Heejin Do et al.
Summary of Large Language Models Have Intrinsic Self-correction Ability, by Dancheng Liu et al.
Summary of Ai-driven Approaches For Optimizing Power Consumption: a Comprehensive Survey, by Parag Biswas et al.
Summary of Rankadaptor: Hierarchical Rank Allocation For Efficient Fine-tuning Pruned Llms Via Performance Model, by Changhai Zhou et al.
Summary of Fine-grained Background Representation For Weakly Supervised Semantic Segmentation, by Xu Yin et al.
Summary of Identifying and Solving Conditional Image Leakage in Image-to-video Diffusion Model, by Min Zhao et al.
Summary of Hcqa @ Ego4d Egoschema Challenge 2024, by Haoyu Zhang et al.
Summary of Trustworthy Enhanced Multi-view Multi-modal Alzheimer’s Disease Prediction with Brain-wide Imaging Transcriptomics Data, by Shan Cong et al.
Summary of Do Large Language Models Exhibit Cognitive Dissonance? Studying the Difference Between Revealed Beliefs and Stated Answers, by Manuel Mondal et al.
Summary of Unveiling the Impact Of Multi-modal Interactions on User Engagement: a Comprehensive Evaluation in Ai-driven Conversations, by Lichao Zhang et al.
Summary of Routefinder: Towards Foundation Models For Vehicle Routing Problems, by Federico Berto et al.
Summary of Giusberto: a Legal Language Model For Personal Data De-identification in Italian Court Of Auditors Decisions, by Giulio Salierno et al.
Summary of Fair, Manipulation-robust, and Transparent Sortition, by Carmel Baharav et al.
Summary of Assessing Good, Bad and Ugly Arguments Generated by Chatgpt: a New Dataset, Its Methodology and Associated Tasks, By Victor Hugo Nascimento Rocha et al.
Summary of Knobtree: Intelligent Database Parameter Configuration Via Explainable Reinforcement Learning, by Jiahan Chen et al.
Summary of Enhancing Idiomatic Representation in Multiple Languages Via An Adaptive Contrastive Triplet Loss, by Wei He et al.
Summary of This Actually Looks Like That: Proto-bagnets For Local and Global Interpretability-by-design, by Kerol Djoumessi et al.
Summary of Uda: a Benchmark Suite For Retrieval Augmented Generation in Real-world Document Analysis, by Yulong Hui et al.
Summary of Exploring the Efficacy Of Robotic Assistants with Chatgpt and Claude in Enhancing Adhd Therapy: Innovating Treatment Paradigms, by Santiago Berrezueta-guzman et al.
Summary of How Effective Is Gpt-4 Turbo in Generating School-level Questions From Textbooks Based on Bloom’s Revised Taxonomy?, by Subhankar Maity et al.
Summary of Deep Uav Path Planning with Assured Connectivity in Dense Urban Setting, by Jiyong Oh et al.
Summary of Videoscore: Building Automatic Metrics to Simulate Fine-grained Human Feedback For Video Generation, by Xuan He et al.
Summary of Towards Robust Training Datasets For Machine Learning with Ontologies: a Case Study For Emergency Road Vehicle Detection, by Lynn Vonderhaar et al.
Summary of Longrag: Enhancing Retrieval-augmented Generation with Long-context Llms, by Ziyan Jiang et al.
Summary of An End-to-end, Segmentation-free, Arabic Handwritten Recognition Model on Khatt, by Sondos Aabed et al.
Summary of Bug in the Code Stack: Can Llms Find Bugs in Large Python Code Stacks, by Hokyung Lee et al.
Summary of Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment Of Large Vision-language Model, by Siyin Wang et al.
Summary of Training Next Generation Ai Users and Developers at Ncsa, by Daniel S. Katz et al.
Summary of Relation Extraction with Fine-tuned Large Language Models in Retrieval Augmented Generation Frameworks, by Sefika Efeoglu and Adrian Paschke
Summary of Scidmt: a Large-scale Corpus For Detecting Scientific Mentions, by Huitong Pan and Qi Zhang and Cornelia Caragea and Eduard Dragut and Longin Jan Latecki
Summary of An Adapter-based Unified Model For Multiple Spoken Language Processing Tasks, by Varsha Suresh et al.
Summary of A Large Language Model Outperforms Other Computational Approaches to the High-throughput Phenotyping Of Physician Notes, by Syed I. Munzir et al.
Summary of Compliance Cards: Automated Eu Ai Act Compliance Analyses Amidst a Complex Ai Supply Chain, by Bill Marino and Yaqub Chaudhary and Yulu Pi and Rui-jie Yew and Preslav Aleksandrov and Carwyn Rahman and William F. Shen and Isaac Robinson and Nicholas D. Lane
Summary of How Critically Can An Ai Think? a Framework For Evaluating the Quality Of Thinking Of Generative Artificial Intelligence, by Luke Zaphir et al.
Summary of A Learn-then-reason Model Towards Generalization in Knowledge Base Question Answering, by Lingxi Zhang and Jing Zhang and Yanling Wang and Cuiping Li and Hong Chen
Summary of Acr: a Benchmark For Automatic Cohort Retrieval, by Dung Ngoc Thai et al.
Summary of Camera-invariant Meta-learning Network For Single-camera-training Person Re-identification, by Jiangbo Pei et al.
Summary of Automated Architectural Space Layout Planning Using a Physics-inspired Generative Design Framework, by Zhipeng Li et al.
Summary of Is a Picture Worth a Thousand Words? Delving Into Spatial Reasoning For Vision Language Models, by Jiayu Wang et al.
Summary of Peano-vit: Power-efficient Approximations Of Non-linearities in Vision Transformers, by Mohammad Erfan Sadeghi et al.
Summary of From Llms to Mllms: Exploring the Landscape Of Multimodal Jailbreaking, by Siyuan Wang et al.
Summary of Ai-based Anomaly Detection For Clinical-grade Histopathological Diagnostics, by Jonas Dippel et al.
Summary of Talking the Talk Does Not Entail Walking the Walk: on the Limits Of Large Language Models in Lexical Entailment Recognition, by Candida M. Greco et al.
Summary of Autonomous Agents For Collaborative Task Under Information Asymmetry, by Wei Liu et al.