Paper List
We recommend you use the search box as this list is very long.
-
Summary of Feature Fusion For Human Activity Recognition Using Parameter-optimized Multi-stage Graph Convolutional Network and Transformer Models, by Mohammad Belal (1) et al.
-
Summary of Hacking a Surrogate Model Approach to Xai, by Alexander Wilhelm and Katharina A. Zweig
-
Summary of Evaluation Of Language Models in the Medical Context Under Resource-constrained Settings, by Andrea Posada et al.
-
Summary of Vision-language Consistency Guided Multi-modal Prompt Learning For Blind Ai Generated Image Quality Assessment, by Jun Fu et al.
-
Summary of Cave: Controllable Authorship Verification Explanations, by Sahana Ramnath et al.
-
Summary of Causalmmm: Learning Causal Structure For Marketing Mix Modeling, by Chang Gong et al.
-
Summary of Expected Runtime Comparisons Between Breadth-first Search and Constant-depth Restarting Random Walks, by Daniel Platnick et al.
-
Summary of Olympicarena Medal Ranks: Who Is the Most Intelligent Ai So Far?, by Zhen Huang et al.
-
Summary of Blending Llms Into Cascaded Speech Translation: Kit’s Offline Speech Translation System For Iwslt 2024, by Sai Koneru et al.
-
Summary of The Progression Of Transformers From Language to Vision to Mot: a Literature Review on Multi-object Tracking with Transformers, by Abhi Kamboj
-
Summary of Lottery Ticket Adaptation: Mitigating Destructive Interference in Llms, by Ashwinee Panda et al.
-
Summary of Stablenormal: Reducing Diffusion Variance For Stable and Sharp Normal, by Chongjie Ye et al.
-
Summary of Memorizing Documents with Guidance in Large Language Models, by Bumjin Park and Jaesik Choi
-
Summary of Database-augmented Query Representation For Information Retrieval, by Soyeong Jeong et al.
-
Summary of Harvesting Events From Multiple Sources: Towards a Cross-document Event Extraction Paradigm, by Qiang Gao et al.
-
Summary of Zero-shot Cross-lingual Ner Using Phonemic Representations For Low-resource Languages, by Jimin Sohn et al.
-
Summary of Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models, by Junyi Zhu et al.
-
Summary of Eerpd: Leveraging Emotion and Emotion Regulation For Improving Personality Detection, by Zheng Li et al.
-
Summary of Continuous Output Personality Detection Models Via Mixed Strategy Training, by Rong Wang et al.
-
Summary of One Thousand and One Pairs: a “novel” Challenge For Long-context Language Models, by Marzena Karpinska et al.
-
Summary of Multi-scale Temporal Difference Transformer For Video-text Retrieval, by Ni Wang et al.
-
Summary of Combining Supervised Learning and Reinforcement Learning For Multi-label Classification Tasks with Partial Labels, by Zixia Jia et al.
-
Summary of Repairing Catastrophic-neglect in Text-to-image Diffusion Models Via Attention-guided Feature Enhancement, by Zhiyuan Chang et al.
-
Summary of Video-infinity: Distributed Long Video Generation, by Zhenxiong Tan et al.
-
Summary of Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other, by Yifei Gao et al.
-
Summary of Langsuite: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments, by Zixia Jia et al.
-
Summary of Ubiss: a Unified Framework For Bimodal Semantic Summarization Of Videos, by Yuting Mei et al.
-
Summary of Pruning Via Merging: Compressing Llms Via Manifold Alignment Based Layer Merging, by Deyuan Liu et al.
-
Summary of Prompt-consistency Image Generation (pcig): a Unified Framework Integrating Llms, Knowledge Graphs, and Controllable Diffusion Models, by Yichen Sun et al.
-
Summary of Directed Domain Fine-tuning: Tailoring Separate Modalities For Specific Training Tasks, by Daniel Wen and Nafisa Hussain
-
Summary of Exploring Cross-domain Few-shot Classification Via Frequency-aware Prompting, by Tiange Zhang et al.
-
Summary of On the Transformations Across Reward Model, Parameter Update, and In-context Prompt, by Deng Cai and Huayang Li and Tingchen Fu and Siheng Li and Weiwen Xu and Shuaiyi Li and Bowen Cao and Zhisong Zhang and Xinting Huang and Leyang Cui and Yan Wang and Lemao Liu and Taro Watanabe and Shuming Shi
-
Summary of Crisissense-llm: Instruction Fine-tuned Large Language Model For Multi-label Social Media Text Classification in Disaster Informatics, by Kai Yin et al.
-
Summary of Code-switching Red-teaming: Llm Evaluation For Safety and Multilingual Understanding, by Haneul Yoo et al.
-
Summary of Jobfair: a Framework For Benchmarking Gender Hiring Bias in Large Language Models, by Ze Wang et al.
-
Summary of Pku-saferlhf: Towards Multi-level Safety Alignment For Llms with Human Preference, by Jiaming Ji et al.
-
Summary of Torchspatial: a Location Encoding Framework and Benchmark For Spatial Representation Learning, by Nemin Wu et al.
-
Summary of Large Language Models Have Intrinsic Self-correction Ability, by Dancheng Liu et al.
-
Summary of Acoustic Feature Mixup For Balanced Multi-aspect Pronunciation Assessment, by Heejin Do et al.
-
Summary of Ai-driven Approaches For Optimizing Power Consumption: a Comprehensive Survey, by Parag Biswas et al.
-
Summary of Identifying and Solving Conditional Image Leakage in Image-to-video Diffusion Model, by Min Zhao et al.
-
Summary of Fine-grained Background Representation For Weakly Supervised Semantic Segmentation, by Xu Yin et al.
-
Summary of Hcqa @ Ego4d Egoschema Challenge 2024, by Haoyu Zhang et al.
-
Summary of Data Issues in Industrial Ai System: a Meta-review and Research Strategy, by Xuejiao Li et al.
-
Summary of Objectnlq @ Ego4d Episodic Memory Challenge 2024, by Yisen Feng et al.
-
Summary of Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts, by Louis Give et al.
-
Summary of Simsmoe: Solving Representational Collapse Via Similarity Measure, by Giang Do et al.
-
Summary of Sedmamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-coarse Temporal Fusion For Efficient Error Detection in Robot-assisted Surgery, by Jialang Xu et al.
-
Summary of Beyond the Doors Of Perception: Vision Transformers Represent Relations Between Objects, by Michael A. Lepori et al.
-
Summary of Evaluating the Effectiveness Of the Foundational Models For Q&a Classification in Mental Health Care, by Hassan Alhuzali and Ashwag Alasmari
-
Summary of Enhancing Cross-document Event Coreference Resolution by Discourse Structure and Semantic Information, By Qiang Gao et al.
-
Summary of Uda: a Benchmark Suite For Retrieval Augmented Generation in Real-world Document Analysis, by Yulong Hui et al.
-
Summary of Exploring the Efficacy Of Robotic Assistants with Chatgpt and Claude in Enhancing Adhd Therapy: Innovating Treatment Paradigms, by Santiago Berrezueta-guzman et al.
-
Summary of How Effective Is Gpt-4 Turbo in Generating School-level Questions From Textbooks Based on Bloom’s Revised Taxonomy?, by Subhankar Maity et al.
-
Summary of Deep Uav Path Planning with Assured Connectivity in Dense Urban Setting, by Jiyong Oh et al.
-
Summary of Videoscore: Building Automatic Metrics to Simulate Fine-grained Human Feedback For Video Generation, by Xuan He et al.
-
Summary of Towards Robust Training Datasets For Machine Learning with Ontologies: a Case Study For Emergency Road Vehicle Detection, by Lynn Vonderhaar et al.
-
Summary of Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment Of Large Vision-language Model, by Siyin Wang et al.
-
Summary of Longrag: Enhancing Retrieval-augmented Generation with Long-context Llms, by Ziyan Jiang et al.
-
Summary of An End-to-end, Segmentation-free, Arabic Handwritten Recognition Model on Khatt, by Sondos Aabed et al.
-
Summary of Bug in the Code Stack: Can Llms Find Bugs in Large Python Code Stacks, by Hokyung Lee et al.
-
Summary of Enhancing Large Language Model Performance with Gradient-based Parameter Selection, by Haoling Li et al.
-
Summary of Image Conductor: Precision Control For Interactive Video Synthesis, by Yaowei Li et al.
-
Summary of An Exploratory Study on Human-centric Video Anomaly Detection Through Variational Autoencoders and Trajectory Prediction, by Ghazal Alinezhad Noghre et al.
-
Summary of Automatic Parking Planning Control Method Based on Improved A* Algorithm, by Yuxuan Zhao
-
Summary of Automated Parking Planning with Vision-based Bev Approach, by Yuxuan Zhao
-
Summary of Exu: Ai Models For Examining Multilingual Disinformation Narratives and Understanding Their Spread, by Jake Vasilakes et al.
-
Summary of Radex: a Framework For Structured Information Extraction From Radiology Reports Based on Large Language Models, by Daniel Reichenpfader et al.
-
Summary of Mental Disorder Classification Via Temporal Representation Of Text, by Raja Kumar et al.
-
Summary of Intertwining Cp and Nlp: the Generation Of Unreasonably Constrained Sentences, by Alexandre Bonlarron et al.
-
Summary of Wundtgpt: Shaping Large Language Models to Be An Empathetic, Proactive Psychologist, by Chenyu Ren et al.
-
Summary of Automated Architectural Space Layout Planning Using a Physics-inspired Generative Design Framework, by Zhipeng Li et al.
-
Summary of Is a Picture Worth a Thousand Words? Delving Into Spatial Reasoning For Vision Language Models, by Jiayu Wang et al.
-
Summary of From Llms to Mllms: Exploring the Landscape Of Multimodal Jailbreaking, by Siyuan Wang et al.
-
Summary of Ai-based Anomaly Detection For Clinical-grade Histopathological Diagnostics, by Jonas Dippel et al.
-
Summary of Peano-vit: Power-efficient Approximations Of Non-linearities in Vision Transformers, by Mohammad Erfan Sadeghi et al.
-
Summary of Talking the Talk Does Not Entail Walking the Walk: on the Limits Of Large Language Models in Lexical Entailment Recognition, by Candida M. Greco et al.
-
Summary of Giebench: Towards Holistic Evaluation Of Group Identity-based Empathy For Large Language Models, by Leyan Wang et al.
-
Summary of Towards Retrieval Augmented Generation Over Large Video Libraries, by Yannis Tevissen et al.
-
Summary of Autonomous Agents For Collaborative Task Under Information Asymmetry, by Wei Liu et al.
-
Summary of Trustworthy Enhanced Multi-view Multi-modal Alzheimer’s Disease Prediction with Brain-wide Imaging Transcriptomics Data, by Shan Cong et al.
-
Summary of Ceasefire: An Ai-powered System For Combatting Illicit Firearms Trafficking, by Jorgen Cani et al.
-
Summary of Do Large Language Models Exhibit Cognitive Dissonance? Studying the Difference Between Revealed Beliefs and Stated Answers, by Manuel Mondal et al.
-
Summary of Unveiling the Impact Of Multi-modal Interactions on User Engagement: a Comprehensive Evaluation in Ai-driven Conversations, by Lichao Zhang et al.
-
Summary of Routefinder: Towards Foundation Models For Vehicle Routing Problems, by Federico Berto et al.
-
Summary of Fair, Manipulation-robust, and Transparent Sortition, by Carmel Baharav et al.
-
Summary of Giusberto: a Legal Language Model For Personal Data De-identification in Italian Court Of Auditors Decisions, by Giulio Salierno et al.
-
Summary of Knobtree: Intelligent Database Parameter Configuration Via Explainable Reinforcement Learning, by Jiahan Chen et al.
-
Summary of Assessing Good, Bad and Ugly Arguments Generated by Chatgpt: a New Dataset, Its Methodology and Associated Tasks, By Victor Hugo Nascimento Rocha et al.
-
Summary of This Actually Looks Like That: Proto-bagnets For Local and Global Interpretability-by-design, by Kerol Djoumessi et al.
-
Summary of Enhancing Idiomatic Representation in Multiple Languages Via An Adaptive Contrastive Triplet Loss, by Wei He et al.
-
Summary of V-lasik: Consistent Glasses-removal From Videos Using Synthetic Data, by Rotem Shalev-arkushin et al.
-
Summary of Graphreader: Building Graph-based Agent to Enhance Long-context Abilities Of Large Language Models, by Shilong Li et al.
-
Summary of Whiteboard-of-thought: Thinking Step-by-step Across Modalities, by Sachit Menon and Richard Zemel and Carl Vondrick
-
Summary of Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal, by Tinghao Xie et al.
-
Summary of Can Llms Learn by Teaching For Better Reasoning? a Preliminary Study, By Xuefei Ning et al.
-
Summary of Holistic Evaluation For Interleaved Text-and-image Generation, by Minqian Liu et al.
-
Summary of Speech Prefix-tuning with Rnnt Loss For Improving Llm Predictions, by Murali Karthick Baskar et al.
-
Summary of Do Llms Have Distinct and Consistent Personality? Trait: Personality Testset Designed For Llms with Psychometrics, by Seungbeen Lee et al.