Paper List
We recommend you use the search box as this list is very long.
-
Summary of Commonit: Commonality-aware Instruction Tuning For Large Language Models Via Data Partitions, by Jun Rao et al.
-
Summary of Clipdrag: Combining Text-based and Drag-based Instructions For Image Editing, by Ziqi Jiang et al.
-
Summary of Scaling Parameter-constrained Language Models with Quality Data, by Ernie Chang et al.
-
Summary of Adaptive Masking Enhances Visual Grounding, by Sen Jia et al.
-
Summary of Mbds: a Multi-body Dynamics Simulation Dataset For Graph Networks Simulators, by Sheng Yang and Fengge Wu and Junsuo Zhao
-
Summary of Investigating and Mitigating Object Hallucinations in Pretrained Vision-language (clip) Models, by Yufang Liu et al.
-
Summary of Nlip_lab-iith Low-resource Mt System For Wmt24 Indic Mt Shared Task, by Pramit Sahoo et al.
-
Summary of Generating Bilingual Example Sentences with Large Language Models As Lexicography Assistants, by Raphael Merx et al.
-
Summary of Enriching Ontologies with Disjointness Axioms Using Large Language Models, by Elias Crum et al.
-
Summary of Towards a Benchmark For Large Language Models For Business Process Management Tasks, by Kiran Busch and Henrik Leopold
-
Summary of Grounded-videollm: Sharpening Fine-grained Temporal Grounding in Video Large Language Models, by Haibo Wang et al.
-
Summary of Comparative Analysis and Ensemble Enhancement Of Leading Cnn Architectures For Breast Cancer Classification, by Gary Murphy et al.
-
Summary of Comparing Zero-shot Self-explanations with Human Rationales in Text Classification, by Stephanie Brandl and Oliver Eberle
-
Summary of An X-ray Is Worth 15 Features: Sparse Autoencoders For Interpretable Radiology Report Generation, by Ahmed Abdulaal et al.
-
Summary of Llms Know More Than They Show: on the Intrinsic Representation Of Llm Hallucinations, by Hadas Orgad et al.
-
Summary of Curvature Diversity-driven Deformation and Domain Alignment For Point Cloud, by Mengxi Wu et al.
-
Summary of Unified Multimodal Interleaved Document Representation For Retrieval, by Jaewoo Lee and Joonho Ko and Jinheon Baek and Soyeong Jeong and Sung Ju Hwang
-
Summary of Domain-specific Retrieval-augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization, by Ryan C. Barron et al.
-
Summary of Avg-llava: a Large Multimodal Model with Adaptive Visual Granularity, by Zhibin Lan et al.
-
Summary of Justice or Prejudice? Quantifying Biases in Llm-as-a-judge, by Jiayi Ye et al.
-
Summary of Fakeshield: Explainable Image Forgery Detection and Localization Via Multi-modal Large Language Models, by Zhipei Xu et al.
-
Summary of Bovila: Bootstrapping Video-language Alignment Via Llm-based Self-questioning and Answering, by Jin Chen et al.
-
Summary of Complex-valued Convolutional Neural Network Classification Of Hand Gesture From Radar Images, by Shokooh Khandan
-
Summary of Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in Vqa, by Jian Lan et al.
-
Summary of Robust Symmetry Detection Via Riemannian Langevin Dynamics, by Jihyeon Je et al.
-
Summary of Navigation with Vlm Framework: Go to Any Language, by Zecheng Yin and Chonghao Cheng and Lizhen
-
Summary of Logic-free Building Automation: Learning the Control Of Room Facilities with Wall Switches and Ceiling Camera, by Hideya Ochiai et al.
-
Summary of Estimating Body Volume and Height Using 3d Data, by Vivek Ganesh Sonar et al.
-
Summary of Bipolar Fuzzy Relation Equations Systems Based on the Product T-norm, by M. Eugenia Cornejo et al.
-
Summary of Leveraging Retrieval Augment Approach For Multimodal Emotion Recognition Under Missing Modalities, by Qi Fan et al.
-
Summary of Better Instruction-following Through Minimum Bayes Risk, by Ian Wu et al.
-
Summary of Llama-berry: Pairwise Optimization For O1-like Olympiad-level Mathematical Reasoning, by Di Zhang et al.
-
Summary of Learning the Latent Rules Of a Game From Data: a Chess Story, by Ben Fauber
-
Summary of Iot-llm: Enhancing Real-world Iot Task Reasoning with Large Language Models, by Tuo An et al.
-
Summary of Strong Preferences Affect the Robustness Of Preference Models and Value Alignment, by Ziwei Xu et al.
-
Summary of Collective Critics For Creative Story Generation, by Minwook Bae et al.
-
Summary of Recurrent Few-shot Model For Document Verification, by Maxime Talarmain et al.
-
Summary of Revealing the Inherent Instructability Of Pre-trained Language Models, by Seokhyun An et al.
-
Summary of Mixed-session Conversation with Egocentric Memory, by Jihyoung Jang et al.
-
Summary of Dog-iqa: Standard-guided Zero-shot Mllm For Mix-grained Image Quality Assessment, by Kai Liu et al.
-
Summary of Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights From Multi-agent Collaboration, by Weikang Yuan et al.
-
Summary of Choices Are More Important Than Efforts: Llm Enables Efficient Multi-agent Exploration, by Yun Qu et al.
-
Summary of Contextual Document Embeddings, by John X. Morris et al.
-
Summary of A Schema-aware Logic Reformulation For Graph Reachability, by Davide Di Pierro and Stefano Ferilli
-
Summary of Intelligence at the Edge Of Chaos, by Shiyang Zhang et al.
-
Summary of Nl-eye: Abductive Nli For Images, by Mor Ventura et al.
-
Summary of Plots Unlock Time-series Understanding in Multimodal Models, by Mayank Daswani et al.
-
Summary of Grounded Answers For Multi-agent Decision-making Problem Through Generative World Model, by Zeyang Liu et al.
-
Summary of Undesirable Memorization in Large Language Models: a Survey, by Ali Satvaty et al.
-
Summary of Unsupervised Point Cloud Completion Through Unbalanced Optimal Transport, by Taekyung Lee et al.
-
Summary of Distilling An End-to-end Voice Assistant Without Instruction Training Data, by William Held et al.
-
Summary of Helmet: How to Evaluate Long-context Language Models Effectively and Thoroughly, by Howard Yen et al.
-
Summary of Lost-in-distance: Impact Of Contextual Proximity on Llm Performance in Graph Tasks, by Hamed Firooz et al.
-
Summary of Ulcergpt: a Multimodal Approach Leveraging Large Language and Vision Models For Diabetic Foot Ulcer Image Transcription, by Reza Basiri et al.
-
Summary of Zodiac: a Cardiologist-level Llm Framework For Multi-agent Diagnostics, by Yuan Zhou et al.
-
Summary of Quantifying the Gaps Between Translation and Native Perception in Training For Multimodal, Multilingual Retrieval, by Kyle Buettner et al.
-
Summary of Rlef: Grounding Code Llms in Execution Feedback with Reinforcement Learning, by Jonas Gehring et al.
-
Summary of Tracking Objects That Change in Appearance with Phase Synchrony, by Sabine Muzellec et al.
-
Summary of Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities Of Lrm O1, by Karthik Valmeekam et al.
-
Summary of A Llm-powered Automatic Grading Framework with Human-level Guidelines Optimization, by Yucheng Chu et al.
-
Summary of From Pixels to Tokens: Byte-pair Encoding on Quantized Visual Modalities, by Wanpeng Zhang et al.
-
Summary of Graphic: a Graph-based In-context Example Retrieval Model For Multi-step Reasoning, by Jiale Fu et al.
-
Summary of Can Language Models Take a Hint? Prompting For Controllable Contextualized Commonsense Inference, by Pedro Colon-hernandez et al.
-
Summary of Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models, by Yinhong Liu et al.
-
Summary of Codepmp: Scalable Preference Model Pretraining For Large Language Model Reasoning, by Huimu Yu et al.
-
Summary of Sca: Highly Efficient Semantic-consistent Unrestricted Adversarial Attack, by Zihao Pan et al.
-
Summary of How Much Can Rag Help the Reasoning Of Llm?, by Jingyu Liu et al.
-
Summary of Alphaedit: Null-space Constrained Knowledge Editing For Language Models, by Junfeng Fang et al.
-
Summary of A Comprehensive Survey Of Mamba Architectures For Medical Image Analysis: Classification, Segmentation, Restoration and Beyond, by Shubhi Bansal et al.
-
Summary of From Concrete to Abstract: a Multimodal Generative Approach to Abstract Concept Learning, by Haodong Xie et al.
-
Summary of Towards Comprehensive Detection Of Chinese Harmful Memes, by Junyu Lu et al.
-
Summary of Synco: Synthetic Hard Negatives For Contrastive Visual Representation Learning, by Nikolaos Giakoumoglou et al.
-
Summary of Bridging Context Gaps: Leveraging Coreference Resolution For Long Contextual Understanding, by Yanming Liu et al.
-
Summary of Trying to Be Human: Linguistic Traces Of Stochastic Empathy in Language Models, by Bennett Kleinberg et al.
-
Summary of Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia, by Miao Yu et al.
-
Summary of Why Context Matters in Vqa and Reasoning: Semantic Interventions For Vlm Input Modalities, by Kenza Amara et al.
-
Summary of Factalign: Long-form Factuality Alignment Of Large Language Models, by Chao-wei Huang and Yun-nung Chen
-
Summary of U-shaped and Inverted-u Scaling Behind Emergent Abilities Of Large Language Models, by Tung-yu Wu and Pei-yu Lo
-
Summary of Credes: Causal Reasoning Enhancement and Dual-end Searching For Solving Long-range Reasoning Problems Using Llms, by Kangsheng Wang et al.
-
Summary of Auto-demo Prompting: Leveraging Generated Outputs As Demonstrations For Enhanced Batch Prompting, by Longyu Feng et al.
-
Summary of Interpretable Contrastive Monte Carlo Tree Search Reasoning, by Zitian Gao et al.
-
Summary of Define: Enhancing Llm Decision-making with Factor Profiles and Analogical Reasoning, by Yebowen Hu et al.
-
Summary of Vitaglyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models, by Kailai Feng et al.
-
Summary of When a Language Model Is Optimized For Reasoning, Does It Still Show Embers Of Autoregression? An Analysis Of Openai O1, by R. Thomas Mccoy et al.
-
Summary of Fabricdiffusion: High-fidelity Texture Transfer For 3d Garments Generation From In-the-wild Clothing Images, by Cheng Zhang and Yuanhao Wang and Francisco Vicente Carrasco and Chenglei Wu and Jinlong Yang and Thabo Beeler and Fernando De La Torre
-
Summary of Samba: Synchronized Set-of-sequences Modeling For Multiple Object Tracking, by Mattia Segu et al.
-
Summary of Privacy-preserving Sam Quantization For Efficient Edge Intelligence in Healthcare, by Zhikai Li et al.
-
Summary of Automatic Scene Generation: State-of-the-art Techniques, Models, Datasets, Challenges, and Future Prospects, by Awal Ahmed Fime et al.
-
Summary of From Experts to the Public: Governing Multimodal Language Models in Politically Sensitive Video Analysis, by Tanusree Sharma et al.
-
Summary of A Spark Of Vision-language Intelligence: 2-dimensional Autoregressive Transformer For Efficient Finegrained Image Generation, by Liang Chen et al.
-
Summary of Enhancing Screen Time Identification in Children with a Multi-view Vision Language Model and Screen Time Tracker, by Xinlong Hou et al.
-
Summary of From Code to Correctness: Closing the Last Mile Of Code Generation with Hierarchical Debugging, by Yuling Shi et al.
-
Summary of Ahp-powered Llm Reasoning For Multi-criteria Evaluation Of Open-ended Responses, by Xiaotian Lu et al.
-
Summary of Fancric : Multi-agentic Framework For Crafting Fantasy 11 Cricket Teams, by Mohit Bhatnagar
-
Summary of Finetuning Pre-trained Model with Limited Data For Lidar-based 3d Object Detection by Bridging Domain Gaps, By Jiyun Jang et al.
-
Summary of Unveiling Language Skills Via Path-level Circuit Discovery, by Hang Chen and Jiaying Zhu and Xinyu Yang and Wenya Wang