Paper List
We recommend you use the search box as this list is very long.
-
Summary of Gradient-weighted Feature Back-projection: a Fast Alternative to Feature Distillation in 3d Gaussian Splatting, by Joji Joseph et al.
-
Summary of Beyond Visual Understanding: Introducing Parrot-360v For Vision Language Model Benchmarking, by Harsha Vardhan Khurdula et al.
-
Summary of Adversarial Prompt Distillation For Vision-language Models, by Lin Luo et al.
-
Summary of Unveiling User Preferences: a Knowledge Graph and Llm-driven Approach For Conversational Recommendation, by Zhangchi Qiu et al.
-
Summary of Towards Next-generation Medical Agent: How O1 Is Reshaping Decision-making in Medical Scenarios, by Shaochen Xu et al.
-
Summary of Leveraging Ai and Nlp For Bank Marketing: a Systematic Review and Gap Analysis, by Christopher Gerling et al.
-
Summary of Learning to Ask: Conversational Product Search Via Representation Learning, by Jie Zou et al.
-
Summary of Popular Llms Amplify Race and Gender Disparities in Human Mobility, by Xinhua Wu and Qi R. Wang
-
Summary of Ranking Unraveled: Recipes For Llm Rankings in Head-to-head Ai Combat, by Roland Daynauth et al.
-
Summary of Streetviewllm: Extracting Geographic Information Using a Chain-of-thought Multimodal Large Language Model, by Zongrong Li et al.
-
Summary of The Impossible Test: a 2024 Unsolvable Dataset and a Chance For An Agi Quiz, by David Noever et al.
-
Summary of Mediating Modes Of Thought: Llm’s For Design Scripting, by Moritz Rietschel et al.
-
Summary of Ensuring Safety and Trust: Analyzing the Risks Of Large Language Models in Medicine, by Yifan Yang et al.
-
Summary of Robust Planning with Compound Llm Architectures: An Llm-modulo Approach, by Atharva Gundawar et al.
-
Summary of Ghostrnn: Reducing State Redundancy in Rnn with Cheap Operations, by Hang Zhou et al.
-
Summary of A Survey on Human-centric Llms, by Jing Yi Wang et al.
-
Summary of Star-agents: Automatic Data Optimization with Llm Agents For Instruction Tuning, by Hang Zhou et al.
-
Summary of Srsa: a Cost-efficient Strategy-router Search Agent For Real-world Human-machine Interactions, by Yaqi Wang et al.
-
Summary of Comparative Analysis Of Pooling Mechanisms in Llms: a Sentiment Analysis Perspective, by Jinming Xing et al.
-
Summary of Improving Mathematical Reasoning Capabilities Of Small Language Models Via Feedback-driven Distillation, by Xunyu Zhu et al.
-
Summary of Multiverse Of Greatness: Generating Story Branches with Llms, by Pittawat Taveekitworachai et al.
-
Summary of Universal and Context-independent Triggers For Precise Control Of Llm Outputs, by Jiashuo Liang et al.
-
Summary of Safety Without Semantic Disruptions: Editing-free Safe Image Generation Via Context-preserving Dual Latent Reconstruction, by Jordan Vice et al.
-
Summary of Mirror Target Yolo: An Improved Yolov8 Method with Indirect Vision For Heritage Buildings Fire Detection, by Jian Liang and Junsheng Cheng
-
Summary of Logic Augmented Generation, by Aldo Gangemi and Andrea Giovanni Nuzzolese
-
Summary of Llm-based Multi-agent Systems: Techniques and Business Perspectives, by Yingxuan Yang et al.
-
Summary of Uterine Ultrasound Image Captioning Using Deep Learning Techniques, by Abdennour Boulesnane et al.
-
Summary of Forecasting Future International Events: a Reliable Dataset For Text-based Event Modeling, by Daehoon Gwak et al.
-
Summary of Functionchat-bench: Comprehensive Evaluation Of Language Models’ Generative Capabilities in Korean Tool-use Dialogs, by Shinbok Lee et al.
-
Summary of Multi Lora Meets Vision: Merging Multiple Adapters to Create a Multi Task Model, by Ege Kesim et al.
-
Summary of Fopru: Focal Pruning For Efficient Large Vision-language Models, by Lei Jiang et al.
-
Summary of Towards Context-rich Automated Biodiversity Assessments: Deriving Ai-powered Insights From Camera Trap Data, by Paul Fergus et al.
-
Summary of Is This Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body, by Zeqing Wang et al.
-
Summary of Physics-informed Llm-agent For Automated Modulation Design in Power Electronics Systems, by Junhua Liu and Fanfan Lin and Xinze Li and Kwan Hui Lim and Shuai Zhao
-
Summary of Intent-aware Dialogue Generation and Multi-task Contrastive Learning For Multi-turn Intent Classification, by Junhua Liu and Yong Keat Tan and Bin Fu and Kwan Hui Lim
-
Summary of Knowledge Graphs, Large Language Models, and Hallucinations: An Nlp Perspective, by Ernests Lavrinovics et al.
-
Summary of Unifiedcrawl: Aggregated Common Crawl For Affordable Adaptation Of Llms on Low-resource Languages, by Bethel Melesse Tessema (1) et al.
-
Summary of Can Artificial Intelligence Generate Quality Research Topics Reflecting Patient Concerns?, by Jiyeong Kim et al.
-
Summary of Revisiting the Integration Of Convolution and Attention For Vision Backbone, by Lei Zhu et al.
-
Summary of Rv4chatbot: Are Chatbots Allowed to Dream Of Electric Sheep?, by Andrea Gatti (university Of Genoa) et al.
-
Summary of Resolving Multiple-dynamic Model Uncertainty in Hypothesis-driven Belief-mdps, by Ofer Dagan et al.
-
Summary of Videoautoarena: An Automated Arena For Evaluating Large Multimodal Models in Video Analysis Through User Simulation, by Ziyang Luo et al.
-
Summary of Fact-level Confidence Calibration and Self-correction, by Yige Yuan et al.
-
Summary of Unification Of Balti and Trans-border Sister Dialects in the Essence Of Llms and Ai Technology, by Muhammad Sharif et al.
-
Summary of A Resource Efficient Fusion Network For Object Detection in Bird’s-eye View Using Camera and Raw Radar Data, by Kavin Chandrasekaran et al.
-
Summary of Limba: An Open-source Framework For the Preservation and Valorization Of Low-resource Languages Using Generative Models, by Salvatore Mario Carta et al.
-
Summary of Patentedits: Framing Patent Novelty As Textual Entailment, by Ryan Lee et al.
-
Summary of Advancing Complex Medical Communication in Arabic with Sporo Arasum: Surpassing Existing Large Language Models, by Chanseo Lee et al.
-
Summary of Balrog: Benchmarking Agentic Llm and Vlm Reasoning on Games, by Davide Paglieri et al.
-
Summary of Entropy Bootstrapping For Weakly Supervised Nuclei Detection, by James Willoughby et al.
-
Summary of Amsnet-kg: a Netlist Dataset For Llm-based Ams Circuit Auto-design Using Knowledge Graph Rag, by Yichen Shi et al.
-
Summary of Integrated Water Resource Management in the Segura Hydrographic Basin: An Artificial Intelligence Approach, by Urtzi Otamendi et al.
-
Summary of Addrllm: Address Rewriting Via Large Language Model on Nationwide Logistics Data, by Qinchen Yang et al.
-
Summary of Unveiling Redundancy in Diffusion Transformers (dits): a Systematic Study, by Xibo Sun et al.
-
Summary of Improved Gui Grounding Via Iterative Narrowing, by Anthony Nguyen
-
Summary of Enhancing Bidirectional Sign Language Communication: Integrating Yolov8 and Nlp For Real-time Gesture Recognition & Translation, by Hasnat Jamil Bhuiyan et al.
-
Summary of No Free Delivery Service: Epistemic Limits Of Passive Data Collection in Complex Social Systems, by Maximilian Nickel
-
Summary of Benchmarking Gpt-4 Against Human Translators: a Comprehensive Evaluation Across Languages, Domains, and Expertise Levels, by Jianhao Yan et al.
-
Summary of Piors: Personalized Intelligent Outpatient Reception Based on Large Language Model with Multi-agents Medical Scenario Simulation, by Zhijie Bao et al.
-
Summary of Separable Mixture Of Low-rank Adaptation For Continual Visual Instruction Tuning, by Ziqi Wang et al.
-
Summary of Xagents: a Framework For Interpretable Rule-based Multi-agents Cooperation, by Hailong Yang et al.
-
Summary of An Exploration Of the Effect Of Quantisation on Energy Consumption and Inference Time Of Starcoder2, by Pepijn De Reus et al.
-
Summary of A Novel Approach to Eliminating Hallucinations in Large Language Model-assisted Causal Discovery, by Grace Sng et al.
-
Summary of Sefd: Semantic-enhanced Framework For Detecting Llm-generated Text, by Weiqing He et al.
-
Summary of Playing Language Game with Llms Leads to Jailbreaking, by Yu Peng et al.
-
Summary of Suicide Risk Assessment on Social Media with Semi-supervised Learning, by Max Lovitt et al.
-
Summary of Visual Cue Enhancement and Dual Low-rank Adaptation For Efficient Visual Instruction Fine-tuning, by Pengkun Jiao et al.
-
Summary of Visual-oriented Fine-grained Knowledge Editing For Multimodal Large Language Models, by Zhen Zeng et al.
-
Summary of Declare and Justify: Explicit Assumptions in Ai Evaluations Are Necessary For Effective Regulation, by Peter Barnett et al.
-
Summary of Probing the Capacity Of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction, by Sonny George et al.
-
Summary of The Game-theoretic Symbiosis Of Trust and Ai in Networked Systems, by Yunfei Ge and Quanyan Zhu
-
Summary of Real-time Energy-optimal Path Planning For Electric Vehicles, by Saman Ahmadi et al.
-
Summary of Lavida Drive: Vision-text Interaction Vlm For Autonomous Driving with Token Selection, Recovery and Enhancement, by Siwen Jiao et al.
-
Summary of Unsupervised Homography Estimation on Multimodal Image Pair Via Alternating Optimization, by Sanghyeob Song et al.
-
Summary of Video-rag: Visually-aligned Retrieval-augmented Long Video Comprehension, by Yongdong Luo et al.
-
Summary of Song Form-aware Full-song Text-to-lyrics Generation with Multi-level Granularity Syllable Count Control, by Yunkee Chae et al.
-
Summary of Graphcl: Graph-based Clustering For Semi-supervised Medical Image Segmentation, by Mengzhu Wang et al.
-
Summary of Aglp: a Graph Learning Perspective For Semi-supervised Domain Adaptation, by Houcheng Su et al.
-
Summary of Cross-camera Distracted Driver Classification Through Feature Disentanglement and Contrastive Learning, by Simone Bianco et al.
-
Summary of Xmask3d: Cross-modal Mask Reasoning For Open Vocabulary 3d Semantic Segmentation, by Ziyi Wang et al.
-
Summary of Evaluating Tokenizer Performance Of Large Language Models Across Official Indian Languages, by S. Tamang and D. J. Bora
-
Summary of Sseditor: Controllable Mask-to-scene Generation with Diffusion Model, by Haowen Zheng and Yanyan Liang
-
Summary of Efficient Training in Multi-agent Reinforcement Learning: a Communication-free Framework For the Box-pushing Problem, by David Ge et al.
-
Summary of Balancing Accuracy and Efficiency in Multi-turn Intent Classification For Llm-powered Dialog Systems in Production, by Junhua Liu and Yong Keat Tan and Bin Fu and Kwan Hui Lim
-
Summary of Clip Unreasonable Potential in Single-shot Face Recognition, by Nhan T. Luu
-
Summary of Do Llms Understand Ambiguity in Text? a Case Study in Open-world Question Answering, by Aryan Keluskar et al.
-
Summary of Evaluating the Prompt Steerability Of Large Language Models, by Erik Miehling et al.
-
Summary of Preference-conditioned Gradient Variations For Multi-objective Quality-diversity, by Hannah Janmohamed et al.
-
Summary of Exploring Iterative Controllable Summarization with Large Language Models, by Sangwon Ryu et al.
-
Summary of Rethinking Top Probability From Multi-view For Distracted Driver Behaviour Localization, by Quang Vinh Nguyen et al.
-
Summary of Recall and Refine: a Simple but Effective Source-free Open-set Domain Adaptation Framework, by Ismail Nejjar et al.
-
Summary of Topological Symmetry Enhanced Graph Convolution For Skeleton-based Action Recognition, by Zeyu Liang et al.
-
Summary of Leveraging Mllm Embeddings and Attribute Smoothing For Compositional Zero-shot Learning, by Xudong Yan et al.
-
Summary of Whisper Finetuning on Nepali Language, by Sanjay Rijal et al.
-
Summary of Thinking Before Looking: Improving Multimodal Llm Reasoning Via Mitigating Visual Hallucination, by Haojie Zheng et al.
-
Summary of Neurosymbolic Graph Enrichment For Grounded World Models, by Stefano De Giorgis et al.
-
Summary of Deep Learning-driven Heat Map Analysis For Evaluating Thickness Of Wounded Skin Layers, by Devakumar Gr et al.