Paper List
-
Summary of GhostRNN: Reducing State Redundancy in RNN with Cheap Operations, by Hang Zhou et al.
-
Summary of A Survey on Human-centric LLMs, by Jing Yi Wang et al.
-
Summary of SRSA: a Cost-efficient Strategy-router Search Agent For Real-world Human-machine Interactions, by Yaqi Wang et al.
-
Summary of Comparative Analysis Of Pooling Mechanisms in LLMs: a Sentiment Analysis Perspective, by Jinming Xing et al.
-
Summary of Multiverse Of Greatness: Generating Story Branches with LLMs, by Pittawat Taveekitworachai et al.
-
Summary of Improving Mathematical Reasoning Capabilities Of Small Language Models Via Feedback-driven Distillation, by Xunyu Zhu et al.
-
Summary of Universal and Context-independent Triggers For Precise Control Of LLM Outputs, by Jiashuo Liang et al.
-
Summary of Texgen: a Generative Diffusion Model For Mesh Textures, by Xin Yu et al.
-
Summary of Focus: Knowledge-enhanced Adaptive Visual Compression For Few-shot Whole Slide Image Classification, by Zhengrui Guo et al.
-
Summary of Resolution-agnostic Transformer-based Climate Downscaling, by Declan Curran and Hira Saleem and Sanaa Hobeichi and Flora Salim
-
Summary of KBAlign: Efficient Self Adaptation on Specific Knowledge Bases, by Zheni Zeng et al.
-
Summary of VideoEspresso: a Large-scale Chain-of-thought Dataset For Fine-grained Video Reasoning Via Core Frame Selection, by Songhao Han et al.
-
Summary of Dynamics-aware Gaussian Splatting Streaming Towards Fast On-the-fly Training For 4D Reconstruction, by Zhening Liu et al.
-
Summary of Design-o-meter: Towards Evaluating and Refining Graphic Designs, by Sahil Goyal et al.
-
Summary of Multi LoRA Meets Vision: Merging Multiple Adapters to Create a Multi Task Model, by Ege Kesim et al.
-
Summary of FoPru: Focal Pruning For Efficient Large Vision-language Models, by Lei Jiang et al.
-
Summary of Is This Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body, by Zeqing Wang et al.
-
Summary of Physics-informed LLM-agent For Automated Modulation Design in Power Electronics Systems, by Junhua Liu and Fanfan Lin and Xinze Li and Kwan Hui Lim and Shuai Zhao
-
Summary of Towards Context-rich Automated Biodiversity Assessments: Deriving AI-powered Insights From Camera Trap Data, by Paul Fergus et al.
-
Summary of Intent-aware Dialogue Generation and Multi-task Contrastive Learning For Multi-turn Intent Classification, by Junhua Liu and Yong Keat Tan and Bin Fu and Kwan Hui Lim
-
Summary of Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective, by Ernests Lavrinovics et al.
-
Summary of UnifiedCrawl: Aggregated Common Crawl For Affordable Adaptation Of LLMs on Low-resource Languages, by Bethel Melesse Tessema et al.
-
Summary of RV4Chatbot: Are Chatbots Allowed to Dream Of Electric Sheep?, by Andrea Gatti (University of Genoa) et al.
-
Summary of Resolving Multiple-dynamic Model Uncertainty in Hypothesis-driven Belief-MDPs, by Ofer Dagan et al.
-
Summary of Revisiting the Integration Of Convolution and Attention For Vision Backbone, by Lei Zhu et al.
-
Summary of Can Artificial Intelligence Generate Quality Research Topics Reflecting Patient Concerns?, by Jiyeong Kim et al.
-
Summary of Unveiling User Preferences: a Knowledge Graph and LLM-driven Approach For Conversational Recommendation, by Zhangchi Qiu et al.
-
Summary of Towards Next-generation Medical Agent: How o1 Is Reshaping Decision-making in Medical Scenarios, by Shaochen Xu et al.
-
Summary of Leveraging AI and NLP For Bank Marketing: a Systematic Review and Gap Analysis, by Christopher Gerling et al.
-
Summary of Popular LLMs Amplify Race and Gender Disparities in Human Mobility, by Xinhua Wu and Qi R. Wang
-
Summary of Learning to Ask: Conversational Product Search Via Representation Learning, by Jie Zou et al.
-
Summary of StreetViewLLM: Extracting Geographic Information Using a Chain-of-thought Multimodal Large Language Model, by Zongrong Li et al.
-
Summary of Ranking Unraveled: Recipes For LLM Rankings in Head-to-head AI Combat, by Roland Daynauth et al.
-
Summary of BALROG: Benchmarking Agentic LLM and VLM Reasoning on Games, by Davide Paglieri et al.
-
Summary of Integrated Water Resource Management in the Segura Hydrographic Basin: An Artificial Intelligence Approach, by Urtzi Otamendi et al.
-
Summary of AMSnet-KG: a Netlist Dataset For LLM-based AMS Circuit Auto-design Using Knowledge Graph RAG, by Yichen Shi et al.
-
Summary of AddrLLM: Address Rewriting Via Large Language Model on Nationwide Logistics Data, by Qinchen Yang et al.
-
Summary of Improved GUI Grounding Via Iterative Narrowing, by Anthony Nguyen
-
Summary of Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP For Real-time Gesture Recognition & Translation, by Hasnat Jamil Bhuiyan et al.
-
Summary of Unveiling Redundancy in Diffusion Transformers (DiTs): a Systematic Study, by Xibo Sun et al.
-
Summary of No Free Delivery Service: Epistemic Limits Of Passive Data Collection in Complex Social Systems, by Maximilian Nickel
-
Summary of PIORS: Personalized Intelligent Outpatient Reception Based on Large Language Model with Multi-agents Medical Scenario Simulation, by Zhijie Bao et al.
-
Summary of Benchmarking GPT-4 Against Human Translators: a Comprehensive Evaluation Across Languages, Domains, and Expertise Levels, by Jianhao Yan et al.
-
Summary of XAgents: a Framework For Interpretable Rule-based Multi-agents Cooperation, by Hailong Yang et al.
-
Summary of Separable Mixture Of Low-rank Adaptation For Continual Visual Instruction Tuning, by Ziqi Wang et al.
-
Summary of Safety Without Semantic Disruptions: Editing-free Safe Image Generation Via Context-preserving Dual Latent Reconstruction, by Jordan Vice et al.
-
Summary of Mirror Target YOLO: An Improved YOLOv8 Method with Indirect Vision For Heritage Buildings Fire Detection, by Jian Liang and Junsheng Cheng
-
Summary of Logic Augmented Generation, by Aldo Gangemi and Andrea Giovanni Nuzzolese
-
Summary of LLM-based Multi-agent Systems: Techniques and Business Perspectives, by Yingxuan Yang et al.
-
Summary of Uterine Ultrasound Image Captioning Using Deep Learning Techniques, by Abdennour Boulesnane et al.
-
Summary of Forecasting Future International Events: a Reliable Dataset For Text-based Event Modeling, by Daehoon Gwak et al.
-
Summary of FunctionChat-Bench: Comprehensive Evaluation Of Language Models’ Generative Capabilities in Korean Tool-use Dialogs, by Shinbok Lee et al.
-
Summary of Probing the Capacity Of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction, by Sonny George et al.
-
Summary of The Game-theoretic Symbiosis Of Trust and AI in Networked Systems, by Yunfei Ge and Quanyan Zhu
-
Summary of Real-time Energy-optimal Path Planning For Electric Vehicles, by Saman Ahmadi et al.
-
Summary of Unsupervised Homography Estimation on Multimodal Image Pair Via Alternating Optimization, by Sanghyeob Song et al.
-
Summary of LaVida Drive: Vision-text Interaction VLM For Autonomous Driving with Token Selection, Recovery and Enhancement, by Siwen Jiao et al.
-
Summary of Video-RAG: Visually-aligned Retrieval-augmented Long Video Comprehension, by Yongdong Luo et al.
-
Summary of Song Form-aware Full-song Text-to-lyrics Generation with Multi-level Granularity Syllable Count Control, by Yunkee Chae et al.
-
Summary of GraphCL: Graph-based Clustering For Semi-supervised Medical Image Segmentation, by Mengzhu Wang et al.
-
Summary of AGLP: a Graph Learning Perspective For Semi-supervised Domain Adaptation, by Houcheng Su et al.
-
Summary of Cross-camera Distracted Driver Classification Through Feature Disentanglement and Contrastive Learning, by Simone Bianco et al.
-
Summary of XMask3D: Cross-modal Mask Reasoning For Open Vocabulary 3D Semantic Segmentation, by Ziyi Wang et al.
-
Summary of VideoAutoArena: An Automated Arena For Evaluating Large Multimodal Models in Video Analysis Through User Simulation, by Ziyang Luo et al.
-
Summary of A Resource Efficient Fusion Network For Object Detection in Bird’s-eye View Using Camera and Raw Radar Data, by Kavin Chandrasekaran et al.
-
Summary of Unification Of Balti and Trans-border Sister Dialects in the Essence Of LLMs and AI Technology, by Muhammad Sharif et al.
-
Summary of Fact-level Confidence Calibration and Self-correction, by Yige Yuan et al.
-
Summary of LIMBA: An Open-source Framework For the Preservation and Valorization Of Low-resource Languages Using Generative Models, by Salvatore Mario Carta et al.
-
Summary of PatentEdits: Framing Patent Novelty As Textual Entailment, by Ryan Lee et al.
-
Summary of Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models, by Chanseo Lee et al.
-
Summary of Entropy Bootstrapping For Weakly Supervised Nuclei Detection, by James Willoughby et al.
-
Summary of Exploring Iterative Controllable Summarization with Large Language Models, by Sangwon Ryu et al.
-
Summary of Rethinking Top Probability From Multi-view For Distracted Driver Behaviour Localization, by Quang Vinh Nguyen et al.
-
Summary of Topological Symmetry Enhanced Graph Convolution For Skeleton-based Action Recognition, by Zeyu Liang et al.
-
Summary of Recall and Refine: a Simple but Effective Source-free Open-set Domain Adaptation Framework, by Ismail Nejjar et al.
-
Summary of Whisper Finetuning on Nepali Language, by Sanjay Rijal et al.
-
Summary of Leveraging MLLM Embeddings and Attribute Smoothing For Compositional Zero-shot Learning, by Xudong Yan et al.
-
Summary of Thinking Before Looking: Improving Multimodal LLM Reasoning Via Mitigating Visual Hallucination, by Haojie Zheng et al.
-
Summary of Neurosymbolic Graph Enrichment For Grounded World Models, by Stefano De Giorgis et al.
-
Summary of Deep Learning-driven Heat Map Analysis For Evaluating Thickness Of Wounded Skin Layers, by Devakumar Gr et al.
-
Summary of Enhanced Sign Language Translation Between American Sign Language (ASL) and Indian Sign Language (ISL) Using LLMs, by Malay Kumar et al.
-
Summary of CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs, by Zhehan Kan et al.
-
Summary of Enhancing Multi-class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs, by Ahmed Akib Jawad Karim et al.
-
Summary of An Exploration Of the Effect Of Quantisation on Energy Consumption and Inference Time Of StarCoder2, by Pepijn De Reus et al.
-
Summary of A Novel Approach to Eliminating Hallucinations in Large Language Model-assisted Causal Discovery, by Grace Sng et al.
-
Summary of Visual Cue Enhancement and Dual Low-rank Adaptation For Efficient Visual Instruction Fine-tuning, by Pengkun Jiao et al.
-
Summary of Playing Language Game with LLMs Leads to Jailbreaking, by Yu Peng et al.
-
Summary of SEFD: Semantic-enhanced Framework For Detecting LLM-generated Text, by Weiqing He et al.
-
Summary of Suicide Risk Assessment on Social Media with Semi-supervised Learning, by Max Lovitt et al.
-
Summary of Visual-oriented Fine-grained Knowledge Editing For Multimodal Large Language Models, by Zhen Zeng et al.
-
Summary of Declare and Justify: Explicit Assumptions in AI Evaluations Are Necessary For Effective Regulation, by Peter Barnett et al.
-
Summary of Bi-Mamba: Towards Accurate 1-bit State Space Models, by Shengkun Tang and Liqun Ma and Haonan Li and Mingjie Sun and Zhiqiang Shen
-
Summary of Survey on Semantic Interpretation Of Tabular Data: Challenges and Directions, by Marco Cremaschi et al.
-
Summary of ResLearn: Transformer-based Residual Learning For Metaverse Network Traffic Prediction, by Yoga Suhas Kuruba Manjunath et al.
-
Summary of AtomThink: a Slow Thinking Framework For Multimodal Mathematical Reasoning, by Kun Xiang et al.
-
Summary of On-board Vision-language Models For Personalized Autonomous Vehicle Motion Control: System Design and Real-world Validation, by Can Cui et al.