Paper List
We recommend you use the search box as this list is very long.
-
Summary of Uc-nerf: Uncertainty-aware Conditional Neural Radiance Fields From Endoscopic Sparse Views, by Jiaxin Guo et al.
-
Summary of Longllava: Scaling Multi-modal Llms to 1000 Images Efficiently Via a Hybrid Architecture, by Xidong Wang et al.
-
Summary of Mobileunetr: a Lightweight End-to-end Hybrid Vision Transformer For Efficient Medical Image Segmentation, by Shehan Perera et al.
-
Summary of 3d-lex V1.0: 3d Lexicons For American Sign Language and Sign Language Of the Netherlands, by Oline Ranum and Gomer Otterspeer and Jari I. Andersen and Robert G. Belleman and Floris Roelofsen
-
Summary of Cyberhost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention, by Gaojie Lin et al.
-
Summary of From Grounding to Planning: Benchmarking Bottlenecks in Web Agents, by Segev Shlomov et al.
-
Summary of A Randomized Simulation Trial Evaluating Abimed, a Clinical Decision Support System For Medication Reviews and Polypharmacy Management, by Abdelmalek Mouazer et al.
-
Summary of Comprehensive Equity Index (cei): Definition and Application to Bias Evaluation in Biometrics, by Imanol Solano et al.
-
Summary of Beaver: An Enterprise Benchmark For Text-to-sql, by Peter Baile Chen et al.
-
Summary of Transdae: Dual Attention Mechanism in a Hierarchical Transformer For Efficient Medical Image Segmentation, by Bobby Azad et al.
-
Summary of Allweathernet: Unified Image Enhancement For Autonomous Driving Under Adverse Weather and Lowlight-conditions, by Chenghao Qian et al.
-
Summary of A Deployed Online Reinforcement Learning Algorithm in An Oral Health Clinical Trial, by Anna L. Trella et al.
-
Summary of Depthcrafter: Generating Consistent Long Depth Sequences For Open-world Videos, by Wenbo Hu et al.
-
Summary of On a Heuristic Approach to the Description Of Consciousness As a Hypercomplex System State and the Possibility Of Machine Consciousness (German Edition), by Ralf Otte
-
Summary of Biochemical Prostate Cancer Recurrence Prediction: Thinking Fast & Slow, by Suhang You et al.
-
Summary of Action-based Adhd Diagnosis in Video, by Yichun Li et al.
-
Summary of Initial Development and Evaluation Of the Creative Artificial Intelligence Through Recurring Developments and Determinations (cairdd) System, by Jeremy Straub and Zach Johnson
-
Summary of Arctic-snowcoder: Demystifying High-quality Data in Code Pretraining, by Yuxiang Wei et al.
-
Summary of Do Large Language Models Possess Sensitive to Sentiment?, by Yang Liu et al.
-
Summary of Coral Model Generation From Single Images For Virtual Reality Applications, by Jie Fu et al.
-
Summary of Large Language Models and Cognitive Science: a Comprehensive Review Of Similarities, Differences, and Challenges, by Qian Niu et al.
-
Summary of Multi-modal Situated Reasoning in 3d Scenes, by Xiongkun Linghu et al.
-
Summary of Detecting Korean Food Using Image Using Hierarchical Model, by Hoang Khanh Lam et al.
-
Summary of Poliprompt: a High-performance Cost-effective Llm-based Text Classification Framework For Political Science, by Menglin Liu et al.
-
Summary of Amg: Avatar Motion Guided Video Generation, by Zhangsihao Yang et al.
-
Summary of Earthgen: Generating the World From Top-down Views, by Ansh Sharma et al.
-
Summary of Think Twice Before Recognizing: Large Multimodal Models For General Fine-grained Traffic Sign Recognition, by Yaozong Gan et al.
-
Summary of Ea-ras: Towards Efficient and Accurate End-to-end Reconstruction Of Anatomical Skeleton, by Zhiheng Peng et al.
-
Summary of Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms, by Zhuo Li et al.
-
Summary of Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture, by Chen-chi Chang et al.
-
Summary of Blocks As Probes: Dissecting Categorization Ability Of Large Multimodal Models, by Bin Fu and Qiyang Wan and Jialin Li and Ruiping Wang and Xilin Chen
-
Summary of Lssf-net: Lightweight Segmentation with Self-awareness, Spatial Attention, and Focal Modulation, by Hamza Farooq et al.
-
Summary of Improving Apple Object Detection with Occlusion-enhanced Distillation, by Liang Geng
-
Summary of Adacomp: Extractive Context Compression with Adaptive Predictor For Retrieval-augmented Large Language Models, by Qianchi Zhang and Hainan Zhang and Liang Pang and Hongwei Zheng and Zhiming Zheng
-
Summary of Booster: Tackling Harmful Fine-tuning For Large Language Models Via Attenuating Harmful Perturbation, by Tiansheng Huang et al.
-
Summary of Decompose the Model: Mechanistic Interpretability in Image Models with Generalized Integrated Gradients (gig), by Yearim Kim et al.
-
Summary of Training on the Benchmark Is Not All You Need, by Shiwen Ni et al.
-
Summary of Adaptive Explicit Knowledge Transfer For Knowledge Distillation, by Hyungkeun Park et al.
-
Summary of Dialogue You Can Trust: Human and Ai Perspectives on Generated Conversations, by Ike Ebubechukwu et al.
-
Summary of Learning State-dependent Policy Parametrizations For Dynamic Technician Routing with Rework, by Jonas Stein et al.
-
Summary of Real-time Indoor Object Detection Based on Hybrid Cnn-transformer Approach, by Salah Eddine Laidoudi et al.
-
Summary of Latent Distillation For Continual Object Detection at the Edge, by Francesco Pasti et al.
-
Summary of What Are the Essential Factors in Crafting Effective Long Context Multi-hop Instruction Datasets? Insights and Best Practices, by Zhi Chen et al.
-
Summary of A Perspective on Literary Metaphor in the Context Of Generative Ai, by Imke Van Heerden and Anil Bas
-
Summary of Learning in Hybrid Active Inference Models, by Poppy Collis et al.
-
Summary of From Bird’s-eye to Street View: Crafting Diverse and Condition-aligned Images with Latent Diffusion Model, by Xiaojie Xu et al.
-
Summary of Scope: Sign Language Contextual Processing with Embedding From Llms, by Yuqi Liu et al.
-
Summary of Dpdedit: Detail-preserved Diffusion Models For Multimodal Fashion Image Editing, by Xiaolong Wang et al.
-
Summary of Pre-trained Language Models For Keyphrase Prediction: a Review, by Muhammad Umair et al.
-
Summary of Ds Myolo: a Reliable Object Detector Based on Ssms For Driving Scenarios, by Yang Li and Jianli Xiao
-
Summary of Large Language Models Can Understanding Depth From Monocular Images, by Zhongyi Xia and Tianzhao Wu
-
Summary of Fmrft: Fusion Mamba and Detr For Query Time Sequence Intersection Fish Tracking, by Mingyuan Yao et al.
-
Summary of Esp-pct: Enhanced Vr Semantic Performance Through Efficient Compression Of Temporal and Spatial Redundancies in Point Cloud Transformers, by Luoyu Mei et al.
-
Summary of Integrating End-to-end and Modular Driving Approaches For Online Corner Case Detection in Autonomous Driving, by Gemb Kaljavesi et al.
-
Summary of Conversational Complexity For Assessing Risk in Large Language Models, by John Burden et al.
-
Summary of Path-consistency: Prefix Enhancement For Efficient Inference in Llm, by Jiace Zhu et al.
-
Summary of Real-time Accident Anticipation For Autonomous Driving Through Monocular Depth-enhanced 3d Modeling, by Haicheng Liao et al.
-
Summary of Pediatric Brain Tumor Classification Using Digital Histopathology and Deep Learning: Evaluation Of Sota Methods on a Multi-center Swedish Cohort, by Iulian Emil Tampu et al.
-
Summary of Pairing Analogy-augmented Generation with Procedural Memory For Procedural Q&a, by K Roth and Rushil Gupta and Simon Halle and Bang Liu
-
Summary of Language Models Benefit From Preparation with Elicited Knowledge, by Jiacan Yu et al.
-
Summary of H-arc: a Robust Estimate Of Human Performance on the Abstraction and Reasoning Corpus Benchmark, by Solim Legris et al.
-
Summary of Comfybench: Benchmarking Llm-based Agents in Comfyui For Autonomously Designing Collaborative Ai Systems, by Xiangyuan Xue et al.
-
Summary of Kvasir-vqa: a Text-image Pair Gi Tract Dataset, by Sushant Gautam et al.
-
Summary of Polyrating: a Cost-effective and Bias-aware Rating System For Llm Evaluation, by Jasper Dekoninck et al.
-
Summary of Abstaining Machine Learning — Philosophical Considerations, by Daniela Schuster
-
Summary of Lpuwf-ldm: Enhanced Latent Diffusion Model For Precise Late-phase Uwf-fa Generation on Limited Dataset, by Zhaojie Fang et al.
-
Summary of Trusted Unified Feature-neighborhood Dynamics For Multi-view Classification, by Haojian Huang et al.
-
Summary of Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning, by Jiaming Yin et al.
-
Summary of Diffusion Based Multi-domain Neuroimaging Harmonization Method with Preservation Of Anatomical Details, by Haoyu Lan et al.
-
Summary of Building Fkg.in: a Knowledge Graph For Indian Food, by Saransh Kumar Gupta et al.
-
Summary of You-only-randomize-once: Shaping Statistical Properties in Constraint-based Pcg, by Jediah Katz et al.
-
Summary of Accelerating Hybrid Agent-based Models and Fuzzy Cognitive Maps: How to Combine Agents Who Think Alike?, by Philippe J. Giabbanelli and Jack T. Beerman
-
Summary of Entropy Loss: An Interpretability Amplifier Of 3d Object Detection Network For Intelligent Driving, by Haobo Yang et al.
-
Summary of Jaxlife: An Open-ended Agentic Simulator, by Chris Lu et al.
-
Summary of Equitable Skin Disease Prediction Using Transfer Learning and Domain Adaptation, by Sajib Acharjee Dip et al.
-
Summary of User-specific Dialogue Generation with User Profile-aware Pre-training Model and Parameter-efficient Fine-tuning, by Atsushi Otsuka and Kazuya Matsuo and Ryo Ishii and Narichika Nomoto and Hiroaki Sugiyama
-
Summary of Vired: Prediction Of Visual Relations in Engineering Drawings, by Chao Gu et al.
-
Summary of Multi-scale Temporal Fusion Transformer For Incomplete Vehicle Trajectory Prediction, by Zhanwen Liu et al.
-
Summary of Xnet V2: Fewer Limitations, Better Results and Greater Universality, by Yanfeng Zhou et al.
-
Summary of Large Language Models For Automatic Detection Of Sensitive Topics, by Ruoyu Wen et al.
-
Summary of 3d Priors-guided Diffusion For Blind Face Restoration, by Xiaobin Lu et al.
-
Summary of Unlocking the Wisdom Of Large Language Models: An Introduction to the Path to Artificial General Intelligence, by Edward Y. Chang
-
Summary of Mapwise: Evaluating Vision-language Models For Advanced Map Queries, by Srija Mukhopadhyay et al.
-
Summary of Onlysportslm: Optimizing Sports-domain Language Models with Sota Performance Under Billion Parameters, by Zexin Chen et al.
-
Summary of Toward a More Complete Omr Solution, by Guang Yang et al.
-
Summary of The Merit Dataset: Modelling and Efficiently Rendering Interpretable Transcripts, by I. De Rodrigo et al.
-
Summary of Wikicausal: Corpus and Evaluation Framework For Causal Knowledge Graph Construction, by Oktie Hassanzadeh
-
Summary of Predicting the Target Word Of Game-playing Conversations Using a Low-rank Dialect Adapter For Decoder Models, by Dipankar Srirag et al.
-
Summary of Geospatial Foundation Models For Image Analysis: Evaluating and Enhancing Nasa-ibm Prithvi’s Domain Adaptability, by Chia-yu Hsu et al.
-
Summary of Genai-powered Multi-agent Paradigm For Smart Urban Mobility: Opportunities and Challenges For Integrating Large Language Models (llms) and Retrieval-augmented Generation (rag) with Intelligent Transportation Systems, by Haowen Xu et al.
-
Summary of Streamlining Forest Wildfire Surveillance: Ai-enhanced Uavs Utilizing the Flame Aerial Video Dataset For Lightweight and Efficient Monitoring, by Lemeng Zhao et al.
-
Summary of Plant Detection From Ultra High Resolution Remote Sensing Images: a Semantic Segmentation Approach Based on Fuzzy Loss, by Shivam Pande et al.
-
Summary of Mapping Earth Mounds From Space, by Baki Uzun et al.
-
Summary of Large Language Models-enabled Digital Twins For Precision Medicine in Rare Gynecological Tumors, by Jacqueline Lammert et al.
-
Summary of Testing and Evaluation Of Large Language Models: Correctness, Non-toxicity, and Fairness, by Wenxuan Wang
-
Summary of Learning to Ask: When Llm Agents Meet Unclear Instruction, by Wenxuan Wang et al.
-
Summary of Does Knowledge Localization Hold True? Surprising Differences Between Entity and Relation Perspectives in Language Models, by Yifan Wei et al.
-
Summary of Dame: Personalized Federated Social Event Detection with Dual Aggregation Mechanism, by Xiaoyan Yu et al.
-
Summary of Entity-aware Biaffine Attention Model For Improved Constituent Parsing with Reduced Entity Violations, by Xinyi Bai