Paper List
We recommend you use the search box as this list is very long.
-
Summary of Exploring the Benefits Of Domain-pretraining Of Generative Large Language Models For Chemistry, by Anurag Acharya et al.
-
Summary of Hybrid Attention For Robust Rgb-t Pedestrian Detection in Real-world Conditions, by Arunkumar Rathinam et al.
-
Summary of Streamingbench: Assessing the Gap For Mllms to Achieve Streaming Video Understanding, by Junming Lin et al.
-
Summary of Rtify: Aligning Deep Neural Networks with Human Behavioral Decisions, by Yu-ang Cheng et al.
-
Summary of Adaptive Stereo Depth Estimation with Multi-spectral Images Across All Lighting Conditions, by Zihan Qin et al.
-
Summary of Deploying Multi-task Online Server with Large Language Model, by Yincen Qu et al.
-
Summary of Evaluating Moral Beliefs Across Llms Through a Pluralistic Framework, by Xuelin Liu et al.
-
Summary of Touchstone Benchmark: Are We on the Right Way For Evaluating Ai Algorithms For Medical Segmentation?, by Pedro R. A. S. Bassi et al.
-
Summary of Quill: Quotation Generation Enhancement Of Large Language Models, by Jin Xiao et al.
-
Summary of Fine-tuning Vision-language Model For Automated Engineering Drawing Information Extraction, by Muhammad Tayyab Khan et al.
-
Summary of Relation Learning and Aggregate-attention For Multi-person Motion Prediction, by Kehua Qu et al.
-
Summary of Automating Exploratory Proteomics Research Via Language Models, by Ning Ding et al.
-
Summary of Number Cookbook: Number Understanding Of Language Models and How to Improve It, by Haotong Yang et al.
-
Summary of Gs2pose: Two-stage 6d Object Pose Estimation Guided by Gaussian Splatting, By Jilan Mei et al.
-
Summary of Graph-dpep: Decomposed Plug and Ensemble Play For Few-shot Document Relation Extraction with Graph-of-thoughts Reasoning, by Tao Zhang et al.
-
Summary of Domain Expansion and Boundary Growth For Open-set Single-source Domain Generalization, by Pengkun Jiao et al.
-
Summary of Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: a Survey, by Ao Fu et al.
-
Summary of [vision Paper] Probot: Enhancing Patient-reported Outcome Measures For Diabetic Retinopathy Using Chatbots and Generative Ai, by Maren Pielka et al.
-
Summary of Region-guided Attack on the Segment Anything Model (sam), by Xiaoliang Liu et al.
-
Summary of Autonomous Decision Making For Uav Cooperative Pursuit-evasion Game with Reinforcement Learning, by Yang Zhao et al.
-
Summary of Leveraging Large Language Models in Code Question Answering: Baselines and Issues, by Georgy Andryushchenko et al.
-
Summary of Adaptive Genetic Selection Based Pinning Control with Asymmetric Coupling For Multi-network Heterogeneous Vehicular Systems, by Weian Guo et al.
-
Summary of Humanvlm: Foundation For Human-scene Vision-language Model, by Dawei Dai et al.
-
Summary of Gradient-guided Conditional Diffusion Models For Private Image Reconstruction: Analyzing Adversarial Impacts Of Differential Privacy and Denoising, by Tao Huang et al.
-
Summary of Self-supervised Cross-modality Learning For Uncertainty-aware Object Detection and Recognition in Applications Which Lack Pre-labelled Training Data, by Irum Mehboob et al.
-
Summary of Hfgaussian: Learning Generalizable Gaussian Human with Integrated Human Features, by Arnab Dey et al.
-
Summary of On Improved Conditioning Mechanisms and Pre-training Strategies For Diffusion Models, by Tariq Berrada Ifriqi et al.
-
Summary of Gis Copilot: Towards An Autonomous Gis Agent For Spatial Analysis, by Temitope Akinboyewa et al.
-
Summary of Causal Responsibility Attribution For Human-ai Collaboration, by Yahang Qi et al.
-
Summary of Knowledge Graphs Of Driving Scenes to Empower the Emerging Capabilities Of Neurosymbolic Ai, by Ruwan Wickramarachchi et al.
-
Summary of Spontaneous Emergence Of Agent Individuality Through Social Interactions in Llm-based Communities, by Ryosuke Takata et al.
-
Summary of Smoa: Improving Multi-agent Large Language Models with Sparse Mixture-of-agents, by Dawei Li et al.
-
Summary of Veritas: a Unified Approach to Reliability Evaluation, by Rajkumar Ramamurthy et al.
-
Summary of Enhancing Multiple Dimensions Of Trustworthiness in Llms Via Sparse Activation Control, by Yuxin Xiao et al.
-
Summary of Modeling and Simulation Of a Multi Robot System Architecture, by Ahmed R. Sadik et al.
-
Summary of A Comparative Analysis Of Instruction Fine-tuning Llms For Financial Text Classification, by Sorouralsadat Fatemi et al.
-
Summary of Building a Synthetic Vascular Model: Evaluation in An Intracranial Aneurysms Detection Scenario, by Rafic Nader and Florent Autrusseau and Vincent L’allinec and Romain Bourcier
-
Summary of Imagining and Building Wise Machines: the Centrality Of Ai Metacognition, by Samuel G. B. Johnson et al.
-
Summary of Evaluating the Impact Of Lab Test Results on Large Language Models Generated Differential Diagnoses From Clinical Case Vignettes, by Balu Bhasuran et al.
-
Summary of Dr. Sow: Density Ratio Of Strong-over-weak Llms For Reducing the Cost Of Human Annotation in Preference Tuning, by Guangxuan Xu et al.
-
Summary of Towards Leveraging News Media to Support Impact Assessment Of Ai Technologies, by Mowafak Allaham et al.
-
Summary of Decoupled Data Augmentation For Improving Image Classification, by Ruoxin Chen et al.
-
Summary of Inquire: a Natural World Text-to-image Retrieval Benchmark, by Edward Vendrow et al.
-
Summary of Facttest: Factuality Testing in Large Language Models with Finite-sample and Distribution-free Guarantees, by Fan Nie et al.
-
Summary of Investigating Idiomaticity in Word Representations, by Wei He et al.
-
Summary of Enhancing Indoor Mobility with Connected Sensor Nodes: a Real-time, Delay-aware Cooperative Perception Approach, by Minghao Ning et al.
-
Summary of A Comparative Analysis Of Counterfactual Explanation Methods For Text Classifiers, by Stephen Mcaleese and Mark Keane
-
Summary of V-dpo: Mitigating Hallucination in Large Vision Language Models Via Vision-guided Direct Preference Optimization, by Yuxi Xie et al.
-
Summary of Wave Network: An Ultra-small Language Model, by Xin Zhang et al.
-
Summary of Game Plot Design with An Llm-powered Assistant: An Empirical Study with Game Designers, by Seyed Hossein Alavi et al.
-
Summary of Multimodal Commonsense Knowledge Distillation For Visual Question Answering, by Shuo Yang et al.
-
Summary of Ecocropsaid: Economic Crops Aerial Image Dataset For Land Use Classification, by Sangdaow Noppitak et al.
-
Summary of The Evolution Of Rwkv: Advancements in Efficient Language Modeling, by Akul Datta
-
Summary of Detect An Object at Once Without Fine-tuning, by Junyu Hao et al.
-
Summary of Hunyuan3d 1.0: a Unified Framework For Text-to-3d and Image-to-3d Generation, by Xianghui Yang et al.
-
Summary of Crmarena: Understanding the Capacity Of Llm Agents to Perform Professional Crm Tasks in Realistic Environments, by Kung-hsiang Huang et al.
-
Summary of Grid-based Projection Of Spatial Data Into Knowledge Graphs, by Amin Anjomshoaa et al.
-
Summary of Evaluating Creative Short Story Generation in Humans and Large Language Models, by Mete Ismayilzada et al.
-
Summary of Genxd: Generating Any 3d and 4d Scenes, by Yuyang Zhao et al.
-
Summary of Can Large Language Models Generalize Analogy Solving Like People Can?, by Claire E. Stevenson et al.
-
Summary of Addressing Uncertainty in Llms to Enhance Reliability in Generative Ai, by Ramneet Kaur et al.
-
Summary of Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models, by Guangzhi Xiong et al.
-
Summary of How Far Is Video Generation From World Model: a Physical Law Perspective, by Bingyi Kang et al.
-
Summary of Ideabench: Benchmarking Large Language Models For Research Idea Generation, by Sikun Guo et al.
-
Summary of Can Llms Make Trade-offs Involving Stipulated Pain and Pleasure States?, by Geoff Keeling et al.
-
Summary of Sled: Self Logits Evolution Decoding For Improving Factuality in Large Language Models, by Jianyi Zhang et al.
-
Summary of Entropic Hetero-associative Memory, by Rafael Morales et al.
-
Summary of Todo: Enhancing Llm Alignment with Ternary Preferences, by Yuxiang Guo et al.
-
Summary of Typescore: a Text Fidelity Metric For Text-to-image Generative Models, by Georgia Gabriela Sampaio et al.
-
Summary of Rate, Explain and Cite (rec): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models, By Aliyah R. Hsu et al.
-
Summary of An Exploration Of Higher Education Course Evaluation by Large Language Models, By Bo Yuan et al.
-
Summary of A Multi-task Role-playing Agent Capable Of Imitating Character Linguistic Styles, by Siyuan Chen et al.
-
Summary of Sinatools: Open Source Toolkit For Arabic Natural Language Processing, by Tymaa Hammouda et al.
-
Summary of Are Llms Good Pragmatic Speakers?, by Mingyue Jian et al.
-
Summary of Rs-moe: a Vision-language Model with Mixture Of Experts For Remote Sensing Image Captioning and Visual Question Answering, by Hui Lin et al.
-
Summary of Osad: Open-set Aircraft Detection in Sar Images, by Xiayang Xiao et al.
-
Summary of Dreampolish: Domain Score Distillation with Progressive Geometry Generation, by Yean Cheng et al.
-
Summary of Ontology Population Using Llms, by Sanaz Saki Norouzi et al.
-
Summary of Vq-map: Bird’s-eye-view Map Layout Estimation in Tokenized Discrete Space Via Vector Quantization, by Yiwei Zhang et al.
-
Summary of Ecoact: Economic Agent Determines When to Register What Action, by Shaokun Zhang et al.
-
Summary of Optical Flow Representation Alignment Mamba Diffusion Model For Medical Video Generation, by Zhenbin Wang et al.
-
Summary of Optimizing Gastrointestinal Diagnostics: a Cnn-based Model For Vce Image Classification, by Vaneeta Ahlawat et al.
-
Summary of Free-mask: a Novel Paradigm Of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability, by Bo Gao et al.
-
Summary of Constrained Human-ai Cooperation: An Inclusive Embodied Social Intelligence Challenge, by Weihua Du et al.
-
Summary of Silver Medal Solution For Image Matching Challenge 2024, by Yian Wang
-
Summary of Mining and Transferring Feature-geometry Coherence For Unsupervised Point Cloud Registration, by Kezheng Xiong et al.
-
Summary of Toddlers’ Active Gaze Behavior Supports Self-supervised Object Learning, by Zhengyang Yu et al.
-
Summary of Lidattack: Robust Black-box Attack on Lidar-based Object Detection, by Jinyin Chen et al.
-
Summary of Foundations and Recent Trends in Multimodal Mobile Agents: a Survey, by Biao Wu et al.
-
Summary of Shortcut Learning in In-context Learning: a Survey, by Rui Song et al.
-
Summary of Sibylsat: Using Sat As An Oracle to Perform a Greedy Search on Tohtn Planning, by Gaspard Quenard (marvin) et al.
-
Summary of Enhancing Osteoporosis Detection: An Explainable Multi-modal Learning Framework with Feature Fusion and Variable Clustering, by Mehdi Hosseini Chagahi et al.
-
Summary of Respact: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-based Conversational Ai Agents, by Vardhan Dongre et al.
-
Summary of Identifying Implicit Social Biases in Vision-language Models, by Kimia Hamidieh et al.
-
Summary of Semi-strongly Solved: a New Definition Leading Computer to Perfect Gameplay, by Hiroki Takizawa
-
Summary of Rule Based Rewards For Language Model Safety, by Tong Mu et al.
-
Summary of Infant Agent: a Tool-integrated, Logic-driven Agent with Cost-effective Api Usage, by Bin Lei et al.