Paper List

We recommend you use the search box as this list is very long.

Summary of Command-line Risk Classification Using Transformer-based Neural Architectures, by Paolo Notaro et al.
Summary of Efficient Compression Of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling, by Xihaier Luo et al.
Summary of A Neurosymbolic Fast and Slow Architecture For Graph Coloring, by Vedant Khandelwal et al.
Summary of Instantswap: Fast Customized Concept Swapping Across Sharp Shape Differences, by Chenyang Zhu et al.
Summary of Painternet: Adaptive Image Inpainting with Actual-token Attention and Diverse Mask Control, by Ruichen Wang et al.
Summary of Best Practices For Large Language Models in Radiology, by Christian Bluethgen et al.
Summary of Schedule on the Fly: Diffusion Time Prediction For Faster and Better Image Generation, by Zilyu Ye et al.
Summary of Collaborative Instance Object Navigation: Leveraging Uncertainty-awareness to Minimize Human-agent Dialogues, by Francesco Taioli et al.
Summary of Exploring React Prompting For Task-oriented Dialogue: Insights and Shortcomings, by Michelle Elizabeth et al.
Summary of Indexing Economic Fluctuation Narratives From Keiki Watchers Survey, by Eriko Shigetsugu et al.
Summary of Mulan: Adapting Multilingual Diffusion Models For Hundreds Of Languages with Negligible Cost, by Sen Xing et al.
Summary of Fedpaw: Federated Learning with Personalized Aggregation Weights For Urban Vehicle Speed Prediction, by Yuepeng He et al.
Summary of Align-kd: Distilling Cross-modal Alignment Knowledge For Mobile Vision-language Model, by Qianhan Feng et al.
Summary of Mftf: Mask-free Training-free Object Level Layout Control Diffusion Model, by Shan Yang
Summary of Enhancing Perception Capabilities Of Multimodal Llms with Training-free Fusion, by Zhuokun Chen et al.
Summary of Multimodal Medical Disease Classification with Llama Ii, by Christian Gapp et al.
Summary of Long Video Diffusion Generation with Segmented Cross-attention and Content-rich Video Data Curation, by Xin Yan et al.
Summary of The “llm World Of Words” English Free Association Norms Generated by Large Language Models, By Katherine Abramski et al.
Summary of Integrative Cam: Adaptive Layer Fusion For Comprehensive Interpretation Of Cnns, by Aniket K. Singh et al.
Summary of Research on Cervical Cancer P16/ki-67 Immunohistochemical Dual-staining Image Recognition Algorithm Based on Yolo, by Xiao-jun Wu et al.
Summary of Mambau-lite: a Lightweight Model Based on Mamba and Integrated Channel-spatial Attention For Skin Lesion Segmentation, by Thi-nhu-quynh Nguyen et al.
Summary of Towards Cross-lingual Audio Abuse Detection in Low-resource Settings with Few-shot Learning, by Aditya Narayan Sankaran et al.
Summary of Mvimgnet2.0: a Larger-scale Dataset Of Multi-view Images, by Xiaoguang Han et al.
Summary of Local Vs. Global: Local Land-use and Land-cover Models Deliver Higher Quality Maps, by Girmaw Abebe Tadesse et al.
Summary of Long Text Outline Generation: Chinese Text Outline Based on Unsupervised Framework and Large Language Mode, by Yan Yan and Yuanchi Ma
Summary of Improving Physics Reasoning in Large Language Models Using Mixture Of Refinement Agents, by Raj Jaiswal et al.
Summary of Alignmamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment, by Yan Li et al.
Summary of Improving Multimodal Llms Ability in Geometry Problem Solving, Reasoning, and Multistep Scoring, by Avinash Anand et al.
Summary of Knowledgeprompts: Exploring the Abilities Of Large Language Models to Solve Proportional Analogies Via Knowledge-enhanced Prompting, by Thilini Wijesiriwardene and Ruwan Wickramarachchi and Sreeram Vennam and Vinija Jain and Aman Chadha and Amitava Das and Ponnurangam Kumaraguru and Amit Sheth
Summary of Learn to Unlearn: Meta-learning-based Knowledge Graph Embedding Unlearning, by Naixing Xu et al.
Summary of Playable Game Generation, by Mingyu Yang et al.
Summary of Bigcity: a Universal Spatiotemporal Model For Unified Trajectory and Traffic State Data Analysis, by Xie Yu et al.
Summary of Linear Probe Penalties Reduce Llm Sycophancy, by Henry Papadatos et al.
Summary of Large Language Models As Mirrors Of Societal Moral Standards, by Evi Papadopoulou et al.
Summary of Llms As Mirrors Of Societal Moral Standards: Reflection Of Cultural Divergence and Agreement Across Ethical Topics, by Mijntje Meijer et al.
Summary of Evaluating Automated Radiology Report Quality Through Fine-grained Phrasal Grounding Of Clinical Findings, by Razi Mahmood et al.
Summary of Reducing Inference Energy Consumption Using Dual Complementary Cnns, by Michail Kinnas et al.
Summary of Advancing Speech Language Models by Scaling Supervised Fine-tuning with Over 60,000 Hours Of Synthetic Speech Dialogue Data, By Shuaijiang Zhao et al.
Summary of A Hierarchical Heuristic For Clustered Steiner Trees in the Plane with Obstacles, by Victor Parque
Summary of How the Use Of Feature Selection Methods Influences the Efficiency and Accuracy Of Complex Network Simulations, by Katarzyna Musial et al.
Summary of Tas-tsc: a Data-driven Framework For Estimating Time Of Arrival Using Temporal-attribute-spatial Tri-space Coordination Of Truck Trajectories, by Mengran Li et al.
Summary of Object Agnostic 3d Lifting in Space and Time, by Christopher Fusco et al.
Summary of Obi-bench: Can Lmms Aid in Study Of Ancient Script on Oracle Bones?, by Zijian Chen et al.
Summary of Fullstack Bench: Evaluating Llms As Full Stack Coders, by Bytedance-seed-foundation-code-team: Yao Cheng et al.
Summary of Motion Dreamer: Boundary Conditional Motion Reasoning For Physically Coherent Video Generation, by Tianshuo Xu et al.
Summary of Rethinking Generalizability and Discriminability Of Self-supervised Learning From Evolutionary Game Theory Perspective, by Jiangmeng Li et al.
Summary of Unveiling Performance Challenges Of Large Language Models in Low-resource Healthcare: a Demographic Fairness Perspective, by Yue Zhou et al.
Summary of Phyt2v: Llm-guided Iterative Self-refinement For Physics-grounded Text-to-video Generation, by Qiyao Xue et al.
Summary of Turing Representational Similarity Analysis (rsa): a Flexible Method For Measuring Alignment Between Human and Artificial Intelligence, by Mattson Ogg et al.
Summary of Fairness at Every Intersection: Uncovering and Mitigating Intersectional Biases in Multimodal Clinical Predictions, by Resmi Ramachandranpillai et al.
Summary of Leveraging Llm For Automated Ontology Extraction and Knowledge Graph Generation, by Mohammad Sadeq Abolhasani et al.
Summary of Lvlm-count: Enhancing the Counting Ability Of Large Vision-language Models, by Muhammad Fetrat Qharabagh et al.
Summary of The Advancement Of Personalized Learning Potentially Accelerated by Generative Ai, By Yuang Wei et al.
Summary of A Comparative Study Of Llm-based Asr and Whisper in Low Resource and Code Switching Scenario, by Zheshu Song and Ziyang Ma and Yifan Yang and Jianheng Zhuo and Xie Chen
Summary of Towards Adaptive Mechanism Activation in Language Agent, by Ziyang Huang et al.
Summary of Adascale: Dynamic Context-aware Dnn Scaling Via Automated Adaptation Loop on Mobile Devices, by Yuzhan Wang et al.
Summary of Revisiting Self-supervised Heterogeneous Graph Learning From Spectral Clustering Perspective, by Yujie Mo and Zhihe Lu and Runpeng Yu and Xiaofeng Zhu and Xinchao Wang
Summary of Exploring Cognition Through Morphological Info-computational Framework, by Gordana Dodig-crnkovic
Summary of Rethinking Cognition: Morphological Info-computation and the Embodied Paradigm in Life and Artificial Intelligence, by Gordana Dodig-crnkovic
Summary of Ctrlnerf: the Generative Neural Radiation Fields For the Controllable Synthesis Of High-fidelity 3d-aware Images, by Jian Liu and Zhen Yu
Summary of Pgso: Prompt-based Generative Sequence Optimization Network For Aspect-based Sentiment Analysis, by Hao Dong et al.
Summary of Selfprompt: Autonomously Evaluating Llm Robustness Via Domain-constrained Knowledge Guidelines and Refined Adversarial Prompts, by Aihua Pei et al.
Summary of Divd: Deblurring with Improved Video Diffusion Model, by Haoyang Long and Yan Wang and Wendong Wang
Summary of Fine Tuning Large Language Models to Deliver Cbt For Depression, by Talha Tahir
Summary of Adapting the Re-id Challenge For Static Sensors, by Avirath Sundaresan et al.
Summary of Plancritic: Formal Planning with Human Feedback, by Owen Burns et al.
Summary of Cognitive Biases in Large Language Models: a Survey and Mitigation Experiments, by Yasuaki Sumita et al.
Summary of Eftvit: Efficient Federated Training Of Vision Transformers with Masked Images on Resource-constrained Edge Devices, by Meihan Wu et al.
Summary of Empowering the Deaf and Hard Of Hearing Community: Enhancing Video Captions Using Large Language Models, by Nadeen Fathallah et al.
Summary of Safety Alignment Backfires: Preventing the Re-emergence Of Suppressed Concepts in Fine-tuned Text-to-image Diffusion Models, by Sanghyun Kim et al.
Summary of Cada: Cross-problem Routing Solver with Constraint-aware Dual-attention, by Han Li et al.
Summary of Enhancing Zero-shot Chain Of Thought Prompting Via Uncertainty-guided Strategy Selection, by Shanu Kumar et al.
Summary of Strategic Application Of Aigc For Uav Trajectory Design: a Channel Knowledge Map Approach, by Chiya Zhang et al.
Summary of Droidcall: a Dataset For Llm-powered Android Intent Invocation, by Weikai Xie et al.
Summary of Federated Progressive Self-distillation with Logits Calibration For Personalized Iiot Edge Intelligence, by Yingchao Wang and Wenqi Niu
Summary of Freecond: Free Lunch in the Input Conditions Of Text-guided Inpainting, by Teng-fang Hsiao et al.
Summary of Learner Attentiveness and Engagement Analysis in Online Education Using Computer Vision, by Sharva Gogawale et al.
Summary of Optimizing Sequential Recommendation Models with Scaling Laws and Approximate Entropy, by Tingjia Shen et al.
Summary of Benchmark Real-time Adaptation and Communication Capabilities Of Embodied Agent in Collaborative Scenarios, by Shipeng Liu et al.
Summary of Agribench: a Hierarchical Agriculture Benchmark For Multimodal Large Language Models, by Yutong Zhou and Masahiro Ryo
Summary of Node Importance Estimation Leveraging Llms For Semantic Augmentation in Knowledge Graphs, by Xinyu Lin et al.
Summary of Lambda: Covering the Multimodal Critical Scenarios For Automated Driving Systems by Search Space Quantization, By Xinzheng Wu et al.
Summary of Human Action Clips: Detecting Ai-generated Human Motion, by Matyas Bohacek et al.
Summary of Sims: Simulating Stylized Human-scene Interactions with Retrieval-augmented Script Generation, by Wenjia Wang et al.
Summary of Planning Vs Reasoning: Ablations to Test Capabilities Of Lora Layers, by Neel Redkar
Summary of Improving Medical Diagnostics with Vision-language Models: Convex Hull-based Uncertainty Analysis, by Ferhat Ozgur Catak and Murat Kuzlu and Taylor Patrick
Summary of Diffguard: Text-based Safety Checker For Diffusion Models, by Massine El Khader et al.
Summary of Mosabench: Multi-object Sentiment Analysis Benchmark For Evaluating Multimodal Large Language Models Understanding Of Complex Image, by Shezheng Song et al.
Summary of Addressing Vulnerabilities in Ai-image Detection: Challenges and Proposed Solutions, by Justin Jiang
Summary of Graph Canvas For Controllable 3d Scene Generation, by Libin Liu and Shen Chen and Sen Jia and Jingzhe Shi and Zhongyu Jiang and Can Jin and Wu Zongkai and Jenq-neng Hwang and Lei Li
Summary of Scenetap: Scene-coherent Typographic Adversarial Planner Against Vision-language Models in Real-world Environments, by Yue Cao et al.
Summary of Proceedings Of the 2024 Xcsp3 Competition, by Gilles Audemard et al.
Summary of Hybrid Discriminative Attribute-object Embedding Network For Compositional Zero-shot Learning, by Yang Liu et al.
Summary of Relation-aware Meta-learning For Zero-shot Sketch-based Image Retrieval, by Yang Liu et al.
Summary of Orthus: Autoregressive Interleaved Image-text Generation with Modality-specific Heads, by Siqi Kou et al.
Summary of Open-sora Plan: Open-source Large Video Generation Model, by Bin Lin et al.
Summary of Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-language Classifiers, by Chancharik Mitra et al.
Summary of Dlava: Document Language and Vision Assistant For Answer Localization with Enhanced Interpretability and Trustworthiness, by Ahmad Mohammadshirazi et al.
Summary of Towards the Ultimate Programming Language: Trust and Benevolence in the Age Of Artificial Intelligence, by Bartosz Sawicki et al.
Summary of To Ensemble or Not: Assessing Majority Voting Strategies For Phishing Detection with Large Language Models, by Fouad Trad et al.