Paper List
We recommend you use the search box as this list is very long.
-
Summary of Long Video Diffusion Generation with Segmented Cross-attention and Content-rich Video Data Curation, by Xin Yan et al.
-
Summary of Integrative Cam: Adaptive Layer Fusion For Comprehensive Interpretation Of Cnns, by Aniket K. Singh et al.
-
Summary of The “llm World Of Words” English Free Association Norms Generated by Large Language Models, By Katherine Abramski et al.
-
Summary of Research on Cervical Cancer P16/ki-67 Immunohistochemical Dual-staining Image Recognition Algorithm Based on Yolo, by Xiao-jun Wu et al.
-
Summary of Mambau-lite: a Lightweight Model Based on Mamba and Integrated Channel-spatial Attention For Skin Lesion Segmentation, by Thi-nhu-quynh Nguyen et al.
-
Summary of Towards Cross-lingual Audio Abuse Detection in Low-resource Settings with Few-shot Learning, by Aditya Narayan Sankaran et al.
-
Summary of Mvimgnet2.0: a Larger-scale Dataset Of Multi-view Images, by Xiaoguang Han et al.
-
Summary of Artificial Intelligence For Geometry-based Feature Extraction, Analysis and Synthesis in Artistic Images: a Survey, by Mridula Vijendran et al.
-
Summary of Pld+: Accelerating Llm Inference by Leveraging Language Model Artifacts, By Shwetha Somasundaram et al.
-
Summary of Fastrm: An Efficient and Automatic Explainability Framework For Multimodal Generative Models, by Gabriela Ben-melech Stan et al.
-
Summary of Intelligent Spark Agents: a Modular Langgraph Framework For Scalable, Visualized, and Enhanced Big Data Machine Learning Workflows, by Jialin Wang and Zhihua Duan
-
Summary of Artbrain: An Explainable End-to-end Toolkit For Classification and Attribution Of Ai-generated Art and Style, by Ravidu Suien Rammuni Silva et al.
-
Summary of Copyrightshield: Spatial Similarity Guided Backdoor Defense Against Copyright Infringement in Diffusion Models, by Zhixiang Guo et al.
-
Summary of Seqafford: Sequential 3d Affordance Reasoning Via Multimodal Large Language Model, by Chunlin Yu et al.
-
Summary of Videolights: Feature Refinement and Cross-task Alignment Transformer For Joint Video Highlight Detection and Moment Retrieval, by Dhiman Paul et al.
-
Summary of Bigcity: a Universal Spatiotemporal Model For Unified Trajectory and Traffic State Data Analysis, by Xie Yu et al.
-
Summary of Llms As Mirrors Of Societal Moral Standards: Reflection Of Cultural Divergence and Agreement Across Ethical Topics, by Mijntje Meijer et al.
-
Summary of Large Language Models As Mirrors Of Societal Moral Standards, by Evi Papadopoulou et al.
-
Summary of Linear Probe Penalties Reduce Llm Sycophancy, by Henry Papadatos et al.
-
Summary of Reducing Inference Energy Consumption Using Dual Complementary Cnns, by Michail Kinnas et al.
-
Summary of Evaluating Automated Radiology Report Quality Through Fine-grained Phrasal Grounding Of Clinical Findings, by Razi Mahmood et al.
-
Summary of Advancing Speech Language Models by Scaling Supervised Fine-tuning with Over 60,000 Hours Of Synthetic Speech Dialogue Data, By Shuaijiang Zhao et al.
-
Summary of A Hierarchical Heuristic For Clustered Steiner Trees in the Plane with Obstacles, by Victor Parque
-
Summary of How the Use Of Feature Selection Methods Influences the Efficiency and Accuracy Of Complex Network Simulations, by Katarzyna Musial et al.
-
Summary of Tas-tsc: a Data-driven Framework For Estimating Time Of Arrival Using Temporal-attribute-spatial Tri-space Coordination Of Truck Trajectories, by Mengran Li et al.
-
Summary of Object Agnostic 3d Lifting in Space and Time, by Christopher Fusco et al.
-
Summary of Obi-bench: Can Lmms Aid in Study Of Ancient Script on Oracle Bones?, by Zijian Chen et al.
-
Summary of Instantswap: Fast Customized Concept Swapping Across Sharp Shape Differences, by Chenyang Zhu et al.
-
Summary of Painternet: Adaptive Image Inpainting with Actual-token Attention and Diverse Mask Control, by Ruichen Wang et al.
-
Summary of Schedule on the Fly: Diffusion Time Prediction For Faster and Better Image Generation, by Zilyu Ye et al.
-
Summary of Best Practices For Large Language Models in Radiology, by Christian Bluethgen et al.
-
Summary of Collaborative Instance Object Navigation: Leveraging Uncertainty-awareness to Minimize Human-agent Dialogues, by Francesco Taioli et al.
-
Summary of Exploring React Prompting For Task-oriented Dialogue: Insights and Shortcomings, by Michelle Elizabeth et al.
-
Summary of Indexing Economic Fluctuation Narratives From Keiki Watchers Survey, by Eriko Shigetsugu et al.
-
Summary of Mulan: Adapting Multilingual Diffusion Models For Hundreds Of Languages with Negligible Cost, by Sen Xing et al.
-
Summary of Lvlm-count: Enhancing the Counting Ability Of Large Vision-language Models, by Muhammad Fetrat Qharabagh et al.
-
Summary of A Comparative Study Of Llm-based Asr and Whisper in Low Resource and Code Switching Scenario, by Zheshu Song and Ziyang Ma and Yifan Yang and Jianheng Zhuo and Xie Chen
-
Summary of The Advancement Of Personalized Learning Potentially Accelerated by Generative Ai, By Yuang Wei et al.
-
Summary of Towards Adaptive Mechanism Activation in Language Agent, by Ziyang Huang et al.
-
Summary of Adascale: Dynamic Context-aware Dnn Scaling Via Automated Adaptation Loop on Mobile Devices, by Yuzhan Wang et al.
-
Summary of Revisiting Self-supervised Heterogeneous Graph Learning From Spectral Clustering Perspective, by Yujie Mo and Zhihe Lu and Runpeng Yu and Xiaofeng Zhu and Xinchao Wang
-
Summary of Exploring Cognition Through Morphological Info-computational Framework, by Gordana Dodig-crnkovic
-
Summary of Rethinking Cognition: Morphological Info-computation and the Embodied Paradigm in Life and Artificial Intelligence, by Gordana Dodig-crnkovic
-
Summary of Ctrlnerf: the Generative Neural Radiation Fields For the Controllable Synthesis Of High-fidelity 3d-aware Images, by Jian Liu and Zhen Yu
-
Summary of Pgso: Prompt-based Generative Sequence Optimization Network For Aspect-based Sentiment Analysis, by Hao Dong et al.
-
Summary of Selfprompt: Autonomously Evaluating Llm Robustness Via Domain-constrained Knowledge Guidelines and Refined Adversarial Prompts, by Aihua Pei et al.
-
Summary of Divd: Deblurring with Improved Video Diffusion Model, by Haoyang Long and Yan Wang and Wendong Wang
-
Summary of Local Vs. Global: Local Land-use and Land-cover Models Deliver Higher Quality Maps, by Girmaw Abebe Tadesse et al.
-
Summary of Long Text Outline Generation: Chinese Text Outline Based on Unsupervised Framework and Large Language Mode, by Yan Yan and Yuanchi Ma
-
Summary of Improving Physics Reasoning in Large Language Models Using Mixture Of Refinement Agents, by Raj Jaiswal et al.
-
Summary of Alignmamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment, by Yan Li et al.
-
Summary of Improving Multimodal Llms Ability in Geometry Problem Solving, Reasoning, and Multistep Scoring, by Avinash Anand et al.
-
Summary of Knowledgeprompts: Exploring the Abilities Of Large Language Models to Solve Proportional Analogies Via Knowledge-enhanced Prompting, by Thilini Wijesiriwardene and Ruwan Wickramarachchi and Sreeram Vennam and Vinija Jain and Aman Chadha and Amitava Das and Ponnurangam Kumaraguru and Amit Sheth
-
Summary of Playable Game Generation, by Mingyu Yang et al.
-
Summary of Safety Alignment Backfires: Preventing the Re-emergence Of Suppressed Concepts in Fine-tuned Text-to-image Diffusion Models, by Sanghyun Kim et al.
-
Summary of Strategic Application Of Aigc For Uav Trajectory Design: a Channel Knowledge Map Approach, by Chiya Zhang et al.
-
Summary of Droidcall: a Dataset For Llm-powered Android Intent Invocation, by Weikai Xie et al.
-
Summary of Federated Progressive Self-distillation with Logits Calibration For Personalized Iiot Edge Intelligence, by Yingchao Wang and Wenqi Niu
-
Summary of Freecond: Free Lunch in the Input Conditions Of Text-guided Inpainting, by Teng-fang Hsiao et al.
-
Summary of Learner Attentiveness and Engagement Analysis in Online Education Using Computer Vision, by Sharva Gogawale et al.
-
Summary of Optimizing Sequential Recommendation Models with Scaling Laws and Approximate Entropy, by Tingjia Shen et al.
-
Summary of Agribench: a Hierarchical Agriculture Benchmark For Multimodal Large Language Models, by Yutong Zhou and Masahiro Ryo
-
Summary of Benchmark Real-time Adaptation and Communication Capabilities Of Embodied Agent in Collaborative Scenarios, by Shipeng Liu et al.
-
Summary of Node Importance Estimation Leveraging Llms For Semantic Augmentation in Knowledge Graphs, by Xinyu Lin et al.
-
Summary of Lambda: Covering the Multimodal Critical Scenarios For Automated Driving Systems by Search Space Quantization, By Xinzheng Wu et al.
-
Summary of Fullstack Bench: Evaluating Llms As Full Stack Coders, by Bytedance-seed-foundation-code-team: Yao Cheng et al.
-
Summary of Rethinking Generalizability and Discriminability Of Self-supervised Learning From Evolutionary Game Theory Perspective, by Jiangmeng Li et al.
-
Summary of Motion Dreamer: Boundary Conditional Motion Reasoning For Physically Coherent Video Generation, by Tianshuo Xu et al.
-
Summary of Unveiling Performance Challenges Of Large Language Models in Low-resource Healthcare: a Demographic Fairness Perspective, by Yue Zhou et al.
-
Summary of Turing Representational Similarity Analysis (rsa): a Flexible Method For Measuring Alignment Between Human and Artificial Intelligence, by Mattson Ogg et al.
-
Summary of Phyt2v: Llm-guided Iterative Self-refinement For Physics-grounded Text-to-video Generation, by Qiyao Xue et al.
-
Summary of Fairness at Every Intersection: Uncovering and Mitigating Intersectional Biases in Multimodal Clinical Predictions, by Resmi Ramachandranpillai et al.
-
Summary of Leveraging Llm For Automated Ontology Extraction and Knowledge Graph Generation, by Mohammad Sadeq Abolhasani et al.
-
Summary of Proceedings Of the 2024 Xcsp3 Competition, by Gilles Audemard et al.
-
Summary of Orthus: Autoregressive Interleaved Image-text Generation with Modality-specific Heads, by Siqi Kou et al.
-
Summary of Open-sora Plan: Open-source Large Video Generation Model, by Bin Lin et al.
-
Summary of Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-language Classifiers, by Chancharik Mitra et al.
-
Summary of Dlava: Document Language and Vision Assistant For Answer Localization with Enhanced Interpretability and Trustworthiness, by Ahmad Mohammadshirazi et al.
-
Summary of To Ensemble or Not: Assessing Majority Voting Strategies For Phishing Detection with Large Language Models, by Fouad Trad et al.
-
Summary of Towards the Ultimate Programming Language: Trust and Benevolence in the Age Of Artificial Intelligence, by Bartosz Sawicki et al.
-
Summary of Digital Twin in Industries: a Comprehensive Survey, by Md Bokhtiar Al Zami et al.
-
Summary of An Ai-driven Data Mesh Architecture Enhancing Decision-making in Infrastructure Construction and Public Procurement, by Saurabh Mishra et al.
-
Summary of Twisted Convolutional Networks (tcns): Enhancing Feature Interactions For Non-spatial Data Classification, by Junbo Jacob Lian
-
Summary of Fine Tuning Large Language Models to Deliver Cbt For Depression, by Talha Tahir
-
Summary of Adapting the Re-id Challenge For Static Sensors, by Avirath Sundaresan et al.
-
Summary of Plancritic: Formal Planning with Human Feedback, by Owen Burns et al.
-
Summary of Cognitive Biases in Large Language Models: a Survey and Mitigation Experiments, by Yasuaki Sumita et al.
-
Summary of Eftvit: Efficient Federated Training Of Vision Transformers with Masked Images on Resource-constrained Edge Devices, by Meihan Wu et al.
-
Summary of Empowering the Deaf and Hard Of Hearing Community: Enhancing Video Captions Using Large Language Models, by Nadeen Fathallah et al.
-
Summary of Cada: Cross-problem Routing Solver with Constraint-aware Dual-attention, by Han Li et al.
-
Summary of Enhancing Zero-shot Chain Of Thought Prompting Via Uncertainty-guided Strategy Selection, by Shanu Kumar et al.
-
Summary of Talking to Dino: Bridging Self-supervised Vision Backbones with Language For Open-vocabulary Segmentation, by Luca Barsellotti et al.
-
Summary of Omulet: Orchestrating Multiple Tools For Practicable Conversational Recommendation, by Se-eun Yoon et al.
-
Summary of Integrating Transit Signal Priority Into Multi-agent Reinforcement Learning Based Traffic Signal Control, by Dickness Kakitahi Kwesiga et al.
-
Summary of Beyond Surface Structure: a Causal Assessment Of Llms’ Comprehension Ability, by Yujin Han et al.
-
Summary of Tqa-bench: Evaluating Llms For Multi-table Question Answering with Scalable Context and Symbolic Extension, by Zipeng Qiu et al.