Paper List
-
Summary of Flair: Vlm with Fine-grained Language-informed Image Representations, by Rui Xiao et al.
-
Summary of Human Evaluation Of Procedural Knowledge Graph Extraction From Text with Large Language Models, by Valentina Anita Carriero and Antonia Azzini and Ilaria Baroni and Mario Scrocca and Irene Celino
-
Summary of The Matrix: Infinite-horizon World Generation with Real-time Moving Control, by Ruili Feng et al.
-
Summary of Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3d Objects, by Abdurrahman Zeybey et al.
-
Summary of Cnnsum: Exploring Long-context Summarization with Large Language Models in Chinese Novels, by Lingxiao Wei et al.
-
Summary of Minimization Of Boolean Complexity in In-context Concept Learning, by Leroy Z. Wang et al.
-
Summary of Flame 3 Dataset: Unleashing the Power Of Radiometric Thermal Uav Imagery For Wildfire Management, by Bryce Hopkins et al.
-
Summary of A Novel Compact Llm Framework For Local, High-privacy Ehr Data Applications, by Yixiang Qu et al.
-
Summary of Constrained Identifiability Of Causal Effects, by Yizuo Chen et al.
-
Summary of Deep-learning Based Docking Methods: Fair Comparisons to Conventional Docking Workflows, by Ajay N. Jain et al.
-
Summary of Mld-ea: Check and Complete Narrative Coherence by Introducing Emotions and Actions, by Jinming Zhang et al.
-
Summary of Panoptic Diffusion Models: Co-generation Of Images and Segmentation Maps, by Yinghan Long et al.
-
Summary of Human Multi-view Synthesis From a Single-view Model: Transferred Body and Face Representations, by Yu Feng et al.
-
Summary of Pemf-vto: Point-enhanced Video Virtual Try-on Via Mask-free Paradigm, by Tianyu Chang et al.
-
Summary of Specification Generation For Neural Networks in Systems, by Isha Chaudhary et al.
-
Summary of Tokenflow: Unified Image Tokenizer For Multimodal Understanding and Generation, by Liao Qu et al.
-
Summary of Preference-based Opponent Shaping in Differentiable Games, by Xinyu Qiao et al.
-
Summary of Chatts: Aligning Time Series with Llms Via Synthetic Data For Enhanced Understanding and Reasoning, by Zhe Xie et al.
-
Summary of Experience-driven Discovery Of Planning Strategies, by Ruiqi He et al.
-
Summary of Fine-grained Behavior Simulation with Role-playing Large Language Model on Social Media, by Kun Li et al.
-
Summary of Robust Multi-bit Text Watermark with Llm-based Paraphrasers, by Xiaojun Xu et al.
-
Summary of F-se-lstm: a Time Series Anomaly Detection Method with Frequency Domain Information, by Yi-xiang Lu et al.
-
Summary of Fcl-vit: Task-aware Attention Tuning For Continual Learning, by Anestis Kaimakamidis et al.
-
Summary of Towards Rich Emotions in 3d Avatars: a Text-to-3d Avatar Generation Benchmark, by Haidong Xu et al.
-
Summary of Bias Analysis Of Ai Models For Undergraduate Student Admissions, by Kelly Van Busum and Shiaofen Fang
-
Summary of Wem-gan: Wavelet Transform Based Facial Expression Manipulation, by Dongya Sun et al.
-
Summary of Graph-powered Defense: Controller Area Network Intrusion Detection For Unmanned Aerial Vehicles, by Reek Majumder et al.
-
Summary of Semantic Tokens in Retrieval Augmented Generation, by Joel Suro
-
Summary of Factored Space Models: Towards Causality Between Levels Of Abstraction, by Scott Garrabrant et al.
-
Summary of Ai-driven Resource Allocation Framework For Microservices in Hybrid Cloud Platforms, by Biman Barua and M. Shamim Kaiser
-
Summary of Av-odyssey Bench: Can Your Multimodal Llms Really Understand Audio-visual Information?, by Kaixiong Gong et al.
-
Summary of Time-reversal Provides Unsupervised Feedback to Llms, by Yerram Varun et al.
-
Summary of Projection Abstractions in Planning Under the Lenses Of Abstractions For Mdps, by Giuseppe Canonaco et al.
-
Summary of Scaling Image Tokenizers with Grouped Spherical Quantization, by Jiangtao Wang et al.
-
Summary of Qa-toolbox: Conversational Question-answering For Process Task Guidance in Manufacturing, by Ramesh Manuvinakurike et al.
-
Summary of Anigs: Animatable Gaussian Avatar From a Single Image with Inconsistent Gaussian Reconstruction, by Lingteng Qiu et al.
-
Summary of Scalable Image Tokenization with Index Backpropagation Quantization, by Fengyuan Shi et al.
-
Summary of Applying Irt to Distinguish Between Human and Generative Ai Responses to Multiple-choice Assessments, by Alona Strugatski and Giora Alexandron
-
Summary of Hybrid-squad: Hybrid Scholarly Question Answering Dataset, by Tilahun Abedissa Taffa et al.
-
Summary of An Evolutionary Large Language Model For Hallucination Mitigation, by Abdennour Boulesnane and Abdelhakim Souilah
-
Summary of Optimization Of Transformer Heart Disease Prediction Model Based on Particle Swarm Optimization Algorithm, by Jingyuan Yi et al.
-
Summary of A Privacy-preserving Distributed Credible Evidence Fusion Algorithm For Collective Decision-making, by Chaoxiong Ma et al.
-
Summary of Personalized Multimodal Large Language Models: a Survey, by Junda Wu et al.
-
Summary of Visco: Benchmarking Fine-grained Critique and Correction Towards Self-improvement in Visual Reasoning, by Xueqing Wu et al.
-
Summary of Mining Tweets to Predict Future Bitcoin Price, by Ashutosh Hathidara et al.
-
Summary of Analyzing the Impact Of Ai Tools on Student Study Habits and Academic Performance, by Ben Ward et al.
-
Summary of Anatomically-grounded Fact Checking Of Automated Chest X-ray Reports, by R. Mahmood et al.
-
Summary of Keeping Experts in the Loop: Expert-guided Optimization For Clinical Data Classification Using Large Language Models, by Nader Karayanni et al.
-
Summary of Videoicl: Confidence-based Iterative In-context Learning For Out-of-distribution Video Understanding, by Kangsan Kim et al.
-
Summary of Layoutvlm: Differentiable Optimization Of 3d Layout Via Vision-language Models, by Fan-yun Sun et al.
-
Summary of Comparative Performance Of Machine Learning Algorithms For Early Genetic Disorder and Subclass Classification, by Abu Bakar Siddik et al.
-
Summary of Deep Learning Approach For Predicting the Replicator Equation in Evolutionary Game Theory, by Advait Chandorkar
-
Summary of Cross-attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-image Generative Models, by Jungwon Park et al.
-
Summary of Videogen-of-thought: Step-by-step Generating Multi-shot Video with Minimal Manual Intervention, by Mingzhe Zheng et al.
-
Summary of Sustainable Self-evolution Adversarial Training, by Wenxuan Wang et al.
-
Summary of A Comprehensive Evaluation Of Large Language Models on Aspect-based Sentiment Analysis, by Changzhi Zhou et al.
-
Summary of Large Multimodal Agents For Accurate Phishing Detection with Enhanced Token Optimization and Cost Reduction, by Fouad Trad et al.
-
Summary of Ah-ocda: Amplitude-based Curriculum Learning and Hopfield Segmentation Model For Open Compound Domain Adaptation, by Jaehyun Choi et al.
-
Summary of Scimage: How Good Are Multimodal Large Language Models at Scientific Text-to-image Generation?, by Leixin Zhang et al.
-
Summary of Gerps-compare: Comparing Ner Methods For Legal Norm Analysis, by Sarah T. Bachinger et al.
-
Summary of Gracefully Filtering Backdoor Samples For Generative Large Language Models Without Retraining, by Zongru Wu et al.
-
Summary of Uncertainty-aware Regularization For Image-to-image Translation, by Anuja Vats et al.
-
Summary of Digital Epidemiology: Leveraging Social Media For Insight Into Epilepsy and Mental Health, by Liza Dahiya et al.
-
Summary of Quantifying the Reliability Of Predictions in Detection Transformers: Object-level Calibration and Image-level Uncertainty, by Young-jin Park and Carson Sobolewski and Navid Azizan
-
Summary of [cls] Attention Is All You Need For Training-free Visual Token Pruning: Make Vlm Inference Faster, by Qizhe Zhang et al.
-
Summary of Noise Injection Reveals Hidden Capabilities Of Sandbagging Language Models, by Cameron Tice et al.
-
Summary of Iqa-adapter: Exploring Knowledge Transfer From Image Quality Assessment to Diffusion-based Generative Models, by Khaled Abud et al.
-
Summary of Randar: Decoder-only Autoregressive Visual Generation in Random Orders, by Ziqi Pang et al.
-
Summary of The Reality Of Ai and Biorisk, by Aidan Peppin et al.
-
Summary of Recurrent Neural Network on Picture Model, by Weihan Xu
-
Summary of The Evolution and Future Perspectives Of Artificial Intelligence Generated Content, by Chengzhang Zhu et al.
-
Summary of Real-time Multilingual Sign Language Processing, by Amit Moryossef
-
Summary of The Use Of Large Language Models to Enhance Cancer Clinical Trial Educational Materials, by Mingye Gao et al.
-
Summary of Enhancing Deep Learning Model Robustness Through Metamorphic Re-training, by Said Togru et al.
-
Summary of Usage Governance Advisor: From Intent to Ai Governance, by Elizabeth M. Daly et al.
-
Summary of Llms4life: Large Language Models For Ontology Learning in Life Sciences, by Nadeen Fathallah et al.
-
Summary of Construction and Optimization Of Health Behavior Prediction Model For the Elderly in Smart Elderly Care, by Qian Guo et al.
-
Summary of Trust & Safety Of Llms and Llms in Trust & Safety, by Doohee You et al.
-
Summary of Accdiffusion V2: Towards More Accurate Higher-resolution Diffusion Extrapolation, by Zhihang Lin et al.
-
Summary of Beyond Generation: Unlocking Universal Editing Via Self-supervised Fine-tuning, by Harold Haodong Chen et al.
-
Summary of Graph Learning For Planning: the Story Thus Far and Open Challenges, by Dillon Z. Chen et al.
-
Summary of Pld+: Accelerating Llm Inference by Leveraging Language Model Artifacts, by Shwetha Somasundaram et al.
-
Summary of Artificial Intelligence For Geometry-based Feature Extraction, Analysis and Synthesis in Artistic Images: a Survey, by Mridula Vijendran et al.
-
Summary of Fastrm: An Efficient and Automatic Explainability Framework For Multimodal Generative Models, by Gabriela Ben-melech Stan et al.
-
Summary of Intelligent Spark Agents: a Modular Langgraph Framework For Scalable, Visualized, and Enhanced Big Data Machine Learning Workflows, by Jialin Wang and Zhihua Duan
-
Summary of Artbrain: An Explainable End-to-end Toolkit For Classification and Attribution Of Ai-generated Art and Style, by Ravidu Suien Rammuni Silva et al.
-
Summary of Copyrightshield: Spatial Similarity Guided Backdoor Defense Against Copyright Infringement in Diffusion Models, by Zhixiang Guo et al.
-
Summary of Videolights: Feature Refinement and Cross-task Alignment Transformer For Joint Video Highlight Detection and Moment Retrieval, by Dhiman Paul et al.
-
Summary of Seqafford: Sequential 3d Affordance Reasoning Via Multimodal Large Language Model, by Chunlin Yu et al.
-
Summary of Mba-rag: a Bandit Approach For Adaptive Retrieval-augmented Generation Through Question Complexity, by Xiaqiang Tang et al.
-
Summary of Handwriting-based Automated Assessment and Grading Of Degree Of Handedness: a Pilot Study, by Smriti Bala et al.
-
Summary of Ncdd: Nearest Centroid Distance Deficit For Out-of-distribution Detection in Gastrointestinal Vision, by Sandesh Pokhrel et al.
-
Summary of Medchain: Bridging the Gap Between Llm Agents and Clinical Practice Through Interactive Sequential Benchmarking, by Jie Liu et al.
-
Summary of If Eleanor Rigby Had Met Chatgpt: a Study on Loneliness in a Post-llm World, by Adrian De Wynter
-
Summary of Agentic-hls: An Agentic Reasoning Based High-level Synthesis System Using Large Language Models (ai For Eda Workshop 2024), by Ali Emre Oztas et al.
-
Summary of Nyt-connections: a Deceptively Simple Text Classification Task That Stumps System-1 Thinkers, by Angel Yahir Loredo Lopez et al.
-
Summary of Image Forgery Localization Via Guided Noise and Multi-scale Feature Aggregation, by Yakun Niu et al.