Paper List
We recommend you use the search box as this list is very long.
-
Summary of Simply Trainable Nearest Neighbour Machine Translation with Gpu Inference, by Hossam Amer et al.
-
Summary of Robust Conformal Volume Estimation in 3d Medical Images, by Benjamin Lambert et al.
-
Summary of A Study on the Implementation Method Of An Agent-based Advanced Rag System Using Graph, by Cheonsu Jeong
-
Summary of Appformer: a Novel Framework For Mobile App Usage Prediction Leveraging Progressive Multi-modal Data Fusion and Feature Extraction, by Chuike Sun et al.
-
Summary of Asi-seg: Audio-driven Surgical Instrument Segmentation with Surgeon Intention Understanding, by Zhen Chen et al.
-
Summary of Conversational Ai Multi-agent Interoperability, Universal Open Apis For Agentic Natural Language Multimodal Communications, by Diego Gosmar et al.
-
Summary of Multi-task Neural Networks For Pain Intensity Estimation Using Electrocardiogram and Demographic Factors, by Stefanos Gkikas et al.
-
Summary of Official-nv: An Llm-generated News Video Dataset For Multimodal Fake News Detection, by Yihao Wang et al.
-
Summary of Versusdebias: Universal Zero-shot Debiasing For Text-to-image Models Via Slm-based Prompt Engineering and Generative Adversary, by Hanjun Luo et al.
-
Summary of Wecromcl: Weakly Supervised Cross-modality Contrastive Learning For Transcription-only Supervised Text Spotting, by Jingjing Wu et al.
-
Summary of Forecast-peft: Parameter-efficient Fine-tuning For Pre-trained Motion Forecasting Models, by Jifeng Wang et al.
-
Summary of Are Llms Good Annotators For Discourse-level Event Relation Extraction?, by Kangda Wei et al.
-
Summary of Meta-rewarding Language Models: Self-improving Alignment with Llm-as-a-meta-judge, by Tianhao Wu et al.
-
Summary of You Shall Know a Piece by the Company It Keeps. Chess Plays As a Data For Word2vec Models, by Boris Orekhov
-
Summary of Enhancing Code Translation in Language Models with Few-shot Learning Via Retrieval-augmented Generation, by Manish Bhattarai et al.
-
Summary of Llms’ Understanding Of Natural Language Revealed, by Walid S. Saba
-
Summary of Optimus-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale, by Ali Ahmaditeshnizi et al.
-
Summary of Prometheus Chatbot: Knowledge Graph Collaborative Large Language Model For Computer Components Recommendation, by Yunsheng Wang et al.
-
Summary of Foundations For Unfairness in Anomaly Detection — Case Studies in Facial Imaging Data, by Michael Livanos and Ian Davidson
-
Summary of Smart Language Agents in Real-world Planning, by Annabelle Miin et al.
-
Summary of Harnessing Large Vision and Language Models in Agriculture: a Review, by Hongyan Zhu et al.
-
Summary of Ai-driven Healthcare: a Survey on Ensuring Fairness and Mitigating Bias, by Sribala Vidyadhari Chinta et al.
-
Summary of Gpt Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves, by Denis Peskoff et al.
-
Summary of Greedy Output Approximation: Towards Efficient Structured Pruning For Llms Without Retraining, by Jianwei Li, Yijun Dong, and Qi Lei
-
Summary of Large Language Models As Co-pilots For Causal Inference in Medical Studies, by Ahmed Alaa et al.
-
Summary of Farssibert: a Novel Transformer-based Model For Semantic Similarity Measurement Of Persian Social Networks Informal Texts, by Seyed Mojtaba Sadjadi et al.
-
Summary of Llava-read: Enhancing Reading Ability Of Multimodal Language Models, by Ruiyi Zhang et al.
-
Summary of Why Misinformation Is Created? Detecting Them by Integrating Intent Features, by Bing Wang et al.
-
Summary of On Behalf Of the Stakeholders: Trends in Nlp Model Interpretability in the Era Of Llms, by Nitay Calderon et al.
-
Summary of Faster Image2video Generation: a Closer Look at Clip Image Embedding’s Impact on Spatio-temporal Cross-attentions, by Ashkan Taghipour et al.
-
Summary of Mamba-uie: Enhancing Underwater Images with Physical Model Constraint, by Song Zhang et al.
-
Summary of Interactive Learning in Computer Science Education Supported by a Discord Chatbot, by Santiago Berrezueta-Guzman et al.
-
Summary of Fine-grained Scene Graph Generation Via Sample-level Bias Prediction, by Yansheng Li et al.
-
Summary of Large Language Models For Human-like Autonomous Driving: a Survey, by Yun Li et al.
-
Summary of Multi-modal Clip-informed Protein Editing, by Mingze Yin et al.
-
Summary of Inference-time Selective Debiasing to Enhance Fairness in Text Classification Models, by Gleb Kuzmin et al.
-
Summary of Semantic Communication Enhanced by Knowledge Graph Representation Learning, by Nour Hello et al.
-
Summary of Integrating Cognitive Ai with Generative Models For Enhanced Question Answering in Skill-based Learning, by Rochan H. Madhusudhana et al.
-
Summary of Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering, by Mahiro Ukai et al.
-
Summary of Logic Distillation: Learning From Code Function by Function For Planning and Decision-making, by Dong Chen et al.
-
Summary of Identity-driven Hierarchical Role-playing Agents, by Libo Sun et al.
-
Summary of A Generic Review Of Integrating Artificial Intelligence in Cognitive Behavioral Therapy, by Meng Jiang et al.
-
Summary of Collaborative Evolving Strategy For Automatic Data-centric Development, by Xu Yang et al.
-
Summary of Neurosymbolic Ai For Enhancing Instructability in Generative Ai, by Amit Sheth et al.
-
Summary of Multi-robot System Architecture Design in Sysml and Bpmn, by Ahmed R. Sadik (Honda Research Institute Europe) et al.
-
Summary of Towards Generalized Offensive Language Identification, by Alphaeus Dmonte et al.
-
Summary of Score Matching Through the Roof: Linear, Nonlinear, and Latent Variables Causal Discovery, by Francesco Montagna et al.
-
Summary of Understanding Xai Through the Philosopher’s Lens: a Historical Perspective, by Martina Mattioli et al.
-
Summary of Any Four Real Numbers Are on All Fours with Analogy, by Yves Lepage et al.
-
Summary of Unifying Visual and Semantic Feature Spaces with Diffusion Models For Enhanced Cross-modal Alignment, by Yuze Zheng et al.
-
Summary of Predicting Winning Captions For Weekly New Yorker Comics, by Stanley Cao et al.
-
Summary of Unexplainability Of Artificial Intelligence Judgments in Kant’s Perspective, by Jongwoo Seo
-
Summary of Mmau: a Holistic Benchmark Of Agent Capabilities Across Diverse Domains, by Guoli Yin et al.
-
Summary of Generative Ai Augmented Induction-based Formal Verification, by Aman Kumar et al.
-
Summary of Intelligence Analysis Of Language Models, by Liane Galanti and Ethan Baron
-
Summary of Towards Automated Solution Recipe Generation For Industrial Asset Management with Llm, by Nianjun Zhou et al.
-
Summary of Towards a Cyber Information Ontology, by David Limbaugh et al.
-
Summary of A Fault Prognostic System For the Turbine Guide Bearings Of a Hydropower Plant Using Long-short Term Memory (lstm), by Yasir Saleem Afridi et al.
-
Summary of Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain Through Large Language Models, by Jia-hong Huang et al.
-
Summary of Configural Processing As An Optimized Strategy For Robust Object Recognition in Neural Networks, by Hojin Jang et al.
-
Summary of Many-shot In-context Learning For Molecular Inverse Design, by Saeed Moayedpour et al.
-
Summary of Wonderful Team: Zero-shot Physical Task Planning with Visual Llms, by Zidan Wang et al.
-
Summary of Personalized and Context-aware Route Planning For Edge-assisted Vehicles, by Dinesh Cyril Selvaraj et al.
-
Summary of Restoreagent: Autonomous Image Restoration Agent Via Multimodal Large Language Models, by Haoyu Chen et al.
-
Summary of Attentionhand: Text-driven Controllable Hand Image Generation For 3d Hand Reconstruction in the Wild, by Junho Park et al.
-
Summary of Gaussiansr: High Fidelity 2d Gaussian Splatting For Arbitrary-scale Image Super-resolution, by Jintong Hu et al.
-
Summary of Difficulty Estimation and Simplification Of French Text Using Llms, by Henri Jamet et al.
-
Summary of Peft-u: Parameter-efficient Fine-tuning For User Personalization, by Christopher Clarke et al.
-
Summary of Self-supervised Pre-training with Diffusion Model For Few-shot Landmark Detection in X-ray Images, by Roberto Di Via et al.
-
Summary of Dallah: a Dialect-aware Multimodal Large Language Model For Arabic, by Fakhraddin Alwajih et al.
-
Summary of Taxonomy-aware Continual Semantic Segmentation in Hyperbolic Spaces For Open-world Perception, by Julia Hindel et al.
-
Summary of Affectively Framework: Towards Human-like Affect-based Agents, by Matthew Barthet et al.
-
Summary of Pianomime: Learning a Generalist, Dexterous Piano Player From Internet Demonstrations, by Cheng Qian et al.
-
Summary of Combining Cognitive and Generative Ai For Self-explanation in Interactive Ai Agents, by Shalini Sushri et al.
-
Summary of Robust Claim Verification Through Fact Detection, by Nazanin Jafari et al.
-
Summary of Mixed Non-linear Quantization For Vision Transformers, by Gihwan Kim et al.
-
Summary of A Role-specific Guided Large Language Model For Ophthalmic Consultation Based on Stylistic Differentiation, by Laiyi Fu et al.
-
Summary of A Reliable Common-sense Reasoning Socialbot Built Using Llms and Goal-directed Asp, by Yankai Zeng et al.
-
Summary of A Universal Prompting Strategy For Extracting Process Model Information From Natural Language Text Using Large Language Models, by Julian Neuberger et al.
-
Summary of Learning Robust Named Entity Recognizers From Noisy Data with Retrieval Augmentation, by Chaoyi Ai et al.
-
Summary of Dynamic Language Group-based Moe: Enhancing Code-switching Speech Recognition with Hierarchical Routing, by Hukai Huang et al.
-
Summary of Every Part Matters: Integrity Verification Of Scientific Figures Based on Multimodal Large Language Models, by Xiang Shi et al.
-
Summary of Streamtinynet: Video Streaming Analysis with Spatial-temporal Tinyml, by Hazem Hesham Yousef Shalby et al.
-
Summary of A Process Algebraic Framework For Multi-agent Dynamic Epistemic Systems, by Alessandro Aldini
-
Summary of Comoto: Unpaired Cross-modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis, by Muhammad Alberb et al.
-
Summary of Cityx: Controllable Procedural Content Generation For Unbounded 3d Cities, by Shougao Zhang et al.
-
Summary of Examining the Influence Of Political Bias on Large Language Model Performance in Stance Classification, by Lynnette Hui Xian Ng et al.
-
Summary of Crasar-u-droids: a Large Scale Benchmark Dataset For Building Alignment and Damage Assessment in Georectified Suas Imagery, by Thomas Manzini et al.
-
Summary of Enhancing Agent Learning Through World Dynamics Modeling, by Zhiyuan Sun et al.
-
Summary of Cost-effective Instruction Learning For Pathology Vision and Language Analysis, by Kaitao Chen et al.
-
Summary of Mpox Detection Advanced: Rapid Epidemic Response Through Synthetic Data, by Yudara Kularathne et al.
-
Summary of How Lightweight Can a Vision Transformer Be, by Jen Hong Tan
-
Summary of A Unified Understanding Of Adversarial Vulnerability Regarding Unimodal Models and Vision-language Pre-training Models, by Haonan Zheng et al.
-
Summary of Untrained Neural Networks Can Demonstrate Memorization-independent Abstract Reasoning, by Tomer Barak and Yonatan Loewenstein
-
Summary of Enhancing Model Performance: Another Approach to Vision-language Instruction Tuning, by Vedanshu et al.
-
Summary of Umono: Physical Model Informed Hybrid Cnn-transformer Framework For Underwater Monocular Depth Estimation, by Jian Wang et al.
-
Summary of Shapley Value-based Contrastive Alignment For Multimodal Information Extraction, by Wen Luo, Yu Xia, Shen Tianshu, and Sujian Li
-
Summary of Mew: Multiplexed Immunofluorescence Image Analysis Through An Efficient Multiplex Network, by Sukwon Yun et al.