Paper List

We recommend you use the search box as this list is very long.

Summary of Unipts: a Unified Framework For Proficient Post-training Sparsity, by Jingjing Xie et al.
Summary of Why Reinforcement Learning in Energy Systems Needs Explanations, by Hallah Shahid Butt et al.
Summary of Letsmap: Unsupervised Representation Learning For Semantic Bev Mapping, by Nikhil Gosala et al.
Summary of Competevo: Towards Morphological Evolution From Competition, by Kangyao Huang et al.
Summary of Dsdl: Data Set Description Language For Bridging Modalities and Tasks in Ai Data, by Bin Wang et al.
Summary of Self-supervised Learning Based Handwriting Verification, by Mihir Chauhan et al.
Summary of Sce-mae: Selective Correspondence Enhancement with Masked Autoencoder For Self-supervised Landmark Estimation, by Kejia Yin et al.
Summary of Intelligent Clinical Documentation: Harnessing Generative Ai For Patient-centric Clinical Note Generation, by Anjanava Biswas et al.
Summary of The Battle Of Llms: a Comparative Study in Conversational Qa Tasks, by Aryan Rangapur et al.
Summary of Frustratingly Easy Test-time Adaptation Of Vision-language Models, by Matteo Farina et al.
Summary of Llama-nas: Efficient Neural Architecture Search For Large Language Models, by Anthony Sarah et al.
Summary of A Review and Implementation Of Object Detection Models and Optimizations For Real-time Medical Mask Detection During the Covid-19 Pandemic, by Ioanna Gogou et al.
Summary of Widin: Wording Image For Domain-invariant Representation in Single-source Domain Generalization, by Jiawei Ma et al.
Summary of Raccoon: a Versatile Instructional Video Editing Framework with Auto-generated Narratives, by Jaehong Yoon et al.
Summary of Vig: Linear-complexity Visual Sequence Learning with Gated Linear Attention, by Bencheng Liao et al.
Summary of Gflow: Recovering 4d World From Monocular Video, by Shizun Wang et al.
Summary of Dig: Scalable and Efficient Diffusion Models with Gated Linear Attention, by Lianghui Zhu et al.
Summary of Llms and Memorization: on Quality and Specificity Of Copyright Compliance, by Felix B Mueller et al.
Summary of Improved Emotional Alignment Of Ai and Humans: Human Ratings Of Emotions Expressed by Stable Diffusion V1, Dall-e 2, and Dall-e 3, By James Derek Lomas et al.
Summary of Mm-mixing: Multi-modal Mixing Alignment For 3d Understanding, by Jiaze Wang et al.
Summary of Faiir: Building Toward a Conversational Ai Agent Assistant For Youth Mental Health Service Provision, by Stephen Obadinma et al.
Summary of Unleashing the Potential Of Text-attributed Graphs: Automatic Relation Decomposition Via Large Language Models, by Hyunjin Seo et al.
Summary of Chatgpt As the Marketplace Of Ideas: Should Truth-seeking Be the Goal Of Ai Content Governance?, by Jiawei Zhang
Summary of Mavin: Multi-action Video Generation with Diffusion Models Via Transition Video Infilling, by Bowen Zhang et al.
Summary of On Creativity and Open-endedness, by L. B. Soros et al.
Summary of Where’s Waldo: Diffusion Features For Personalized Segmentation and Retrieval, by Dvir Samuel et al.
Summary of Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model, by Wenbing Li et al.
Summary of Edinburgh Clinical Nlp at Mediqa-corr 2024: Guiding Large Language Models with Hints, by Aryo Pradipta Gema et al.
Summary of Automated Real-world Sustainability Data Generation From Images Of Buildings, by Peter J Bentley et al.
Summary of Effovpr: Effective Foundation Model Utilization For Visual Place Recognition, by Issar Tzachor et al.
Summary of Towards Dialogues For Joint Human-ai Reasoning and Value Alignment, by Elfia Bezou-vrakatseli and Oana Cocarascu and Sanjay Modgil
Summary of Llm Experiments with Simulation: Large Language Model Multi-agent System For Simulation Model Parametrization in Digital Twins, by Yuchen Xia et al.
Summary of A Unified Temporal Knowledge Graph Reasoning Model Towards Interpolation and Extrapolation, by Kai Chen et al.
Summary of Facilitating Multi-role and Multi-behavior Collaboration Of Large Language Models For Online Job Seeking and Recruiting, by Hongda Sun et al.
Summary of An Agent Design with Goal Reaching Guarantees For Enhancement Of Learning, by Pavel Osinenko et al.
Summary of Pytag: Tabletop Games For Multi-agent Reinforcement Learning, by Martin Balla et al.
Summary of Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing, by Wei Zhao and Zhe Li and Yige Li and Ye Zhang and Jun Sun
Summary of Unlocking Futures: a Natural Language Driven Career Prediction System For Computer Science and Software Engineering Students, by Sakir Hossain Faruque et al.
Summary of Learning to Detour: Shortcut Mitigating Augmentation For Weakly Supervised Semantic Segmentation, by Junehyoung Kwon et al.
Summary of Active Use Of Latent Constituency Representation in Both Humans and Large Language Models, by Wei Liu et al.
Summary of Utilitarian Algorithm Configuration For Infinite Parameter Spaces, by Devon Graham and Kevin Leyton-brown
Summary of Extreme Value Monte Carlo Tree Search, by Masataro Asai et al.
Summary of Text-only Synthesis For Image Captioning, by Qing Zhou et al.
Summary of Conv-coa: Improving Open-domain Question Answering in Large Language Models Via Conversational Chain-of-action, by Zhenyu Pan et al.
Summary of Getting More Juice Out Of the Sft Data: Reward Learning From Human Demonstration Improves Sft For Llm Alignment, by Jiaxiang Li et al.
Summary of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment, By Xin Xiao et al.
Summary of Mixdq: Memory-efficient Few-step Text-to-image Diffusion Models with Metric-decoupled Mixed Precision Quantization, by Tianchen Zhao et al.
Summary of Arithmetic Reasoning with Llm: Prolog Generation & Permutation, by Xiaocheng Yang et al.
Summary of White-box Multimodal Jailbreaks Against Large Vision-language Models, by Ruofan Wang et al.
Summary of Near-infrared and Low-rank Adaptation Of Vision Transformers in Remote Sensing, by Irem Ulku et al.
Summary of Ov-dquo: Open-vocabulary Detr with Denoising Text Query Training and Open-world Unknown Objects Supervision, by Junjie Wang et al.
Summary of Towards Clinical Ai Fairness: Filling Gaps in the Puzzle, by Mingxuan Liu et al.
Summary of Tool Learning with Large Language Models: a Survey, by Changle Qu et al.
Summary of Learning Shared Rgb-d Fields: Unified Self-supervised Pre-training For Label-efficient Lidar-camera 3d Perception, by Xiaohao Xu et al.
Summary of Proof Of Quality: a Costless Paradigm For Trustless Generative Ai Model Inference on Blockchains, by Zhenjie Zhang et al.
Summary of Self-guiding Exploration For Combinatorial Problems, by Zangir Iklassov and Yali Du and Farkhad Akimov and Martin Takac
Summary of Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives, by Anirudhan Badrinath et al.
Summary of Recent Trends in Personalized Dialogue Generation: a Review Of Datasets, Methodologies, and Evaluations, by Yi-pei Chen et al.
Summary of Modeling Dynamic Topics in Chain-free Fashion by Evolution-tracking Contrastive Learning and Unassociated Word Exclusion, By Xiaobao Wu et al.
Summary of Yuan 2.0-m32: Mixture Of Experts with Attention Router, by Shaohua Wu et al.
Summary of Fastopic: Pretrained Transformer Is a Fast, Adaptive, Stable, and Transferable Topic Model, by Xiaobao Wu et al.
Summary of Velora: Memory Efficient Training Using Rank-1 Sub-token Projections, by Roy Miles et al.
Summary of Fmri Predictors Based on Language Models Of Increasing Complexity Recover Brain Left Lateralization, by Laurent Bonnasse-gahot and Christophe Pallier
Summary of An Nlp Crosswalk Between the Common Core State Standards and Naep Item Specifications, by Gregory Camilli
Summary of Exploring and Steering the Moral Compass Of Large Language Models, by Alejandro Tlaie
Summary of Cost-efficient Knowledge-based Question Answering with Large Language Models, by Junnan Dong et al.
Summary of Mindmerger: Efficient Boosting Llm Reasoning in Non-english Languages, by Zixian Huang et al.
Summary of Gaussianformer: Scene As Gaussians For Vision-based 3d Semantic Occupancy Prediction, by Yuanhui Huang et al.
Summary of Vista: a Generalizable Driving World Model with High Fidelity and Versatile Controllability, by Shenyuan Gao et al.
Summary of Clibd: Bridging Vision and Genomics For Biodiversity Monitoring at Scale, by Zeming Gong et al.
Summary of Biodiscoveryagent: An Ai Agent For Designing Genetic Perturbation Experiments, by Yusuf Roohani et al.
Summary of Tima: Text-image Mutual Awareness For Balancing Zero-shot Adversarial Robustness and Generalization Ability, by Fengji Ma et al.
Summary of Video Enriched Retrieval Augmented Generation Using Aligned Video Captions, by Kevin Dela Rosa
Summary of Clavaddpm: Multi-relational Data Synthesis with Cluster-guided Diffusion Models, by Wei Pang et al.
Summary of The Widening Gap: the Benefits and Harms Of Generative Ai For Novice Programmers, by James Prather et al.
Summary of Lora-switch: Boosting the Efficiency Of Dynamic Llm Adapters Via System-algorithm Co-design, by Rui Kong et al.
Summary of Xl3m: a Training-free Framework For Llm Length Extension Based on Segment-wise Inference, by Shengnan Wang et al.
Summary of On the Sequence Evaluation Based on Stochastic Processes, by Tianhao Zhang et al.
Summary of Transvip: Speech to Speech Translation System with Voice and Isochrony Preservation, by Chenyang Le et al.
Summary of Faintbench: a Holistic and Precise Benchmark For Bias Evaluation in Text-to-image Models, by Hanjun Luo et al.
Summary of Don’t Miss the Forest For the Trees: Attentional Vision Calibration For Large Vision Language Models, by Sangmin Woo et al.
Summary of Diffusion Model Patching Via Mixture-of-prompts, by Seokil Ham et al.
Summary of Ritual: Random Image Transformations As a Universal Anti-hallucination Lever in Large Vision Language Models, by Sangmin Woo et al.
Summary of Tokenunify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction, by Yinda Chen et al.
Summary of Think Before You Act: a Two-stage Framework For Mitigating Gender Bias Towards Vision-language Tasks, by Yunqi Zhang et al.
Summary of A Large Language Model-based Multi-agent Manufacturing System For Intelligent Shopfloor, by Zhen Zhao et al.
Summary of Multiple Heads Are Better Than One: Mixture Of Modality Knowledge Experts For Entity Representation Learning, by Yichi Zhang et al.
Summary of Uncertainty Management in the Construction Of Knowledge Graphs: a Survey, by Lucas Jarnac et al.
Summary of Vocot: Unleashing Visually Grounded Multi-step Reasoning in Large Multi-modal Models, by Zejun Li et al.
Summary of Vision-and-language Navigation Generative Pretrained Transformer, by Wen Hanlin
Summary of Exploring the Llm Journey From Cognition to Expression with Linear Representations, by Yuzi Yan et al.
Summary of Position: Foundation Agents As the Paradigm Shift For Decision Making, by Xiaoqian Liu et al.
Summary of Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization, by Dixuan Wang et al.
Summary of Compositional Few-shot Class-incremental Learning, by Yixiong Zou et al.
Summary of Reflectioncoder: Learning From Reflection Sequence For Enhanced One-off Code Generation, by Houxing Ren et al.
Summary of Leveraging Small Language Models For Text2sparql Tasks to Improve the Resilience Of Ai Assistance, by Felix Brei et al.
Summary of Empowering Character-level Text Infilling by Eliminating Sub-tokens, By Houxing Ren et al.
Summary of Superpixelwise Low-rank Approximation Based Partial Label Learning For Hyperspectral Image Classification, by Shujun Yang et al.
Summary of Llm-optic: Unveiling the Capabilities Of Large Language Models For Universal Visual Grounding, by Haoyu Zhao et al.
Summary of Teii: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection, by Long Cheng et al.