Paper List
We recommend you use the search box as this list is very long.
-
Summary of Unipts: a Unified Framework For Proficient Post-training Sparsity, by Jingjing Xie et al.
-
Summary of Why Reinforcement Learning in Energy Systems Needs Explanations, by Hallah Shahid Butt et al.
-
Summary of Letsmap: Unsupervised Representation Learning For Semantic Bev Mapping, by Nikhil Gosala et al.
-
Summary of Competevo: Towards Morphological Evolution From Competition, by Kangyao Huang et al.
-
Summary of Dsdl: Data Set Description Language For Bridging Modalities and Tasks in Ai Data, by Bin Wang et al.
-
Summary of Self-supervised Learning Based Handwriting Verification, by Mihir Chauhan et al.
-
Summary of Sce-mae: Selective Correspondence Enhancement with Masked Autoencoder For Self-supervised Landmark Estimation, by Kejia Yin et al.
-
Summary of Intelligent Clinical Documentation: Harnessing Generative Ai For Patient-centric Clinical Note Generation, by Anjanava Biswas et al.
-
Summary of The Battle Of Llms: a Comparative Study in Conversational Qa Tasks, by Aryan Rangapur et al.
-
Summary of Frustratingly Easy Test-time Adaptation Of Vision-language Models, by Matteo Farina et al.
-
Summary of Llama-nas: Efficient Neural Architecture Search For Large Language Models, by Anthony Sarah et al.
-
Summary of A Review and Implementation Of Object Detection Models and Optimizations For Real-time Medical Mask Detection During the Covid-19 Pandemic, by Ioanna Gogou et al.
-
Summary of Widin: Wording Image For Domain-invariant Representation in Single-source Domain Generalization, by Jiawei Ma et al.
-
Summary of Raccoon: a Versatile Instructional Video Editing Framework with Auto-generated Narratives, by Jaehong Yoon et al.
-
Summary of Vig: Linear-complexity Visual Sequence Learning with Gated Linear Attention, by Bencheng Liao et al.
-
Summary of Gflow: Recovering 4d World From Monocular Video, by Shizun Wang et al.
-
Summary of Dig: Scalable and Efficient Diffusion Models with Gated Linear Attention, by Lianghui Zhu et al.
-
Summary of Llms and Memorization: on Quality and Specificity Of Copyright Compliance, by Felix B Mueller et al.
-
Summary of Improved Emotional Alignment Of Ai and Humans: Human Ratings Of Emotions Expressed by Stable Diffusion V1, Dall-e 2, and Dall-e 3, By James Derek Lomas et al.
-
Summary of Mm-mixing: Multi-modal Mixing Alignment For 3d Understanding, by Jiaze Wang et al.
-
Summary of Faiir: Building Toward a Conversational Ai Agent Assistant For Youth Mental Health Service Provision, by Stephen Obadinma et al.
-
Summary of Unleashing the Potential Of Text-attributed Graphs: Automatic Relation Decomposition Via Large Language Models, by Hyunjin Seo et al.
-
Summary of Chatgpt As the Marketplace Of Ideas: Should Truth-seeking Be the Goal Of Ai Content Governance?, by Jiawei Zhang
-
Summary of Mavin: Multi-action Video Generation with Diffusion Models Via Transition Video Infilling, by Bowen Zhang et al.
-
Summary of On Creativity and Open-endedness, by L. B. Soros et al.
-
Summary of Where’s Waldo: Diffusion Features For Personalized Segmentation and Retrieval, by Dvir Samuel et al.
-
Summary of Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model, by Wenbing Li et al.
-
Summary of Edinburgh Clinical Nlp at Mediqa-corr 2024: Guiding Large Language Models with Hints, by Aryo Pradipta Gema et al.
-
Summary of Automated Real-world Sustainability Data Generation From Images Of Buildings, by Peter J Bentley et al.
-
Summary of Effovpr: Effective Foundation Model Utilization For Visual Place Recognition, by Issar Tzachor et al.
-
Summary of Towards Dialogues For Joint Human-ai Reasoning and Value Alignment, by Elfia Bezou-vrakatseli and Oana Cocarascu and Sanjay Modgil
-
Summary of Llm Experiments with Simulation: Large Language Model Multi-agent System For Simulation Model Parametrization in Digital Twins, by Yuchen Xia et al.
-
Summary of A Unified Temporal Knowledge Graph Reasoning Model Towards Interpolation and Extrapolation, by Kai Chen et al.
-
Summary of Facilitating Multi-role and Multi-behavior Collaboration Of Large Language Models For Online Job Seeking and Recruiting, by Hongda Sun et al.
-
Summary of An Agent Design with Goal Reaching Guarantees For Enhancement Of Learning, by Pavel Osinenko et al.
-
Summary of Pytag: Tabletop Games For Multi-agent Reinforcement Learning, by Martin Balla et al.
-
Summary of Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing, by Wei Zhao and Zhe Li and Yige Li and Ye Zhang and Jun Sun
-
Summary of Unlocking Futures: a Natural Language Driven Career Prediction System For Computer Science and Software Engineering Students, by Sakir Hossain Faruque et al.
-
Summary of Learning to Detour: Shortcut Mitigating Augmentation For Weakly Supervised Semantic Segmentation, by Junehyoung Kwon et al.
-
Summary of Active Use Of Latent Constituency Representation in Both Humans and Large Language Models, by Wei Liu et al.
-
Summary of Utilitarian Algorithm Configuration For Infinite Parameter Spaces, by Devon Graham and Kevin Leyton-brown
-
Summary of Extreme Value Monte Carlo Tree Search, by Masataro Asai et al.
-
Summary of Text-only Synthesis For Image Captioning, by Qing Zhou et al.
-
Summary of Conv-coa: Improving Open-domain Question Answering in Large Language Models Via Conversational Chain-of-action, by Zhenyu Pan et al.
-
Summary of Getting More Juice Out Of the Sft Data: Reward Learning From Human Demonstration Improves Sft For Llm Alignment, by Jiaxiang Li et al.
-
Summary of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment, By Xin Xiao et al.
-
Summary of Arithmetic Reasoning with Llm: Prolog Generation & Permutation, by Xiaocheng Yang et al.
-
Summary of White-box Multimodal Jailbreaks Against Large Vision-language Models, by Ruofan Wang et al.
-
Summary of Near-infrared and Low-rank Adaptation Of Vision Transformers in Remote Sensing, by Irem Ulku et al.
-
Summary of Ov-dquo: Open-vocabulary Detr with Denoising Text Query Training and Open-world Unknown Objects Supervision, by Junjie Wang et al.
-
Summary of Towards Clinical Ai Fairness: Filling Gaps in the Puzzle, by Mingxuan Liu et al.
-
Summary of Tool Learning with Large Language Models: a Survey, by Changle Qu et al.
-
Summary of Learning Shared Rgb-d Fields: Unified Self-supervised Pre-training For Label-efficient Lidar-camera 3d Perception, by Xiaohao Xu et al.
-
Summary of Proof Of Quality: a Costless Paradigm For Trustless Generative Ai Model Inference on Blockchains, by Zhenjie Zhang et al.
-
Summary of Self-guiding Exploration For Combinatorial Problems, by Zangir Iklassov and Yali Du and Farkhad Akimov and Martin Takac
-
Summary of Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives, by Anirudhan Badrinath et al.
-
Summary of Recent Trends in Personalized Dialogue Generation: a Review Of Datasets, Methodologies, and Evaluations, by Yi-pei Chen et al.
-
Summary of Modeling Dynamic Topics in Chain-free Fashion by Evolution-tracking Contrastive Learning and Unassociated Word Exclusion, By Xiaobao Wu et al.
-
Summary of Yuan 2.0-m32: Mixture Of Experts with Attention Router, by Shaohua Wu et al.
-
Summary of Velora: Memory Efficient Training Using Rank-1 Sub-token Projections, by Roy Miles et al.
-
Summary of Fmri Predictors Based on Language Models Of Increasing Complexity Recover Brain Left Lateralization, by Laurent Bonnasse-gahot and Christophe Pallier
-
Summary of Exploring and Steering the Moral Compass Of Large Language Models, by Alejandro Tlaie
-
Summary of Cost-efficient Knowledge-based Question Answering with Large Language Models, by Junnan Dong et al.
-
Summary of Gaussianformer: Scene As Gaussians For Vision-based 3d Semantic Occupancy Prediction, by Yuanhui Huang et al.
-
Summary of Vista: a Generalizable Driving World Model with High Fidelity and Versatile Controllability, by Shenyuan Gao et al.
-
Summary of Biodiscoveryagent: An Ai Agent For Designing Genetic Perturbation Experiments, by Yusuf Roohani et al.
-
Summary of Tima: Text-image Mutual Awareness For Balancing Zero-shot Adversarial Robustness and Generalization Ability, by Fengji Ma et al.
-
Summary of Video Enriched Retrieval Augmented Generation Using Aligned Video Captions, by Kevin Dela Rosa
-
Summary of Clavaddpm: Multi-relational Data Synthesis with Cluster-guided Diffusion Models, by Wei Pang et al.
-
Summary of The Widening Gap: the Benefits and Harms Of Generative Ai For Novice Programmers, by James Prather et al.
-
Summary of Lora-switch: Boosting the Efficiency Of Dynamic Llm Adapters Via System-algorithm Co-design, by Rui Kong et al.
-
Summary of Xl3m: a Training-free Framework For Llm Length Extension Based on Segment-wise Inference, by Shengnan Wang et al.
-
Summary of On the Sequence Evaluation Based on Stochastic Processes, by Tianhao Zhang et al.
-
Summary of Transvip: Speech to Speech Translation System with Voice and Isochrony Preservation, by Chenyang Le et al.
-
Summary of Faintbench: a Holistic and Precise Benchmark For Bias Evaluation in Text-to-image Models, by Hanjun Luo et al.
-
Summary of Don’t Miss the Forest For the Trees: Attentional Vision Calibration For Large Vision Language Models, by Sangmin Woo et al.
-
Summary of Diffusion Model Patching Via Mixture-of-prompts, by Seokil Ham et al.
-
Summary of Ritual: Random Image Transformations As a Universal Anti-hallucination Lever in Large Vision Language Models, by Sangmin Woo et al.
-
Summary of Tokenunify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction, by Yinda Chen et al.
-
Summary of Think Before You Act: a Two-stage Framework For Mitigating Gender Bias Towards Vision-language Tasks, by Yunqi Zhang et al.
-
Summary of A Large Language Model-based Multi-agent Manufacturing System For Intelligent Shopfloor, by Zhen Zhao et al.
-
Summary of Multiple Heads Are Better Than One: Mixture Of Modality Knowledge Experts For Entity Representation Learning, by Yichi Zhang et al.
-
Summary of Vocot: Unleashing Visually Grounded Multi-step Reasoning in Large Multi-modal Models, by Zejun Li et al.
-
Summary of Vision-and-language Navigation Generative Pretrained Transformer, by Wen Hanlin
-
Summary of Exploring the Llm Journey From Cognition to Expression with Linear Representations, by Yuzi Yan et al.
-
Summary of Position: Foundation Agents As the Paradigm Shift For Decision Making, by Xiaoqian Liu et al.
-
Summary of Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization, by Dixuan Wang et al.
-
Summary of Compositional Few-shot Class-incremental Learning, by Yixiong Zou et al.
-
Summary of Reflectioncoder: Learning From Reflection Sequence For Enhanced One-off Code Generation, by Houxing Ren et al.
-
Summary of Leveraging Small Language Models For Text2sparql Tasks to Improve the Resilience Of Ai Assistance, by Felix Brei et al.
-
Summary of Empowering Character-level Text Infilling by Eliminating Sub-tokens, By Houxing Ren et al.
-
Summary of Superpixelwise Low-rank Approximation Based Partial Label Learning For Hyperspectral Image Classification, by Shujun Yang et al.
-
Summary of Llm-optic: Unveiling the Capabilities Of Large Language Models For Universal Visual Grounding, by Haoyu Zhao et al.
-
Summary of Teii: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection, by Long Cheng et al.