Paper List
-
Summary of Diagnosing and Fixing Common Problems in Bayesian Optimization For Molecule Design, by Austin Tripp et al.
-
Summary of Loss Gradient Gaussian Width Based Generalization and Optimization Guarantees, by Arindam Banerjee et al.
-
Summary of A Concise Mathematical Description Of Active Inference in Discrete Time, by Jesse Van Oostrum et al.
-
Summary of Efficient Parallel Multi-hop Reasoning: a Scalable Approach For Knowledge Graph Analysis, by Jesmin Jahan Tithi and Fabio Checconi and Fabrizio Petrini
-
Summary of Estimating the Hallucination Rate Of Generative AI, by Andrew Jesson and Nicolas Beltran-Velez and Quentin Chu and Sweta Karlekar and Jannik Kossen and Yarin Gal and John P. Cunningham and David Blei
-
Summary of Multimodal Belief Prediction, by John Murzaku et al.
-
Summary of Partially Observed Trajectory Inference Using Optimal Transport and a Dynamics Prior, by Anming Gu et al.
-
Summary of Towards Generalized Hydrological Forecasting Using Transformer Models For 120-hour Streamflow Prediction, by Bekir Z. Demiray and Ibrahim Demir
-
Summary of Comparing Deep Learning Models For Rice Mapping in Bhutan Using High Resolution Satellite Imagery, by Biplov Bhandari et al.
-
Summary of TextGrad: Automatic “differentiation” Via Text, by Mert Yuksekgonul et al.
-
Summary of Flow Map Matching, by Nicholas M. Boffi et al.
-
Summary of Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification, by Yunzhen Feng et al.
-
Summary of Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling, by Liliang Ren et al.
-
Summary of Simple and Effective Masked Diffusion Language Models, by Subham Sekhar Sahoo et al.
-
Summary of Quickllama: Query-aware Inference Acceleration For Large Language Models, by Jingyao Li et al.
-
Summary of Map: Low-compute Model Merging with Amortized Pareto Fronts Via Quadratic Approximation, by Lu Li et al.
-
Summary of Ctrl-x: Controlling Structure and Appearance For Text-to-image Generation Without Guidance, by Kuan Heng Lin et al.
-
Summary of CDSA: Conservative Denoising Score-based Algorithm For Offline Reinforcement Learning, by Zeyuan Liu et al.
-
Summary of Image and Video Tokenization with Binary Spherical Quantization, by Yue Zhao et al.
-
Summary of Situational Awareness Matters in 3D Vision Language Reasoning, by Yunze Man et al.
-
Summary of Reinforcement Learning Based Escape Route Generation in Low Visibility Environments, by Hari Srikanth
-
Summary of Domain-specific ReAct For Physics-integrated Iterative Modeling: a Case Study Of LLM Agents For Gas Path Analysis Of Gas Turbines, by Tao Song and Yuwei Fan and Chenlong Feng and Keyu Song and Chao Liu and Dongxiang Jiang
-
Summary of AI Sandbagging: Language Models Can Strategically Underperform on Evaluations, by Teun Van Der Weij et al.
-
Summary of Deep Implicit Optimization Enables Robust Learnable Features For Deformable Image Registration, by Rohit Jena et al.
-
Summary of World Models with Hints Of Large Language Models For Goal Achieving, by Zeyuan Liu et al.
-
Summary of When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models, by Haoran You et al.
-
Summary of Visual Representation Learning with Stochastic Frame Prediction, by Huiwon Jang et al.
-
Summary of Redefining Automotive Radar Imaging: a Domain-informed 1D Deep Learning Approach For High-resolution and Efficient Performance, by Ruxin Zheng and Shunqiao Sun and Holger Caesar and Honglei Chen and Jian Li
-
Summary of Guiding LLM Temporal Logic Generation with Explicit Separation Of Data and Control, by William Murphy et al.
-
Summary of Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy, by Xiaohan Huang et al.
-
Summary of Private Geometric Median, by Mahdi Haghifam et al.
-
Summary of Accelerating Ill-conditioned Hankel Matrix Recovery Via Structured Newton-like Descent, by Hanqin Cai et al.
-
Summary of Beyond ELBOs: a Large-scale Evaluation Of Variational Methods For Sampling, by Denis Blessing et al.
-
Summary of Enhanced Gene Selection in Single-cell Genomics: Pre-filtering Synergy and Reinforced Optimization, by Weiliang Zhang et al.
-
Summary of Deformtime: Capturing Variable Dependencies with Deformable Attention For Time Series Forecasting, by Yuxuan Shu et al.
-
Summary of Beware Of Aliases — Signal Preservation Is Crucial For Robust Image Restoration, by Shashank Agnihotri and Julia Grabinski and Janis Keuper and Margret Keuper
-
Summary of A Multi-armed Bandit Approach to Online Selection and Evaluation Of Generative Models, by Xiaoyan Hu et al.
-
Summary of Benchmarking Vision-language Contrastive Methods For Medical Representation Learning, by Shuvendu Roy et al.
-
Summary of Reinforcement Learning From Human Feedback Without Reward Inference: Model-free Algorithm and Instance-dependent Analysis, by Qining Zhang et al.
-
Summary of Fkan: Fractional Kolmogorov-arnold Networks with Trainable Jacobi Basis Functions, by Alireza Afzal Aghaei
-
Summary of Deep Learning-based Approach For User Activity Detection with Grant-free Random Access in Cell-free Massive MIMO, by Ali Elkeshawy et al.
-
Summary of TernaryLLM: Ternarized Large Language Model, by Tianqi Chen et al.
-
Summary of A Synthetic Dataset For Personal Attribute Inference, by Hanna Yukhymenko et al.
-
Summary of Semantic-aware Spectrum Sharing in Internet Of Vehicles Based on Deep Reinforcement Learning, by Zhiyu Shao et al.
-
Summary of Improving Autoformalization Using Type Checking, by Auguste Poiroux et al.
-
Summary of OPFData: Large-scale Datasets For AC Optimal Power Flow with Topological Perturbations, by Sean Lovett et al.
-
Summary of Let Go Of Your Labels with Unsupervised Transfer, by Artyom Gadetsky et al.
-
Summary of Marginalization Consistent Mixture Of Separable Flows For Probabilistic Irregular Time Series Forecasting, by Vijaya Krishna Yalavarthi et al.
-
Summary of Hybrid Reinforcement Learning From Offline Observation Alone, by Yuda Song et al.
-
Summary of Scientific Computing with Large Language Models, by Christopher Culver et al.
-
Summary of SemlaFlow — Efficient 3D Molecular Generation with Latent Attention and Equivariant Flow Matching, by Ross Irwin et al.
-
Summary of Active Learning For Affinity Prediction Of Antibodies, by Alexandra Gessner et al.
-
Summary of Joint Learning Of Context and Feedback Embeddings in Spoken Dialogue, by Livia Qian and Gabriel Skantze
-
Summary of Multi-objective Reinforcement Learning From AI Feedback, by Marcus Williams
-
Summary of Bertaqa: How Much Do Language Models Know About Local Culture?, by Julen Etxaniz and Gorka Azkune and Aitor Soroa and Oier Lopez De Lacalle and Mikel Artetxe
-
Summary of Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling, by Constantin Waubert De Puiseau et al.
-
Summary of 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward, by Yuzi Yan et al.
-
Summary of Transferring Knowledge From Large Foundation Models to Small Downstream Models, by Shikai Qiu et al.
-
Summary of DR-RAG: Applying Dynamic Document Relevance to Retrieval-augmented Generation For Question-answering, by Zijian Hei and Weiling Liu and Wenjie Ou and Juyi Qiao and Junming Jiao and Guowen Song and Ting Tian and Yi Lin
-
Summary of DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: a Lyapunov-guided Diffusion-based Reinforcement Learning Approach, by Zhang Liu and Hongyang Du and Junzhe Lin and Zhibin Gao and Lianfen Huang and Seyyedali Hosseinalipour and Dusit Niyato
-
Summary of Decor: Deconfounding Time Series with Robust Regression, by Felix Schur et al.
-
Summary of Moreaupruner: Robust Pruning Of Large Language Models Against Weight Perturbations, by Zixiao Wang et al.
-
Summary of Entropy-reinforced Planning with Large Language Models For Drug Discovery, by Xuefeng Liu et al.
-
Summary of Learning Discrete Latent Variable Structures with Tensor Rank Conditions, by Zhengming Chen et al.
-
Summary of Heterogeneous Learning Rate Scheduling For Neural Architecture Search on Long-tailed Datasets, by Chenxia Tang
-
Summary of Integrating Domain Knowledge For Handling Limited Data in Offline RL, by Briti Gangopadhyay et al.
-
Summary of Multitrust: a Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models, by Yichi Zhang et al.
-
Summary of Reading Miscue Detection in Primary School Through Automatic Speech Recognition, by Lingyun Gao et al.
-
Summary of Efficient Mixture Learning in Black-box Variational Inference, by Alexandra Hotti et al.
-
Summary of Leveraging Large Language Models For Efficient Failure Analysis in Game Development, by Leonardo Marini et al.
-
Summary of D-gril: End-to-end Topological Learning with 2-parameter Persistence, by Soham Mukherjee et al.
-
Summary of Advancing Tool-augmented Large Language Models: Integrating Insights From Errors in Inference Trees, by Sijia Chen et al.
-
Summary of Augmenting Offline RL with Unlabeled Data, by Zhao Wang et al.
-
Summary of Logical Distillation Of Graph Neural Networks, by Alexander Pluska et al.
-
Summary of Identifiable Object-centric Representation Learning Via Probabilistic Slot Attention, by Avinash Kori et al.
-
Summary of Failures Are Fated, but Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-scale Vision and Language Models, by Som Sagar et al.
-
Summary of Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes, by Dan Qiao et al.
-
Summary of Compassdock: Comprehensive Accurate Assessment Approach For Deep Learning-based Molecular Docking in Inference and Fine-tuning, by Ahmet Sarigun et al.
-
Summary of Sample Complexity Reduction Via Policy Difference Estimation in Tabular Reinforcement Learning, by Adhyyan Narang et al.
-
Summary of Flexible Parametric Inference For Space-time Hawkes Processes, by Emilia Siviero et al.
-
Summary of Flux: Fast Software-based Communication Overlap on GPUs Through Kernel Fusion, by Li-Wen Chang et al.
-
Summary of Tokenize Features, Enhancing Tables: the FT-TabPFN Model For Tabular Classification, by Quangao Liu et al.
-
Summary of Transformers Provably Learn Sparse Token Selection While Fully-connected Nets Cannot, by Zixuan Wang et al.
-
Summary of Nonlinear Time-series Embedding by Monotone Variational Inequality, by Jonathan Y. Zhou et al.
-
Summary of On the Limitation Of Kernel Dependence Maximization For Feature Selection, by Keli Liu and Feng Ruan
-
Summary of Training Dynamics Of Nonlinear Contrastive Learning Model in the High Dimensional Limit, by Linghuan Meng et al.
-
Summary of Signmusketeers: An Efficient Multi-stream Approach For Sign Language Translation at Scale, by Shester Gueuwou et al.
-
Summary of Non-autoregressive Personalized Bundle Generation, by Wenchuan Yang et al.
-
Summary of Unleashing the Denoising Capability Of Diffusion Prior For Solving Inverse Problems, by Jiawei Zhang et al.
-
Summary of Low Rank Multi-dictionary Selection at Scale, by Boya Ma et al.
-
Summary of Distributional MIPLIB: a Multi-domain Library For Advancing ML-guided MILP Methods, by Weimin Huang et al.
-
Summary of Generative Lifting Of Multiview to 3D From Unknown Pose: Wrapping NeRF Inside Diffusion, by Xin Yuan et al.