Paper List
We recommend you use the search box as this list is very long.
-
Summary of Cross-domain Policy Adaptation by Capturing Representation Mismatch, By Jiafei Lyu et al.
-
Summary of Efficient Recurrent Off-policy Rl Requires a Context-encoder-specific Learning Rate, by Fan-ming Luo et al.
-
Summary of Reshuffling Resampling Splits Can Improve Generalization Of Hyperparameter Optimization, by Thomas Nagler et al.
-
Summary of Beyond Canonicalization: How Tensorial Messages Improve Equivariant Message Passing, by Peter Lippmann et al.
-
Summary of Ags-gnn: Attribute-guided Sampling For Graph Neural Networks, by Siddhartha Shankar Das et al.
-
Summary of Ivideogpt: Interactive Videogpts Are Scalable World Models, by Jialong Wu et al.
-
Summary of Learning From True-false Labels Via Multi-modal Prompt Retrieving, by Zhongnian Li et al.
-
Summary of Cardinality Estimation on Hyper-relational Knowledge Graphs, by Fei Teng et al.
-
Summary of Rethinking Debiasing: Real-world Bias Analysis and Mitigation, by Peng Kuang et al.
-
Summary of Adversarial Attacks on Hidden Tasks in Multi-task Learning, by Yu Zhe et al.
-
Summary of Cooperative Backdoor Attack in Decentralized Reinforcement Learning with Theoretical Guarantee, by Mengtong Gao et al.
-
Summary of Learning Antenna Pointing Correction in Operations: Efficient Calibration Of a Black Box, by Leif Bergerhoff
-
Summary of Accelerating 3d Molecule Generation Via Jointly Geometric Optimal Transport, by Haokai Hong et al.
-
Summary of Novel Kernel Models and Exact Representor Theory For Neural Networks Beyond the Over-parameterized Regime, by Alistair Shilton et al.
-
Summary of Ftmixer: Frequency and Time Domain Representations Fusion For Time Series Modeling, by Zhengnan Li et al.
-
Summary of Paramrel: Learning Parameter Space Representation Via Progressively Encoding Bayesian Flow Networks, by Zhangkai Wu et al.
-
Summary of Bdetclip: Multimodal Prompting Contrastive Test-time Backdoor Detection, by Yuwei Niu et al.
-
Summary of Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders, by Qichao Shentu et al.
-
Summary of Prompt Tuning Strikes Back: Customizing Foundation Models with Low-rank Prompt Adaptation, by Abhinav Jain et al.
-
Summary of Minimizing Ucb: a Better Local Search Strategy in Local Bayesian Optimization, by Zheyi Fan et al.
-
Summary of Semi-supervised Learning Guided by the Generalized Bayes Rule Under Soft Revision, By Stefan Dietrich et al.
-
Summary of Towards a Probabilistic Fusion Approach For Robust Battery Prognostics, by Jokin Alcibar et al.
-
Summary of The Buffer Mechanism For Multi-step Information Reasoning in Language Models, by Zhiwei Wang et al.
-
Summary of Rankability-enhanced Revenue Uplift Modeling Framework For Online Marketing, by Bowei He et al.
-
Summary of Certified Inventory Control Of Critical Resources, by Ludvig Hult and Dave Zachariah and Petre Stoica
-
Summary of Conformal Classification with Equalized Coverage For Adaptively Selected Groups, by Yanfei Zhou et al.
-
Summary of Is Algorithmic Stability Testable? a Unified Framework Under Computational Constraints, by Yuetian Luo and Rina Foygel Barber
-
Summary of Towards Better Understanding Of In-context Learning Ability From In-context Uncertainty Quantification, by Shang Liu et al.
-
Summary of Quantifying the Gain in Weak-to-strong Generalization, by Moses Charikar et al.
-
Summary of Bayesian Optimization Of Functions Over Node Subsets in Graphs, by Huidong Liang et al.
-
Summary of A Counterfactual Analysis Of the Dishonest Casino, by Martin Haugh and Raghav Singal
-
Summary of Scaling Law For Time Series Forecasting, by Jingzhe Shi et al.
-
Summary of Beyond the Noise: Intrinsic Dimension Estimation with Optimal Neighbourhood Identification, by Antonio Di Noia et al.
-
Summary of Exploring the Evolution Of Hidden Activations with Live-update Visualization, by Xianglin Yang et al.
-
Summary of Better Membership Inference Privacy Measurement Through Discrepancy, by Ruihan Wu et al.
-
Summary of Intelligent Go-explore: Standing on the Shoulders Of Giant Foundation Models, by Cong Lu et al.
-
Summary of Enhancing Learning with Label Differential Privacy by Vector Approximation, By Puning Zhao et al.
-
Summary of Online Prompt Pricing Based on Combinatorial Multi-armed Bandit and Hierarchical Stackelberg Game, by Meiling Li et al.
-
Summary of Prodag: Projection-induced Variational Inference For Directed Acyclic Graphs, by Ryan Thompson et al.
-
Summary of Diffusion Actor-critic with Entropy Regulator, by Yinuo Wang et al.
-
Summary of Learning the Distribution Map in Reverse Causal Performative Prediction, by Daniele Bracale et al.
-
Summary of Extracting Heuristics From Large Language Models For Reward Shaping in Reinforcement Learning, by Siddhant Bhambri et al.
-
Summary of Indexed Minimum Empirical Divergence-based Algorithms For Linear Bandits, by Jie Bian and Vincent Y. F. Tan
-
Summary of Denoising Lm: Pushing the Limits Of Error Correction Models For Speech Recognition, by Zijin Gu et al.
-
Summary of Extracting Prompts by Inverting Llm Outputs, By Collin Zhang et al.
-
Summary of Fast Inference with Kronecker-sparse Matrices, by Antoine Gonon et al.
-
Summary of What Variables Affect Out-of-distribution Generalization in Pretrained Models?, by Md Yousuf Harun et al.
-
Summary of Adjointdeis: Efficient Gradients For Diffusion Models, by Zander W. Blasingame and Chen Liu
-
Summary of Oac: Output-adaptive Calibration For Accurate Post-training Quantization, by Ali Edalati (1) et al.
-
Summary of Amortized Nonmyopic Active Search Via Deep Imitation Learning, by Quan Nguyen et al.
-
Summary of Ceebert: Cross-domain Inference in Early Exit Bert, by Divya Jyoti Bajpai and Manjesh Kumar Hanawal
-
Summary of Credal Wrapper Of Model Averaging For Uncertainty Estimation on Out-of-distribution Detection, by Kaizheng Wang et al.
-
Summary of Reinforcement Learning For Infinite-horizon Average-reward Linear Mdps Via Approximation by Discounted-reward Mdps, By Kihyuk Hong et al.
-
Summary of Revisiting Moe and Dense Speed-accuracy Comparisons For Llm Training, by Xianzhi Du et al.
-
Summary of Elastogen: 4d Generative Elastodynamics, by Yutao Feng et al.
-
Summary of Message-passing Monte Carlo: Generating Low-discrepancy Point Sets Via Graph Neural Networks, by T. Konstantin Rusch et al.
-
Summary of Model-agnostic Utility-preserving Biometric Information Anonymization, by Chun-fu Chen et al.
-
Summary of A Classification Model Based on a Population Of Hypergraphs, by Samuel Barton et al.
-
Summary of Direct Preference Optimization with Unobserved Preference Heterogeneity, by Keertana Chidambaram et al.
-
Summary of 4+3 Phases Of Compute-optimal Neural Scaling Laws, by Elliot Paquette et al.
-
Summary of A Survey Of Distributed Learning in Cloud, Mobile, and Edge Settings, by Madison Threadgill et al.
-
Summary of Distributed Harmonization: Federated Clustered Batch Effect Adjustment and Generalization, by Bao Hoang et al.
-
Summary of Pure Exploration For Constrained Best Mixed Arm Identification with a Fixed Budget, by Dengwang Tang et al.
-
Summary of Ultra-mc: a Unified Approach to Learning Mixtures Of Markov Chains Via Hitting Times, by Fabian Spaeh et al.
-
Summary of Revisiting Day-ahead Electricity Price: Simple Model Save Millions, by Linian Wang et al.
-
Summary of Detail: Task Demonstration Attribution For Interpretable In-context Learning, by Zijian Zhou et al.
-
Summary of Bimix: a Bivariate Data Mixing Law For Language Model Pretraining, by Ce Ge et al.
-
Summary of Slim-llm: Salience-driven Mixed-precision Quantization For Large Language Models, by Wei Huang et al.
-
Summary of Analogcoder: Analog Circuit Design Via Training-free Code Generation, by Yao Lai et al.
-
Summary of How Does Bayes Error Limit Probabilistic Robust Accuracy, by Ruihan Zhang and Jun Sun
-
Summary of Mallowspo: Fine-tune Your Llm with Preference Dispersions, by Haoxian Chen et al.
-
Summary of Fast Bayesian Inference For Neutrino Non-standard Interactions at Dark Matter Direct Detection Experiments, by Dorian W. P. Amaral et al.
-
Summary of Interpretable and Editable Programmatic Tree Policies For Reinforcement Learning, by Hector Kohler et al.
-
Summary of Understanding the Dynamics Of the Frequency Bias in Neural Networks, by Juan Molina et al.
-
Summary of Sfddm: Single-fold Distillation For Diffusion Models, by Chi Hong et al.
-
Summary of Efficiently Training Deep-learning Parametric Policies Using Lagrangian Duality, by Andrew Rosemberg et al.
-
Summary of Mass: Multi-attribute Selective Suppression For Utility-preserving Data Transformation From An Information-theoretic Perspective, by Yizhuo Chen et al.
-
Summary of In-context Time Series Predictor, by Jiecheng Lu et al.
-
Summary of Hand Bone Age Estimation Using Divide and Conquer Strategy and Lightweight Convolutional Neural Networks, by Amin Ahmadi Kasani et al.
-
Summary of Linking In-context Learning in Transformers to Human Episodic Memory, by Li Ji-an et al.
-
Summary of Private Regression Via Data-dependent Sufficient Statistic Perturbation, by Cecilia Ferrando et al.
-
Summary of A Rescaling-invariant Lipschitz Bound Based on Path-metrics For Modern Relu Network Parameterizations, by Antoine Gonon et al.
-
Summary of Re-adapt: Reverse Engineered Adaptation Of Large Language Models, by William Fleshman and Benjamin Van Durme
-
Summary of Parameter-free Clipped Gradient Descent Meets Polyak, by Yuki Takezawa et al.
-
Summary of Editworld: Simulating World Dynamics For Instruction-following Image Editing, by Ling Yang et al.
-
Summary of Didi: Diffusion-guided Diversity For Offline Behavioral Generation, by Jinxin Liu et al.
-
Summary of Recurrent Early Exits For Federated Learning with Heterogeneous Clients, by Royson Lee et al.
-
Summary of Implicit Personalization in Language Models: a Systematic Study, by Zhijing Jin et al.
-
Summary of Scalable Optimization in the Modular Norm, by Tim Large and Yang Liu and Minyoung Huh and Hyojin Bahng and Phillip Isola and Jeremy Bernstein
-
Summary of Pagoda: Progressive Growing Of a One-step Generator From a Low-resolution Diffusion Teacher, by Dongjun Kim et al.
-
Summary of From Explicit Cot to Implicit Cot: Learning to Internalize Cot Step by Step, By Yuntian Deng et al.
-
Summary of Analysis Of Atom-level Pretraining with Quantum Mechanics (qm) Data For Graph Neural Networks Molecular Property Models, by Jose Arjona-medina and Ramil Nugmanov
-
Summary of Differentiable Annealed Importance Sampling Minimizes the Symmetrized Kullback-leibler Divergence Between Initial and Target Distribution, by Johannes Zenn and Robert Bamler
-
Summary of Local Causal Discovery For Structural Evidence Of Direct Discrimination, by Jacqueline Maasch et al.
-
Summary of Privileged Sensing Scaffolds Reinforcement Learning, by Edward S. Hu et al.
-
Summary of Pv-tuning: Beyond Straight-through Estimation For Extreme Llm Compression, by Vladimir Malinovskii et al.
-
Summary of Conditional Diffusion on Web-scale Image Pairs Leads to Diverse Image Variations, by Manoj Kumar et al.
-
Summary of Terdit: Ternary Diffusion Models with Transformers, by Xudong Lu et al.
-
Summary of Not All Language Model Features Are One-dimensionally Linear, by Joshua Engels et al.
-
Summary of A Nurse Is Blue and Elephant Is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns, by Asaf Yehudai et al.