Paper List
We recommend you use the search box as this list is very long.
-
Summary of Utility-directed Conformal Prediction: a Decision-aware Framework For Actionable Uncertainty Quantification, by Santiago Cortes-gomez et al.
-
Summary of Explainable Earth Surface Forecasting Under Extreme Events, by Oscar J. Pellicer-valero et al.
-
Summary of Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-context, by Spencer Frei and Gal Vardi
-
Summary of Openmathinstruct-2: Accelerating Ai For Math with Massive Open-source Instruction Data, by Shubham Toshniwal et al.
-
Summary of Truncated Kernel Stochastic Gradient Descent on Spheres, by Jinhui Bai et al.
-
Summary of Coordinate-based Neural Representation Enabling Zero-shot Learning For 3d Multiparametric Quantitative Mri, by Guoyan Lao et al.
-
Summary of Fake It Until You Break It: on the Adversarial Robustness Of Ai-generated Image Detectors, by Sina Mavali et al.
-
Summary of Learning-augmented Robust Algorithmic Recourse, by Kshitij Kayastha et al.
-
Summary of Dynfrs: An Efficient Framework For Machine Unlearning in Random Forest, by Shurong Wang et al.
-
Summary of Entp: Encoder-only Next Token Prediction, by Ethan Ewer et al.
-
Summary of Upcycling Instruction Tuning From Dense to Mixture-of-experts Via Parameter Merging, by Tingfeng Hui et al.
-
Summary of Automated Red Teaming with Goat: the Generative Offensive Agent Tester, by Maya Pavlova et al.
-
Summary of Drupi: Dataset Reduction Using Privileged Information, by Shaobo Wang et al.
-
Summary of On Using Certified Training Towards Empirical Robustness, by Alessandro De Palma et al.
-
Summary of Does Graph Prompt Work? a Data Operation Perspective with Theoretical Analysis, by Qunzhong Wang et al.
-
Summary of Fira: Can We Achieve Full-rank Training Of Llms Under Low-rank Constraint?, by Xi Chen et al.
-
Summary of Moral Alignment For Llm Agents, by Elizaveta Tennant et al.
-
Summary of On the Adaptation Of Unlimiformer For Decoder-only Transformers, by Kian Ahrabian et al.
-
Summary of Stable Offline Value Function Learning with Bisimulation-based Representations, by Brahma S. Pavse et al.
-
Summary of Shapiq: Shapley Interactions For Machine Learning, by Maximilian Muschalik et al.
-
Summary of Conformal Generative Modeling with Improved Sample Efficiency Through Sequential Greedy Filtering, by Klaus-rudolf Kladny et al.
-
Summary of Sparse Covariance Neural Networks, by Andrea Cavallo et al.
-
Summary of Ensembles Provably Learn Equivariance Through Data Augmentation, by Oskar Nordenfors et al.
-
Summary of Circuit Compositions: Exploring Modular Structures in Transformer-based Language Models, by Philipp Mondorf et al.
-
Summary of Geometric Signatures Of Compositionality Across a Language Model’s Lifetime, by Jin Hwa Lee et al.
-
Summary of Verbalized Graph Representation Learning: a Fully Interpretable Graph Model Based on Large Language Models Throughout the Entire Process, by Xingyu Ji et al.
-
Summary of Selective Aggregation For Low-rank Adaptation in Federated Learning, by Pengxin Guo et al.
-
Summary of From Reward Shaping to Q-shaping: Achieving Unbiased Learning with Llm-guided Knowledge, by Xiefeng Wu
-
Summary of Introducing Flexible Monotone Multiple Choice Item Response Theory Models and Bit Scales, by Joakim Wallmark et al.
-
Summary of One Wave to Explain Them All: a Unifying Perspective on Post-hoc Explainability, by Gabriel Kasmi and Amandine Brunetto and Thomas Fel and Jayneel Parekh
-
Summary of Learnable Expansion Of Graph Operators For Multi-modal Feature Fusion, by Dexuan Ding et al.
-
Summary of Dlp-lora: Efficient Task-specific Lora Fusion with a Dynamic, Lightweight Plugin For Large Language Models, by Yuxuan Zhang et al.
-
Summary of Foldable Supernets: Scalable Merging Of Transformers with Different Initializations and Tasks, by Edan Kinderman et al.
-
Summary of Infinipot: Infinite Context Processing on Memory-constrained Llms, by Minsoo Kim et al.
-
Summary of Harmaug: Effective Data Augmentation For Knowledge Distillation Of Safety Guard Models, by Seanie Lee et al.
-
Summary of Bounds on Lp Errors in Density Ratio Estimation Via F-divergence Loss Functions, by Yoshiaki Kitazawa
-
Summary of Attention Layers Provably Solve Single-location Regression, by Pierre Marion et al.
-
Summary of Edge-preserving Noise For Diffusion Models, by Jente Vandersanden et al.
-
Summary of In-context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks, By Dingzirui Wang et al.
-
Summary of Integrative Decoding: Improve Factuality Via Implicit Self-consistency, by Yi Cheng et al.
-
Summary of Mitigating Copy Bias in In-context Learning Through Neuron Pruning, by Ameen Ali et al.
-
Summary of Speculative Coreset Selection For Task-specific Fine-tuning, by Xiaoyu Zhang et al.
-
Summary of Towards a Law Of Iterated Expectations For Heuristic Estimators, by Paul Christiano et al.
-
Summary of Revisiting Hierarchical Text Classification: Inference and Metrics, by Roman Plaud et al.
-
Summary of Rethinking Gnn Expressive Power Research in the Machine Learning Community: Limitations, Issues, and Corrections, by Guanyu Cui et al.
-
Summary of Sampling From Energy-based Policies Using Diffusion, by Vineet Jain et al.
-
Summary of Fair Class-incremental Learning Using Sample Weighting, by Jaeyoung Park et al.
-
Summary of Forte : Finding Outliers with Representation Typicality Estimation, by Debargha Ganguly et al.
-
Summary of Efficient Learning Of Pomdps with Known Observation Model in Average-reward Setting, by Alessio Russo et al.
-
Summary of Layer Swapping For Zero-shot Cross-lingual Transfer in Large Language Models, by Lucas Bandarkar et al.
-
Summary of Phympgn: Physics-encoded Message Passing Graph Network For Spatiotemporal Pde Systems, by Bocheng Zeng et al.
-
Summary of Flashmask: Efficient and Rich Mask Extension Of Flashattention, by Guoxia Wang et al.
-
Summary of Towards Dynamic Graph Neural Networks with Provably High-order Expressive Power, by Zhe Wang et al.
-
Summary of Flame: Adaptive and Reactive Concept Drift Mitigation For Federated Learning Deployments, by Ioannis Mavromatis and Stefano De Feo and Aftab Khan
-
Summary of On Expressive Power Of Looped Transformers: Theoretical Analysis and Enhancement Via Timestep Encoding, by Kevin Xu and Issei Sato
-
Summary of The Labyrinth Of Links: Navigating the Associative Maze Of Multi-modal Llms, by Hong Li et al.
-
Summary of Fair4free: Generating High-fidelity Fair Synthetic Samples Using Data Free Distillation, by Md Fahim Sikder et al.
-
Summary of The Great Contradiction Showdown: How Jailbreak and Stealth Wrestle in Vision-language Models?, by Ching-chia Kao et al.
-
Summary of Scalable Reinforcement Learning-based Neural Architecture Search, by Amber Cassimon et al.
-
Summary of Adaptive Teachers For Amortized Samplers, by Minsu Kim et al.
-
Summary of Text2pde: Latent Diffusion Models For Accessible Physics Simulation, by Anthony Zhou et al.
-
Summary of A Deep Learning Approach For Imbalanced Tabular Data in Advertiser Prospecting: a Case Of Direct Mail Prospecting, by Sadegh Farhang et al.
-
Summary of Efficient Pac Learning Of Halfspaces with Constant Malicious Noise Rate, by Jie Shen
-
Summary of [re] Network Deconvolution, by Rochana R. Obadage et al.
-
Summary of Stochastic Gradient Descent with Adaptive Data, by Ethan Che and Jing Dong and Xin T. Tong
-
Summary of Debiasing Federated Learning with Correlated Client Participation, by Zhenyu Sun et al.
-
Summary of Were Rnns All We Needed?, by Leo Feng et al.
-
Summary of Absolute State-wise Constrained Policy Optimization: High-probability State-wise Constraints Satisfaction, by Weiye Zhao et al.
-
Summary of Induced Covariance For Causal Discovery in Linear Sparse Structures, by Saeed Mohseni-sehdeh et al.
-
Summary of See Me and Believe Me: Causality and Intersectionality in Testimonial Injustice in Healthcare, by Kenya S. Andrews et al.
-
Summary of Equivariant Score-based Generative Models Provably Learn Distributions with Symmetries Efficiently, by Ziyu Chen et al.
-
Summary of Helpsteer2-preference: Complementing Ratings with Preferences, by Zhilin Wang et al.
-
Summary of Dual Approximation Policy Optimization, by Zhihan Xiong et al.
-
Summary of Revisiting Optimism and Model Complexity in the Wake Of Overparameterized Machine Learning, by Pratik Patil et al.
-
Summary of Improving Fine-grained Control Via Aggregation Of Multiple Diffusion Models, by Conghan Yue et al.
-
Summary of Transformers Handle Endogeneity in In-context Linear Regression, by Haodong Liang et al.
-
Summary of Deep Unlearn: Benchmarking Machine Unlearning, by Xavier F. Cadet et al.
-
Summary of Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Unveiling Ai’s Potential Through Tools, Techniques, and Applications, by Pohsun Feng et al.
-
Summary of Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models, by Can Demircan et al.
-
Summary of Uncertainty-aware Human Mobility Modeling and Anomaly Detection, by Haomin Wen et al.
-
Summary of Tackling the Accuracy-interpretability Trade-off in a Hierarchy Of Machine Learning Models For the Prediction Of Extreme Heatwaves, by Alessandro Lovo et al.
-
Summary of Back to Bayesics: Uncovering Human Mobility Distributions and Anomalies with An Integrated Statistical and Neural Framework, by Minxuan Duan et al.
-
Summary of Cktgen: Specification-conditioned Analog Circuit Generation, by Yuxuan Hou et al.
-
Summary of Investigating the Synergistic Effects Of Dropout and Residual Connections on Language Model Training, by Qingyang Li and Weimao Ke
-
Summary of Don’t Stop Me Now: Embedding Based Scheduling For Llms, by Rana Shahout et al.
-
Summary of Gptreeo: An R Package For Continual Regression with Dividing Local Gaussian Processes, by Timo Braun et al.
-
Summary of Convergent Privacy Loss Of Noisy-sgd Without Convexity and Smoothness, by Eli Chien et al.
-
Summary of Spherical Analysis Of Learning Nonlinear Functionals, by Zhenyu Yang et al.
-
Summary of Structure-preserving Operator Learning, by Nacime Bouziani et al.
-
Summary of Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-time, by Chiao-an Yang et al.
-
Summary of An Introduction to Deep Survival Analysis Models For Predicting Time-to-event Outcomes, by George H. Chen
-
Summary of Efficient and Private Marginal Reconstruction with Local Non-negativity, by Brett Mullins et al.
-
Summary of Statistical Inference on Black-box Generative Models in the Data Kernel Perspective Space, by Hayden Helm and Aranyak Acharyya and Brandon Duderstadt and Youngser Park and Carey E. Priebe
-
Summary of Exploiting Structure in Offline Multi-agent Rl: the Benefits Of Low Interaction Rank, by Wenhao Zhan et al.
-
Summary of Almost Free: Self-concordance in Natural Exponential Families and An Application to Bandits, by Shuai Liu et al.
-
Summary of Using Interleaved Ensemble Unlearning to Keep Backdoors at Bay For Finetuning Vision Transformers, by Zeyu Michael Li
-
Summary of Ngpt: Normalized Transformer with Representation Learning on the Hypersphere, by Ilya Loshchilov et al.