Paper List
We recommend you use the search box as this list is very long.
-
Summary of Clustering Ensemble Algorithm with High-order Consistency Learning, by Jianwen Gan et al.
-
Summary of A Systematic Review Of Neurips Dataset Management Practices, by Yiwei Wu et al.
-
Summary of Unsupervised Feature Selection Algorithm Based on Graph Filtering and Self-representation, by Yunhui Liang et al.
-
Summary of Linear Chain Transformation: Expanding Optimization Dynamics For Fine-tuning Large Language Models, by Yulong Wang and Chang Zuo and Yin Xuan and Hong Li and Ni Wei
-
Summary of Preserving Pre-trained Representation Space: on Effectiveness Of Prefix-tuning For Large Multi-modal Models, by Donghoon Kim et al.
-
Summary of Mimic-iv-ext-pe: Using a Large Language Model to Predict Pulmonary Embolism Phenotype in the Mimic-iv Dataset, by B. D. Lam et al.
-
Summary of Interpretable Language Modeling Via Induction-head Ngram Models, by Eunji Kim et al.
-
Summary of Label Noise: Ignorance Is Bliss, by Yilun Zhu et al.
-
Summary of Rps: a Generic Reservoir Patterns Sampler, by Lamine Diop and Marc Plantevit and Arnaud Soulet
-
Summary of Prospective Learning: Learning For a Dynamic Future, by Ashwin De Silva et al.
-
Summary of Lagrangian Neural Networks For Nonholonomic Mechanics, by Viviana Alejandra Diaz et al.
-
Summary of Beyond Accuracy: Ensuring Correct Predictions with Correct Rationales, by Tang Li et al.
-
Summary of Derivative-free Optimization Via Finite Difference Approximation: An Experimental Study, by Wang Du-yi and Liang Guo and Liu Guangwu and Zhang Kun
-
Summary of A Geometric Framework For Understanding Memorization in Generative Models, by Brendan Leigh Ross et al.
-
Summary of Llm-inference-bench: Inference Benchmarking Of Large Language Models on Ai Accelerators, by Krishna Teja Chitty-venkata et al.
-
Summary of Learning Local Discrete Features in Explainable-by-design Convolutional Neural Networks, by Pantelis I. Kaplanoglou et al.
-
Summary of Mutual Information Preserving Neural Network Pruning, by Charles Westphal et al.
-
Summary of Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models, by Haritz Puerto et al.
-
Summary of Using Deep Neural Networks to Quantify Parking Dwell Time, by Marcelo Eduardo Marques Ribas (1) et al.
-
Summary of Residual Deep Gaussian Processes on Manifolds, by Kacper Wyrwal et al.
-
Summary of Psl: Rethinking and Improving Softmax Loss From Pairwise Perspective For Recommendation, by Weiqin Yang et al.
-
Summary of Earl-bo: Reinforcement Learning For Multi-step Lookahead, High-dimensional Bayesian Optimization, by Mujin Cheon and Jay H. Lee and Dong-yeun Koh and Calvin Tsay
-
Summary of Apebench: a Benchmark For Autoregressive Neural Emulators Of Pdes, by Felix Koehler et al.
-
Summary of Reinforcement Learning Gradients As Vitamin For Online Finetuning Decision Transformers, by Kai Yan et al.
-
Summary of Aidovecl: Ai-generated Dataset Of Outpainted Vehicles For Eye-level Classification and Localization, by Amir Kazemi et al.
-
Summary of Q-learning For Quantile Mdps: a Decomposition, Performance, and Convergence Analysis, by Jia Lin Hau et al.
-
Summary of Projected Random Forests and Conformal Prediction Of Circular Data, by Paulo C. Marques F. et al.
-
Summary of Dense Associative Memory Through the Lens Of Random Features, by Benjamin Hoover et al.
-
Summary of Approaches to Human Activity Recognition Via Passive Radar, by Christian Bresciani et al.
-
Summary of Conformalized Prediction Of Post-fault Voltage Trajectories Using Pre-trained and Finetuned Attention-driven Neural Operators, by Amirhossein Mollaali et al.
-
Summary of The Importance Of Being Scalable: Improving the Speed and Accuracy Of Neural Network Interatomic Potentials Across Chemical Domains, by Eric Qu et al.
-
Summary of Ar-pro: Counterfactual Explanations For Anomaly Repair with Formal Properties, by Xiayan Ji et al.
-
Summary of Group Crosscoders For Mechanistic Analysis Of Symmetry, by Liv Gorton
-
Summary of Understanding Optimization in Deep Learning with Central Flows, by Jeremy M. Cohen and Alex Damian and Ameet Talwalkar and Zico Kolter and Jason D. Lee
-
Summary of Selfcodealign: Self-alignment For Code Generation, by Yuxiang Wei et al.
-
Summary of Tabm: Advancing Tabular Deep Learning with Parameter-efficient Ensembling, by Yury Gorishniy and Akim Kotelnikov and Artem Babenko
-
Summary of Arq: a Mixed-precision Quantization Framework For Accurate and Certifiably Robust Dnns, by Yuchen Yang et al.
-
Summary of Caadam: Improving Adam Optimizer Using Connection Aware Methods, by Remi Genet and Hugo Inzirillo
-
Summary of Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity Of Language Use, by Jiajun Xi et al.
-
Summary of Bridging Geometric States Via Geometric Diffusion Bridge, by Shengjie Luo et al.
-
Summary of Robust Gaussian Processes Via Relevance Pursuit, by Sebastian Ament et al.
-
Summary of Unsupervised Training Of Diffusion Models For Feasible Solution Generation in Neural Combinatorial Optimization, by Seong-hyun Hong et al.
-
Summary of Synergizing Llm Agents and Knowledge Graph For Socioeconomic Prediction in Lbsn, by Zhilun Zhou et al.
-
Summary of Diffusion Twigs with Loop Guidance For Conditional Graph Generation, by Giangiacomo Mercatali et al.
-
Summary of Approximate Attention with Mlp: a Pruning Strategy For Attention-based Model in Multivariate Time Series Forecasting, by Suhan Guo et al.
-
Summary of Bayesian-guided Label Mapping For Visual Reprogramming, by Chengyi Cai et al.
-
Summary of Maximum Entropy Hindsight Experience Replay, by Douglas C. Crowder et al.
-
Summary of Adaflow: Opportunistic Inference on Asynchronous Mobile Data with Generalized Affinity Control, by Fenmin Wu et al.
-
Summary of Clustering Head: a Visual Case Study Of the Training Dynamics in Transformers, by Ambroise Odonnat et al.
-
Summary of Joint Training For Selective Prediction, by Zhaohui Li et al.
-
Summary of Eigenvi: Score-based Variational Inference with Orthogonal Function Expansions, by Diana Cai et al.
-
Summary of Advanced Predictive Quality Assessment For Ultrasonic Additive Manufacturing with Deep Learning Model, by Lokendra Poudel et al.
-
Summary of Identifying General Mechanism Shifts in Linear Causal Representations, by Tianyu Chen et al.
-
Summary of Understanding Generalizability Of Diffusion Models Requires Rethinking the Hidden Gaussian Structure, by Xiang Li et al.
-
Summary of Dynamical Similarity Analysis Can Identify Compositional Dynamics Developing in Rnns, by Quentin Guilhot et al.
-
Summary of Local Linearity: the Key For No-regret Reinforcement Learning in Continuous Mdps, by Davide Maran et al.
-
Summary of Hamiltonian Monte Carlo Inference Of Marginalized Linear Mixed-effects Models, by Jinlin Lai et al.
-
Summary of In-context Fine-tuning For Time-series Foundation Models, by Abhimanyu Das et al.
-
Summary of Demystifying Linear Mdps and Novel Dynamics Aggregation Framework, by Joongkyu Lee et al.
-
Summary of Benchmark Data Repositories For Better Benchmarking, by Rachel Longjohn et al.
-
Summary of Matchmaker: Self-improving Large Language Model Programs For Schema Matching, by Nabeel Seedat et al.
-
Summary of On Sampling Strategies For Spectral Model Sharding, by Denis Korzhenkov and Christos Louizos
-
Summary of Directly Optimizing Explanations For Desired Properties, by Hiwot Belay Tadesse et al.
-
Summary of ‘no’ Matters: Out-of-distribution Detection in Multimodality Long Dialogue, by Rena Gao and Xuetong Wu and Siwen Luo and Caren Han and Feng Liu
-
Summary of Failure Modes Of Llms For Causal Reasoning on Narratives, by Khurram Yamin et al.
-
Summary of Diffbatt: a Diffusion Model For Battery Degradation Prediction and Synthesis, by Hamidreza Eivazi et al.
-
Summary of Rl-star: Theoretical Analysis Of Reinforcement Learning Frameworks For Self-taught Reasoner, by Fu-chieh Chang et al.
-
Summary of Neural Network Verification with Pyrat, by Augustin Lemesle et al.
-
Summary of Analyzing & Reducing the Need For Learning Rate Warmup in Gpt Training, by Atli Kosson et al.
-
Summary of Bitstack: Any-size Compression Of Large Language Models in Variable Memory Environments, by Xinghao Wang et al.
-
Summary of Robust Sparse Regression with Non-isotropic Designs, by Chih-hung Liu et al.
-
Summary of Learning Macroscopic Dynamics From Partial Microscopic Observations, by Mengyi Chen et al.
-
Summary of Transformers to Predict the Applicability Of Symbolic Integration Routines, by Rashid Barket et al.
-
Summary of Quantum Deep Equilibrium Models, by Philipp Schleich et al.
-
Summary of Scalable Kernel Inverse Optimization, by Youyuan Long et al.
-
Summary of Tract: Making First-layer Pre-activations Trainable, by Felix Petersen et al.
-
Summary of Representative Social Choice: From Learning Theory to Ai Alignment, by Tianyi Qiu
-
Summary of Ada-mshyper: Adaptive Multi-scale Hypergraph Transformer For Time Series Forecasting, by Zongjiang Shang et al.
-
Summary of Breaking Determinism: Fuzzy Modeling Of Sequential Recommendation Using Discrete State Space Diffusion Model, by Wenjia Xie et al.
-
Summary of Context-aware Testing: a New Paradigm For Model Testing with Large Language Models, by Paulius Rauba et al.
-
Summary of Diffpad: Denoising Diffusion-based Adversarial Patch Decontamination, by Jia Fu et al.
-
Summary of An Information Criterion For Controlled Disentanglement Of Multimodal Data, by Chenyu Wang et al.
-
Summary of What Happened in Llms Layers When Trained For Fast Vs. Slow Thinking: a Gradient Perspective, by Ming Li et al.
-
Summary of Syno: Structured Synthesis For Neural Operators, by Yongqi Zhuo et al.
-
Summary of Enhancing Chess Reinforcement Learning with Graph Representation, by Tomas Rigaux et al.
-
Summary of Lseattention Is All You Need For Time Series Forecasting, by Dizhen Liang
-
Summary of Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks, by Xuyuan Liu et al.
-
Summary of Exacfs — a Cil Method to Mitigate Catastrophic Forgetting, by S Balasubramanian et al.
-
Summary of What Is Wrong with Perplexity For Long-context Language Modeling?, by Lizhe Fang et al.
-
Summary of Towards Convexity in Anomaly Detection: a New Formulation Of Sslm with Unique Optimal Solutions, by Hongying Liu et al.
-
Summary of One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently, by Weida Li and Yaoliang Yu
-
Summary of Cale: Continuous Arcade Learning Environment, by Jesse Farebrother et al.
-
Summary of Graph Neural Networks Uncover Geometric Neural Representations in Reinforcement-based Motor Learning, by Federico Nardi et al.
-
Summary of Weight Decay Induces Low-rank Attention Layers, by Seijin Kobayashi et al.
-
Summary of Disentangling Disentangled Representations: Towards Improved Latent Units Via Diffusion Models, by Youngjun Jun et al.
-
Summary of Generative Ai-powered Plugin For Robust Federated Learning in Heterogeneous Iot Networks, by Youngjoon Lee et al.
-
Summary of Reducing Oversmoothing Through Informed Weight Initialization in Graph Neural Networks, by Dimitrios Kelesis et al.
-
Summary of Airway Labeling Meets Clinical Applications: Reflecting Topology Consistency and Outliers Via Learnable Attentions, by Chenyu Li et al.
-
Summary of Deterministic Exploration Via Stationary Bellman Error Maximization, by Sebastian Griesbach et al.