Paper List
We recommend you use the search box as this list is very long.
-
Summary of Hydragen: High-throughput Llm Inference with Shared Prefixes, by Jordan Juravsky et al.
-
Summary of Tighter Generalisation Bounds Via Interpolation, by Paul Viallard et al.
-
Summary of Hydra: Sequentially-dependent Draft Heads For Medusa Decoding, by Zachary Ankner et al.
-
Summary of Closing the Gap Between Sgp4 and High-precision Propagation Via Differentiable Programming, by Giacomo Acciarini et al.
-
Summary of Pseudo-labelling Meets Label Smoothing For Noisy Partial Label Learning, by Darshana Saravanan et al.
-
Summary of On the Completeness Of Invariant Geometric Deep Learning Models, by Zian Li et al.
-
Summary of Explaining Learned Reward Functions with Counterfactual Trajectories, by Jan Wehner et al.
-
Summary of Codeit: Self-improving Language Models with Prioritized Hindsight Replay, by Natasha Butt et al.
-
Summary of Multi-patch Prediction: Adapting Llms For Time Series Representation Learning, by Yuxuan Bian et al.
-
Summary of Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-aware Policy, By Ruichu Cai et al.
-
Summary of Choosing a Classical Planner with Graph Neural Networks, by Jana Vatter et al.
-
Summary of A Unified Gaussian Process For Branching and Nested Hyperparameter Optimization, by Jiazhao Zhang and Ying Hung and Chung-ching Lin and Zicheng Liu
-
Summary of On Provable Length and Compositional Generalization, by Kartik Ahuja et al.
-
Summary of Probabilistic Ml Verification Via Weighted Model Integration, by Paolo Morettin et al.
-
Summary of The Strain Of Success: a Predictive Model For Injury Risk Mitigation and Team Success in Soccer, by Gregory Everett et al.
-
Summary of L4q: Parameter Efficient Quantization-aware Fine-tuning on Large Language Models, by Hyesung Jeon et al.
-
Summary of Conformal Convolution and Monte Carlo Meta-learners For Predictive Inference Of Individual Treatment Effects, by Jef Jonkers et al.
-
Summary of Voronoi Candidates For Bayesian Optimization, by Nathan Wycoff et al.
-
Summary of Moco: a Learnable Meta Optimizer For Combinatorial Optimization, by Tim Dernedde et al.
-
Summary of Two Trades Is Not Baffled: Condensing Graph Via Crafting Rational Gradient Matching, by Tianle Zhang and Yuchen Zhang and Kun Wang and Kai Wang and Beining Yang and Kaipeng Zhang and Wenqi Shao and Ping Liu and Joey Tianyi Zhou and Yang You
-
Summary of Source-free Domain Adaptation with Diffusion-guided Source Data Generation, by Shivang Chopra et al.
-
Summary of Blue Noise For Diffusion Models, by Xingchang Huang et al.
-
Summary of Latent Plan Transformer For Trajectory Abstraction: Planning As Latent Space Inference, by Deqian Kong et al.
-
Summary of Open-vocabulary Calibration For Fine-tuned Clip, by Shuoyuan Wang et al.
-
Summary of A Perspective on Individualized Treatment Effects Estimation From Time-series Health Data, by Ghadeer O. Ghosheh et al.
-
Summary of Group Distributionally Robust Dataset Distillation with Risk Minimization, by Saeed Vahidian et al.
-
Summary of Faithlm: Towards Faithful Explanations For Large Language Models, by Yu-neng Chuang et al.
-
Summary of Learning Operators with Stochastic Gradient Descent in General Hilbert Spaces, by Lei Shi and Jia-qi Yang
-
Summary of Breaking Free: How to Hack Safety Guardrails in Black-box Diffusion Models!, by Shashank Kotyan et al.
-
Summary of From Explained Variance Of Correlated Components to Pca Without Orthogonality Constraints, by Marie Chavent (imb) et al.
-
Summary of Incorporating Retrieval-based Causal Learning with Information Bottlenecks For Interpretable Graph Neural Networks, by Jiahua Rao et al.
-
Summary of Non-parametric Estimation Of Multi-dimensional Marked Hawkes Processes, by Sobin Joseph and Shashi Jain
-
Summary of Graph Cuts with Arbitrary Size Constraints Through Optimal Transport, by Chakib Fettal et al.
-
Summary of Towards Aligned Layout Generation Via Diffusion Model with Aesthetic Constraints, by Jian Chen et al.
-
Summary of Progressive Gradient Flow For Robust N:m Sparsity Training in Transformers, by Abhimanyu Rajeshkumar Bambhaniya et al.
-
Summary of Code As Reward: Empowering Reinforcement Learning with Vlms, by David Venuto et al.
-
Summary of Color Recognition in Challenging Lighting Environments: Cnn Approach, by Nizamuddin Maitlo et al.
-
Summary of Analyzing the Neural Tangent Kernel Of Periodically Activated Coordinate Networks, by Hemanth Saratchandran et al.
-
Summary of A Fast Score-based Search Algorithm For Maximal Ancestral Graphs Using Entropy, by Zhongyi Hu and Robin Evans
-
Summary of Scalable Multi-view Clustering Via Explicit Kernel Features Maps, by Chakib Fettal et al.
-
Summary of Bowl: a Deceptively Simple Open World Learner, by Roshni .r. Kamath et al.
-
Summary of E(3)-equivariant Mesh Neural Networks, by Thuan Trang et al.
-
Summary of Online Cascade Learning For Efficient Inference Over Streams, by Lunyiu Nie et al.
-
Summary of Generalized Sobolev Transport For Probability Measures on a Graph, by Tam Le et al.
-
Summary of On Computational Limits Of Modern Hopfield Models: a Fine-grained Complexity Analysis, by Jerry Yao-chieh Hu et al.
-
Summary of Sumrec: a Framework For Recommendation Using Open-domain Dialogue, by Ryutaro Asahara et al.
-
Summary of Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers, by Md Shamim Hussain et al.
-
Summary of Learning Diverse Policies with Soft Self-generated Guidance, by Guojian Wang et al.
-
Summary of Riemann-lebesgue Forest For Regression, by Tian Qin et al.
-
Summary of Curvature-informed Sgd Via General Purpose Lie-group Preconditioners, by Omead Pooladzandi and Xi-lin Li
-
Summary of Oil-ad: An Anomaly Detection Framework For Sequential Decision Sequences, by Chen Wang et al.
-
Summary of Collective Counterfactual Explanations Via Optimal Transport, by Ahmad-reza Ehyaei et al.
-
Summary of Towards Improved Imbalance Robustness in Continual Multi-label Learning with Dual Output Spiking Architecture (dosa), by Sourav Mishra et al.
-
Summary of Meet Jeanie: a Similarity Measure For 3d Skeleton Sequences Via Temporal-viewpoint Alignment, by Lei Wang and Jun Liu and Liang Zheng and Tom Gedeon and Piotr Koniusz
-
Summary of Wasserstein Gradient Flows For Moreau Envelopes Of F-divergences in Reproducing Kernel Hilbert Spaces, by Viktor Stein et al.
-
Summary of Beyond Answers: Transferring Reasoning Capabilities to Smaller Llms Using Multi-teacher Knowledge Distillation, by Yijun Tian et al.
-
Summary of Infllm: Training-free Long-context Extrapolation For Llms with An Efficient Context Memory, by Chaojun Xiao et al.
-
Summary of Feature Distribution on Graph Topology Mediates the Effect Of Graph Convolution: Homophily Perspective, by Soo Yong Lee et al.
-
Summary of Levi: Generalizable Fine-tuning Via Layer-wise Ensemble Of Different Views, by Yuji Roh et al.
-
Summary of Block Sparse Bayesian Learning: a Diversified Scheme, by Yanhao Zhang et al.
-
Summary of Quip#: Even Better Llm Quantization with Hadamard Incoherence and Lattice Codebooks, by Albert Tseng et al.
-
Summary of Learning Under Temporal Label Noise, by Sujay Nagaraj et al.
-
Summary of Cehr-gpt: Generating Electronic Health Records with Chronological Patient Timelines, by Chao Pang et al.
-
Summary of Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap, By Christopher Liao et al.
-
Summary of Towards Fair, Robust and Efficient Client Contribution Evaluation in Federated Learning, by Meiying Zhang et al.
-
Summary of The Vampprior Mixture Model, by Andrew A. Stirn and David A. Knowles
-
Summary of Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit, by Mengfan Xu et al.
-
Summary of Learning to Extract Structured Entities Using Language Models, by Haolun Wu et al.
-
Summary of Exploring Higher-order Neural Network Node Interactions with Total Correlation, by Thomas Kerby et al.
-
Summary of Dyslim: Dynamics Stable Learning by Invariant Measure For Chaotic Systems, By Yair Schiff et al.
-
Summary of Iot Network Traffic Analysis with Deep Learning, by Mei Liu and Leon Yang
-
Summary of Incentivized Truthful Communication For Federated Bandits, by Zhepei Wei et al.
-
Summary of De-amplifying Bias From Differential Privacy in Language Model Fine-tuning, by Sanjari Srivastava et al.
-
Summary of Amortized Planning with Large-scale Transformers: a Case Study on Chess, by Anian Ruoss et al.
-
Summary of A Primal-dual Algorithm For Offline Constrained Reinforcement Learning with Linear Mdps, by Kihyuk Hong et al.
-
Summary of The Fine-grained Complexity Of Gradient Computation For Training Large Language Models, by Josh Alman et al.
-
Summary of Pathspace Kalman Filters with Dynamic Process Uncertainty For Analyzing Time-course Data, by Chaitra Agrahar et al.
-
Summary of Adaflow: Imitation Learning with Variance-adaptive Flow-based Policies, by Xixi Hu et al.
-
Summary of Pres: Toward Scalable Memory-based Dynamic Graph Neural Networks, by Junwei Su et al.
-
Summary of Billm: Pushing the Limit Of Post-training Quantization For Llms, by Wei Huang et al.
-
Summary of Multi-view Symbolic Regression, by Etienne Russeil et al.
-
Summary of Enhance Dnn Adversarial Robustness and Efficiency Via Injecting Noise to Non-essential Neurons, by Zhenyu Liu et al.
-
Summary of Less: Selecting Influential Data For Targeted Instruction Tuning, by Mengzhou Xia et al.
-
Summary of Does Confidence Calibration Improve Conformal Prediction?, by Huajun Xi et al.
-
Summary of Legallens: Leveraging Llms For Legal Violation Identification in Unstructured Text, by Dor Bernsohn et al.
-
Summary of The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry, by Michael Zhang et al.
-
Summary of Pqmass: Probabilistic Assessment Of the Quality Of Generative Models Using Probability Mass Estimation, by Pablo Lemos et al.
-
Summary of Adaptive Inference: Theoretical Limits and Unexplored Opportunities, by Soheil Hor et al.
-
Summary of Nercc: Nested-regression Coded Computing For Resilient Distributed Prediction Serving Systems, by Parsa Moradi et al.
-
Summary of Scaling Laws For Learning with Real and Surrogate Data, by Ayush Jain et al.
-
Summary of Bounding the Excess Risk For Linear Models Trained on Marginal-preserving, Differentially-private, Synthetic Data, by Yvonne Zhou et al.
-
Summary of Fine-tuned Language Models Generate Stable Inorganic Materials As Text, by Nate Gruver et al.
-
Summary of Counterfactual Generation with Answer Set Programming, by Sopam Dasgupta et al.