Paper List
We recommend you use the search box as this list is very long.
-
Summary of Training Compute-optimal Protein Language Models, by Xingyi Cheng et al.
-
Summary of Theoretical Characterisation Of the Gauss-newton Conditioning in Neural Networks, by Jim Zhao et al.
-
Summary of Le-pde++: Mamba For Accelerating Pdes Simulations, by Aoming Liang et al.
-
Summary of Fppl: An Efficient and Non-iid Robust Federated Continual Learning Framework, by Yuchen He et al.
-
Summary of Exploring the Landscape For Generative Sequence Models For Specialized Data Synthesis, by Mohammad Zbeeb et al.
-
Summary of Differentially Private and Decentralized Randomized Power Method, by Julien Nicolas et al.
-
Summary of Exagree: Towards Explanation Agreement in Explainable Machine Learning, by Sichao Li et al.
-
Summary of Unsegmedgat: Unsupervised Medical Image Segmentation Using Graph Attention Networks Clustering, by A. Mudit Adityaja et al.
-
Summary of N-gram Induction Heads For In-context Rl: Improving Stability and Reducing Data Needs, by Ilya Zisman et al.
-
Summary of Learning Controlled Stochastic Differential Equations, by Luc Brogat-motte et al.
-
Summary of Understanding Variational Autoencoders with Intrinsic Dimension and Information Imbalance, by Charles Camboulin et al.
-
Summary of Ask, and It Shall Be Given: on the Turing Completeness Of Prompting, by Ruizhong Qiu et al.
-
Summary of Local Loss Optimization in the Infinite Width: Stable Parameterization Of Predictive Coding Networks and Target Propagation, by Satoki Ishikawa et al.
-
Summary of Against Multifaceted Graph Heterogeneity Via Asymmetric Federated Prompt Learning, by Zhuoning Guo et al.
-
Summary of Culinary Class Wars: Evaluating Llms Using Ash in Cuisine Transfer Task, by Hoonick Lee et al.
-
Summary of Optimal Classification Under Performative Distribution Shift, by Edwige Cyffers (magnet) et al.
-
Summary of Theory-inspired Label Shift Adaptation Via Aligned Distribution Mixture, by Ruidong Fan et al.
-
Summary of Addressing Representation Collapse in Vector Quantized Models with One Linear Layer, by Yongxin Zhu et al.
-
Summary of R+r:understanding Hyperparameter Effects in Dp-sgd, by Felix Morsbach et al.
-
Summary of Tablegpt2: a Large Multimodal Model with Tabular Data Integration, by Aofeng Su et al.
-
Summary of Scalable Efficient Training Of Large Language Models with Low-dimensional Projected Attention, by Xingtai Lv et al.
-
Summary of Show, Don’t Tell: Learning Reward Machines From Demonstrations For Reinforcement Learning-based Cardiac Pacemaker Synthesis, by John Komp et al.
-
Summary of A Theoretical Characterization Of Optimal Data Augmentations in Self-supervised Learning, by Shlomo Libo Feigin et al.
-
Summary of Clustering Based on Density Propagation and Subcluster Merging, by Feiping Nie et al.
-
Summary of Thinking Forward and Backward: Effective Backward Planning with Large Language Models, by Allen Z. Ren et al.
-
Summary of Fast Semi-supervised Learning on Large Graphs: An Improved Green-function Method, by Feiping Nie et al.
-
Summary of Salsa: Soup-based Alignment Learning For Stronger Adaptation in Rlhf, by Atoosa Chegini et al.
-
Summary of Expanding Sparse Tuning For Low Memory Usage, by Shufan Shen et al.
-
Summary of Bootstrapping Top-down Information For Self-modulating Slot Attention, by Dongwon Kim et al.
-
Summary of Fixing the Loose Brake: Exponential-tailed Stopping Time in Best Arm Identification, by Kapilan Balagopalan et al.
-
Summary of High-pass Graph Convolutional Network For Enhanced Anomaly Detection: a Novel Approach, by Shelei Li et al.
-
Summary of Shrinking the Giant : Quasi-weightless Transformers For Low Energy Inference, by Shashank Nag et al.
-
Summary of Fedrema: Improving Personalized Federated Learning Via Leveraging the Most Relevant Clients, by Han Liang et al.
-
Summary of Formal Theorem Proving by Rewarding Llms to Decompose Proofs Hierarchically, By Kefan Dong et al.
-
Summary of Leveraging Label Semantics and Meta-label Refinement For Multi-label Question Classification, by Shi Dong and Xiaobei Niu and Rui Zhong and Zhifeng Wang and Mingzhang Zuo
-
Summary of Owmatch: Conditional Self-labeling with Consistency For Open-world Semi-supervised Learning, by Shengjie Niu et al.
-
Summary of Manibox: Enhancing Spatial Grasping Generalization Via Scalable Simulation Data Generation, by Hengkai Tan et al.
-
Summary of Metoken: Uniform Micro-environment Token Boosts Post-translational Modification Prediction, by Cheng Tan et al.
-
Summary of Causal Discovery and Classification Using Lempel-ziv Complexity, by Dhruthi et al.
-
Summary of Best-arm Identification in Unimodal Bandits, by Riccardo Poiani et al.
-
Summary of Gitsr: Graph Interaction Transformer-based Scene Representation For Multi Vehicle Collaborative Decision-making, by Xingyu Hu et al.
-
Summary of Counterfactual Explainability Of Black-box Prediction Models, by Zijun Gao and Qingyuan Zhao
-
Summary of Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-scale Complexity, by Tengyuan Liang et al.
-
Summary of Multiclass Transductive Online Learning, by Steve Hanneke et al.
-
Summary of Lorentz-equivariant Quantum Graph Neural Network For High-energy Physics, by Md Abrar Jahin et al.
-
Summary of Quantum Rationale-aware Graph Contrastive Learning For Jet Discrimination, by Md Abrar Jahin et al.
-
Summary of Achieving Domain-independent Certified Robustness Via Knowledge Continuity, by Alan Sun et al.
-
Summary of Diagnosing Medical Datasets with Training Dynamics, by Laura Wenderoth
-
Summary of Unlocking the Theory Behind Scaling 1-bit Neural Networks, by Majid Daliri et al.
-
Summary of Graphxform: Graph Transformer For Computer-aided Molecular Design with Application to Extraction, by Jonathan Pirnay et al.
-
Summary of Robust Neural Processes For Noisy Data, by Chen Shapira et al.
-
Summary of Mitigating Matching Biases Through Score Calibration, by Mohammad Hossein Moslemi et al.
-
Summary of Uniguard: Towards Universal Safety Guardrails For Jailbreak Attacks on Multimodal Large Language Models, by Sejoon Oh et al.
-
Summary of Conformal Risk Minimization with Variance Reduction, by Sima Noorani et al.
-
Summary of Rethinking Weight Decay For Robust Fine-tuning Of Foundation Models, by Junjiao Tian et al.
-
Summary of A General Recipe For Contractive Graph Neural Networks — Technical Report, by Maya Bechler-speicher and Moshe Eliasof
-
Summary of 1st-order Magic: Analysis Of Sharpness-aware Minimization, by Nalin Tiwary and Siddarth Aananth
-
Summary of Learning From Convolution-based Unlearnable Datastes, by Dohyun Kim et al.
-
Summary of Mitigating Spurious Correlations Via Disagreement Probability, by Hyeonggeun Han et al.
-
Summary of Dpcl-diff: the Temporal Knowledge Graph Reasoning Based on Graph Node Diffusion Model with Dual-domain Periodic Contrastive Learning, by Yukun Cao et al.
-
Summary of Anomalous Client Detection in Federated Learning, by Dipanwita Thakur et al.
-
Summary of Sample-efficient Alignment For Llms, by Zichen Liu et al.
-
Summary of Dsde: Using Proportion Estimation to Improve Model Selection For Out-of-distribution Detection, by Jingyao Geng et al.
-
Summary of Facedig: Automated Tool For Placing Landmarks on Facial Portraits For Geometric Morphometrics Users, by Karel Kleisner et al.
-
Summary of Diversity Progress For Goal Selection in Discriminability-motivated Rl, by Erik M. Lintunen et al.
-
Summary of Performance Evaluation Of Deep Learning Models For Water Quality Index Prediction: a Comparative Study Of Lstm, Tcn, Ann, and Mlp, by Muhammad Ismail et al.
-
Summary of Sparc: Spectral Architectures Tackling the Cold-start Problem in Graph Learning, by Yahel Jacobs et al.
-
Summary of Enhancing Llm Evaluations: the Garbling Trick, by William F. Bradley
-
Summary of Customized Subgraph Selection and Encoding For Drug-drug Interaction Prediction, by Haotong Du et al.
-
Summary of Llms and the Madness Of Crowds, by William F. Bradley
-
Summary of Decoupling Dark Knowledge Via Block-wise Logit Distillation For Feature-level Alignment, by Chengting Yu et al.
-
Summary of Analysis Of Regularized Federated Learning, by Langming Liu and Dingxuan Zhou
-
Summary of Adaptive Conformal Inference by Particle Filtering Under Hidden Markov Models, By Xiaoyi Su et al.
-
Summary of Conditional Controllable Image Fusion, by Bing Cao et al.
-
Summary of Decision Trees For Interpretable Clusters in Mixture Models and Deep Representations, by Maximilian Fleissner et al.
-
Summary of Federated Learning Clients Clustering with Adaptation to Data Drifts, by Minghao Li (1) et al.
-
Summary of Strategic Conformal Prediction, by Daniel Csillag et al.
-
Summary of Graph Fourier Neural Odes: Modeling Spatial-temporal Multi-scales in Molecular Dynamics, by Fang Sun et al.
-
Summary of Filternet: Harnessing Frequency Filters For Time Series Forecasting, by Kun Yi et al.
-
Summary of Can Large Language Model Predict Employee Attrition?, by Xiaoye Ma et al.
-
Summary of Use Digital Twins to Support Fault Diagnosis From System-level Condition-monitoring Data, by Killian Mc Court et al.
-
Summary of Waka: Data Attribution Using K-nearest Neighbors and Membership Privacy Principles, by Patrick Mesana et al.
-
Summary of Network Causal Effect Estimation in Graphical Models Of Contagion and Latent Confounding, by Yufeng Wu et al.
-
Summary of Learning with Hidden Factorial Structure, by Charles Arnal et al.
-
Summary of Exploring the Edges Of Latent State Clusters For Goal-conditioned Reinforcement Learning, by Yuanlin Duan et al.
-
Summary of Multi-channel Hypergraph Contrastive Learning For Matrix Completion, by Xiang Li et al.
-
Summary of Hyperbox Mixture Regression For Process Performance Prediction in Antibody Production, by Ali Nik-khorasani et al.
-
Summary of Classifier-guided Gradient Modulation For Enhanced Multimodal Learning, by Zirun Guo et al.
-
Summary of Pagerank Bandits For Link Prediction, by Yikun Ban et al.
-
Summary of Enhancing Glucose Level Prediction Of Icu Patients Through Hierarchical Modeling Of Irregular Time-series, by Hadi Mehdizavareh et al.
-
Summary of Psformer: Parameter-efficient Transformer with Segment Attention For Time Series Forecasting, by Yanlong Wang et al.
-
Summary of Learning Hidden Subgoals Under Temporal Ordering Constraints in Reinforcement Learning, by Duo Xu et al.
-
Summary of Efficient Deep Learning Infrastructures For Embedded Computing Systems: a Comprehensive Survey and Future Envision, by Xiangzhong Luo et al.
-
Summary of Hobbit: a Mixed Precision Expert Offloading System For Fast Moe Inference, by Peng Tang et al.
-
Summary of Privacy-preserving Customer Churn Prediction Model in the Context Of Telecommunication Industry, by Joydeb Kumar Sana et al.
-
Summary of Online Relational Inference For Evolving Multi-agent Interacting Systems, by Beomseok Kang et al.