Paper List
We recommend you use the search box as this list is very long.
-
Summary of Scalable Nested Optimization For Deep Learning, by Jonathan Lorraine
-
Summary of Fairpriori: Improving Biased Subgroup Discovery For Deep Neural Network Fairness, by Kacy Zhou et al.
-
Summary of Maze Discovery Using Multiple Robots Via Federated Learning, by Kalpana Ranasinghe et al.
-
Summary of Physics-inspired Deep Learning and Transferable Models For Bridge Scour Prediction, by Negin Yousefpour et al.
-
Summary of Metric-entropy Limits on Nonlinear Dynamical System Learning, by Yang Pan et al.
-
Summary of Revisiting Random Walks For Learning on Graphs, by Jinwoo Kim et al.
-
Summary of Bridging Smoothness and Approximation: Theoretical Insights Into Over-smoothing in Graph Neural Networks, by Guangrui Yang et al.
-
Summary of Energy-aware Decentralized Learning with Intermittent Model Training, by Akash Dhasade et al.
-
Summary of Complementary Fusion Of Deep Network and Tree Model For Eta Prediction, by Yurui Huang et al.
-
Summary of We-math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?, by Runqi Qiao et al.
-
Summary of Hypformer: Exploring Efficient Hyperbolic Transformer Fully in Hyperbolic Space, by Menglin Yang et al.
-
Summary of A Collaborative, Human-centred Taxonomy Of Ai, Algorithmic, and Automation Harms, by Gavin Abercrombie et al.
-
Summary of Unveiling the Unseen: Exploring Whitebox Membership Inference Through the Lens Of Explainability, by Chenxi Li et al.
-
Summary of Collaborative Performance Prediction For Large Language Models, by Qiyuan Zhang et al.
-
Summary of Multi-state-action Tokenisation in Decision Transformers For Multi-discrete Action Spaces, by Perusha Moodley et al.
-
Summary of Evaluating Model Performance Under Worst-case Subpopulations, by Mike Li et al.
-
Summary of Increasing Model Capacity For Free: a Simple Strategy For Parameter Efficient Fine-tuning, by Haobo Song et al.
-
Summary of Gradient-based Class Weighting For Unsupervised Domain Adaptation in Dense Prediction Visual Tasks, by Roberto Alcover-couso et al.
-
Summary of Deep Reinforcement Learning For Adverse Garage Scenario Generation, by Kai Li
-
Summary of Restyling Unsupervised Concept Based Interpretable Networks with Generative Models, by Jayneel Parekh et al.
-
Summary of Coordination Failure in Cooperative Offline Marl, by Callum Rhys Tilbury et al.
-
Summary of Tparafac2: Tracking Evolving Patterns in (incomplete) Temporal Data, by Christos Chatzis et al.
-
Summary of Binary Losses For Density Ratio Estimation, by Werner Zellinger
-
Summary of Se(3)-hyena Operator For Scalable Equivariant Learning, by Artem Moskalev and Mangal Prakash and Rui Liao and Tommaso Mansi
-
Summary of Joint Pruning and Channel-wise Mixed-precision Quantization For Efficient Deep Neural Networks, by Beatrice Alessandra Motetti et al.
-
Summary of Improve Roi with Causal Learning and Conformal Prediction, by Meng Ai et al.
-
Summary of Human-like Object Concept Representations Emerge Naturally in Multimodal Large Language Models, by Changde Du et al.
-
Summary of On Statistical Rates and Provably Efficient Criteria Of Latent Diffusion Transformers (dits), by Jerry Yao-chieh Hu et al.
-
Summary of Kolmogorov-arnold Convolutions: Design Principles and Empirical Studies, by Ivan Drokin
-
Summary of Explaining Length Bias in Llm-based Preference Evaluations, by Zhengyu Hu et al.
-
Summary of Proximity Matters: Local Proximity Preserved Balancing For Treatment Effect Estimation, by Hao Wang et al.
-
Summary of Calibrated Large Language Models For Binary Question Answering, by Patrizio Giovannotti and Alexander Gammerman
-
Summary of Eliminating Position Bias Of Language Models: a Mechanistic Approach, by Ziqi Wang et al.
-
Summary of Enabling Mixed Effects Neural Networks For Diverse, Clustered Data Using Monte Carlo Methods, by Andrej Tschalzev et al.
-
Summary of Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning, by Abdulaziz Alwalan et al.
-
Summary of Cpt: Consistent Proxy Tuning For Black-box Optimization, by Yuanyang He et al.
-
Summary of Unaligning Everything: or Aligning Any Text to Any Image in Multimodal Models, by Shaeke Salman et al.
-
Summary of Benchmarking Predictive Coding Networks — Made Simple, by Luca Pinchetti and Chang Qi and Oleh Lokshyn and Gaspard Olivers and Cornelius Emde and Mufeng Tang and Amine M’charrak and Simon Frieder and Bayar Menzat and Rafal Bogacz and Thomas Lukasiewicz and Tommaso Salvatori
-
Summary of Neural Conditional Probability For Inference, by Vladimir R. Kostic et al.
-
Summary of A Learned Generalized Geodesic Distance Function-based Approach For Node Feature Augmentation on Graphs, by Amitoz Azad and Yuan Fang
-
Summary of Deep Learning Approach For Enhanced Transferability and Learning Capacity in Tool Wear Estimation, by Zongshuo Li et al.
-
Summary of Deep Learning Based Tool Wear Estimation Considering Cutting Conditions, by Zongshuo Li et al.
-
Summary of Efficient Cutting Tool Wear Segmentation Based on Segment Anything Model, by Zongshuo Li et al.
-
Summary of Foldgpt: Simple and Effective Large Language Model Compression Scheme, by Songwei Liu et al.
-
Summary of View From Above: a Framework For Evaluating Distribution Shifts in Model Behavior, by Tanush Chopra et al.
-
Summary of Efficient Expert Pruning For Sparse Mixture-of-experts Language Models: Enhancing Performance and Reducing Inference Costs, by Enshu Liu et al.
-
Summary of Splitlora: a Split Parameter-efficient Fine-tuning Framework For Large Language Models, by Zheng Lin et al.
-
Summary of Causal Bandits: the Pareto Optimal Frontier Of Adaptivity, a Reduction to Linear Bandits, and Limitations Around Unknown Marginals, by Ziyi Liu et al.
-
Summary of A Closer Look at Deep Learning Methods on Tabular Datasets, by Han-jia Ye et al.
-
Summary of Dynamic Universal Approximation Theory: the Basic Theory For Transformer-based Large Language Models, by Wei Wang et al.
-
Summary of Smoothed Analysis For Learning Concepts with Low Intrinsic Dimension, by Gautam Chandrasekaran et al.
-
Summary of How Does Overparameterization Affect Features?, by Ahmet Cagri Duzgun et al.
-
Summary of Hybrid Rag-empowered Multi-modal Llm For Secure Data Management in Internet Of Medical Things: a Diffusion-based Contract Approach, by Cheng Su et al.
-
Summary of Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?, by Nicy Scaria et al.
-
Summary of Curls: Causal Rule Learning For Subgroups with Significant Treatment Effect, by Jiehui Zhou et al.
-
Summary of Flood Prediction Using Classical and Quantum Machine Learning Models, by Marek Grzesiak et al.
-
Summary of Marlp: Time-series Forecasting Control For Agricultural Managed Aquifer Recharge, by Yuning Chen et al.
-
Summary of Swish-t : Enhancing Swish Activation with Tanh Bias For Improved Neural Network Performance, by Youngmin Seo et al.
-
Summary of Bayesian Entropy Neural Networks For Physics-aware Prediction, by Rahul Rathnakumar et al.
-
Summary of Pocketllm: Enabling On-device Fine-tuning For Personalized Llms, by Dan Peng et al.
-
Summary of Overcoming Common Flaws in the Evaluation Of Selective Classification Systems, by Jeremias Traub et al.
-
Summary of Distml.js: Installation-free Distributed Deep Learning Framework For Web Browsers, by Masatoshi Hidaka et al.
-
Summary of Neural Networks Trained by Weight Permutation Are Universal Approximators, By Yongqiang Cai et al.
-
Summary of Posterior Sampling with Denoising Oracles Via Tilted Transport, by Joan Bruna and Jiequn Han
-
Summary of Self-consistent Deep Geometric Learning For Heterogeneous Multi-source Spatial Point Data Prediction, by Dazhou Yu et al.
-
Summary of Improved Graph-based Semi-supervised Learning Schemes, by Farid Bozorgnia
-
Summary of Improving the Performance Of Stein Variational Inference Through Extreme Sparsification Of Physically-constrained Neural Network Models, by Govinda Anantha Padmanabha et al.
-
Summary of Structured and Balanced Multi-component and Multi-layer Neural Networks, by Shijun Zhang et al.
-
Summary of Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the Alphazero Framework, by Geigh Zollicoffer et al.
-
Summary of Model-free Active Exploration in Reinforcement Learning, by Alessio Russo et al.
-
Summary of Benchmarks For Reinforcement Learning with Biased Offline Data and Imperfect Simulators, by Ori Linial et al.
-
Summary of Kernel Neural Operators (knos) For Scalable, Memory-efficient, Geometrically-flexible Operator Learning, by Matthew Lowery et al.
-
Summary of Muse-net: Missingness-aware Multi-branching Self-attention Encoder For Irregular Longitudinal Electronic Health Records, by Zekai Wang et al.
-
Summary of Towards Understanding Sensitive and Decisive Patterns in Explainable Ai: a Case Study Of Model Interpretation in Geometric Deep Learning, by Jiajun Zhu et al.
-
Summary of Silver Linings in the Shadows: Harnessing Membership Inference For Machine Unlearning, by Nexhi Sula et al.
-
Summary of Efficient Automated Circuit Discovery in Transformers Using Contextual Decomposition, by Aliyah R. Hsu et al.
-
Summary of From Introspection to Best Practices: Principled Analysis Of Demonstrations in Multimodal In-context Learning, by Nan Xu et al.
-
Summary of Zeroddi: a Zero-shot Drug-drug Interaction Event Prediction Method with Semantic Enhanced Learning and Dual-modal Uniform Alignment, by Ziyan Wang et al.
-
Summary of Gso-yolo: Global Stability Optimization Yolo For Construction Site Detection, by Yuming Zhang et al.
-
Summary of Deep Image-to-recipe Translation, by Jiangqin Ma et al.
-
Summary of Learnability Of Parameter-bounded Bayes Nets, by Arnab Bhattacharyya et al.
-
Summary of Learnability in Online Kernel Selection with Memory Constraint Via Data-dependent Regret Analysis, by Junfan Li et al.
-
Summary of Look Ahead or Look Around? a Theoretical Comparison Between Autoregressive and Masked Pretraining, by Qi Zhang et al.
-
Summary of Maximum Entropy Inverse Reinforcement Learning Of Diffusion Models with Energy-based Models, by Sangwoong Yoon et al.
-
Summary of Tarsier: Recipes For Training and Evaluating Large Video Description Models, by Jiawei Wang et al.
-
Summary of Clusterpath Gaussian Graphical Modeling, by D. J. W. Touw et al.
-
Summary of Establishing Deep Infomax As An Effective Self-supervised Learning Methodology in Materials Informatics, by Michael Moran et al.
-
Summary of Particle Semi-implicit Variational Inference, by Jen Ning Lim et al.
-
Summary of Teal: New Selection Strategy For Small Buffers in Experience Replay Class Incremental Learning, by Shahar Shaul-ariel et al.
-
Summary of Bapo: Base-anchored Preference Optimization For Overcoming Forgetting in Large Language Models Personalization, by Gihun Lee et al.
-
Summary of Graph in Graph Neural Network, by Jiongshu Wang et al.
-
Summary of Nourishnet: Proactive Severity State Forecasting Of Food Commodity Prices For Global Warning Systems, by Sydney Balboni et al.
-
Summary of Model-based Offline Reinforcement Learning with Lower Expectile Q-learning, by Kwanyoung Park and Youngwoon Lee
-
Summary of Heterogeneous Graph Contrastive Learning with Spectral Augmentation, by Jing Zhang et al.
-
Summary of Sum-of-norms Regularized Nonnegative Matrix Factorization, by Andersen Ang et al.
-
Summary of Directly Handling Missing Data in Linear Discriminant Analysis For Enhancing Classification Accuracy and Interpretability, by Tuan L. Vo et al.
-
Summary of Learning System Dynamics Without Forgetting, by Xikun Zhang et al.
-
Summary of D-cdlf: Decomposition Of Common and Distinctive Latent Factors For Multi-view High-dimensional Data, by Hai Shu
-
Summary of Locate&edit: Energy-based Text Editing For Efficient, Flexible, and Faithful Controlled Text Generation, by Hye Ryung Son and Jay-yoon Lee
-
Summary of Large Language Models Struggle in Token-level Clinical Named Entity Recognition, by Qiuhao Lu et al.