Paper List
We recommend you use the search box as this list is very long.
-
Summary of Training a Neural Netwok For Data Reduction and Better Generalization, by Sylvain Sardy and Maxime Van Cutsem and Xiaoyu Ma
-
Summary of X-meshgraphnet: Scalable Multi-scale Graph Neural Networks For Physics Simulation, by Mohammad Amin Nabian et al.
-
Summary of Learning Hierarchical Polynomials Of Multiple Nonlinear Features with Three-layer Networks, by Hengyu Fu et al.
-
Summary of Classifier-free Guidance Inside the Attraction Basin May Cause Memorization, by Anubhav Jain et al.
-
Summary of Priordiffusion: Leverage Language Prior in Diffusion Models For Monocular Depth Estimation, by Ziyao Zeng et al.
-
Summary of In-context Experience Replay Facilitates Safety Red-teaming Of Text-to-image Diffusion Models, by Zhi-yi Chin et al.
-
Summary of Libragrad: Balancing Gradient Flow For Universally Better Vision Transformer Attributions, by Faridoun Mehri (1) et al.
-
Summary of Parameter Efficient Instruction Tuning: An Empirical Study, by Pengfei He
-
Summary of Scaling Laws For Black Box Adversarial Attacks, by Chuan Liu et al.
-
Summary of Learning Predictive Checklists with Probabilistic Logic Programming, by Yukti Makhija et al.
-
Summary of Towards Efficient Model-heterogeneity Federated Learning For Large Models, by Ruofan Jia et al.
-
Summary of Enhancing In-hospital Mortality Prediction Using Multi-representational Learning with Llm-generated Expert Summaries, by Harshavardhan Battula et al.
-
Summary of Pathways on the Image Manifold: Image Editing Via Video Generation, by Noam Rotstein et al.
-
Summary of Kl-geodesics Flow Matching with a Novel Sampling Scheme, by Egor Sevriugov et al.
-
Summary of Decision Making Under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets, by Charita Dellaporta and Patrick O’hara and Theodoros Damoulas
-
Summary of Curvature in the Looking-glass: Optimal Methods to Exploit Curvature Of Expectation in the Loss Landscape, by Jed A. Duersch et al.
-
Summary of Edit Away and My Face Will Not Stay: Personal Biometric Defense Against Malicious Generative Editing, by Hanhui Wang et al.
-
Summary of Recast: Reparameterized, Compact Weight Adaptation For Sequential Tasks, by Nazia Tasnim and Bryan A. Plummer
-
Summary of Explainable Ai Approach Using Near Misses Analysis, by Eran Kaufman and Avivit Levy
-
Summary of Exptest: Automating Learning Rate Searching and Tuning with Insights From Linearized Neural Networks, by Zan Chaudhry and Naoko Mizuno
-
Summary of Probing the Limitations Of Multimodal Language Models For Chemistry and Materials Research, by Nawaf Alampara et al.
-
Summary of Clustering Time Series Data with Gaussian Mixture Embeddings in a Graph Autoencoder Framework, by Amirabbas Afzali et al.
-
Summary of Distributed, Communication-efficient, and Differentially Private Estimation Of Kl Divergence, by Mary Scott et al.
-
Summary of Fundamental Limits Of Prompt Tuning Transformers: Universality, Capacity and Efficiency, by Jerry Yao-chieh Hu et al.
-
Summary of Continual Deep Reinforcement Learning with Task-agnostic Policy Distillation, by Muhammad Burhan Hafez et al.
-
Summary of Transformers Are Deep Optimizers: Provable In-context Learning For Deep Model Training, by Weimin Wu et al.
-
Summary of Representation Collapsing Problems in Vector Quantization, by Wenhao Zhao et al.
-
Summary of Generating Out-of-distribution Scenarios Using Language Models, by Erfan Aasi et al.
-
Summary of Adversarial Attacks For Drift Detection, by Fabian Hinder et al.
-
Summary of Enhancing Few-shot Learning with Integrated Data and Gan Model Approaches, by Yinqiu Feng et al.
-
Summary of Enhancing Llm Reasoning Via Critique Models with Test-time and Training-time Supervision, by Zhiheng Xi et al.
-
Summary of Exploring Discrete Flow Matching For 3d De Novo Molecule Generation, by Ian Dunn et al.
-
Summary of Graph Pooling by Local Cluster Selection, By Yizhu Chen
-
Summary of Self-generated Critiques Boost Reward Modeling For Language Models, by Yue Yu et al.
-
Summary of Fast Training Of Large Kernel Models with Delayed Projections, by Amirhesam Abedsoltan et al.
-
Summary of Gaussian Process Priors For Boundary Value Problems Of Linear Partial Differential Equations, by Jianlei Huang et al.
-
Summary of Catnet: Effective Fdr Control in Lstm with Gaussian Mirrors and Shap Feature Importance, by Jiaan Han et al.
-
Summary of Quark: Real-time, High-resolution, and General Neural View Synthesis, by John Flynn et al.
-
Summary of Parce: Probabilistic and Reconstruction-based Competency Estimation For Cnn-based Image Classification, by Sara Pohland and Claire Tomlin
-
Summary of Learn2synth: Learning Optimal Data Synthesis Using Hypergradients For Brain Image Segmentation, by Xiaoling Hu et al.
-
Summary of Maximizing the Impact Of Deep Learning on Subseasonal-to-seasonal Climate Forecasting: the Essential Role Of Optimization, by Yizhen Guo et al.
-
Summary of Federated Learning in Chemical Engineering: a Tutorial on a Framework For Privacy-preserving Collaboration Across Distributed Data Sources, by Siddhant Dutta et al.
-
Summary of Diffdesign: Controllable Diffusion with Meta Prior For Efficient Interior Design Generation, by Yuxuan Yang et al.
-
Summary of Evaluating Rank-n-contrast: Continuous and Robust Representations For Regression, by Six Valentin et al.
-
Summary of Learning From Relevant Subgoals in Successful Dialogs Using Iterative Training For Task-oriented Dialog Systems, by Magdalena Kaiser et al.
-
Summary of Understanding Generalization Of Federated Learning: the Trade-off Between Model Stability and Optimization, by Dun Zeng et al.
-
Summary of Catp-llm: Empowering Large Language Models For Cost-aware Tool Planning, by Duo Wu et al.
-
Summary of Local Learning For Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables, by Zheng Li et al.
-
Summary of Towards Foundation Models For Critical Care Time Series, by Manuel Burger et al.
-
Summary of A Data-driven Approach to Dataflow-aware Online Scheduling For Graph Neural Network Inference, by Pol Puigdemont et al.
-
Summary of Machine Learning For Cerebral Blood Vessels’ Malformations, by Irem Topal et al.
-
Summary of A Review Of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation, by M.m.a. Valiuddin et al.
-
Summary of Turbofan Engine Remaining Useful Life (rul) Prediction Based on Bi-directional Long Short-term Memory (blstm), by Abedin Sherifi
-
Summary of Machine Learning For the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks, by Asanobu Kitamoto et al.
-
Summary of Unsupervised Event Outlier Detection in Continuous Time, by Somjit Nath et al.
-
Summary of Privacy Protection in Personalized Diffusion Models Via Targeted Cross-attention Adversarial Attack, by Xide Xu et al.
-
Summary of Tifed: a Tiny Integer-based Federated Learning Algorithm with Direct Feedback Alignment, by Luca Colombo et al.
-
Summary of On the Reconstruction Of Training Data From Group Invariant Networks, by Ran Elbaz et al.
-
Summary of Lion Cub: Minimizing Communication Overhead in Distributed Lion, by Satoki Ishikawa et al.
-
Summary of No Identity, No Problem: Motion Through Detection For People Tracking, by Martin Engilberge et al.
-
Summary of Interpreting Language Reward Models Via Contrastive Explanations, by Junqi Jiang et al.
-
Summary of Distributed Online Optimization with Stochastic Agent Availability, by Juliette Achddou et al.
-
Summary of Context Awareness Gate For Retrieval Augmented Generation, by Mohammad Hassan Heydari et al.
-
Summary of Df-gnn: Dynamic Fusion Framework For Attention Graph Neural Networks on Gpus, by Jiahui Liu et al.
-
Summary of Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics, by Tian Bowen et al.
-
Summary of Causal Adjacency Learning For Spatiotemporal Prediction Over Graphs, by Zhaobin Mo et al.
-
Summary of Videoorion: Tokenizing Object Dynamics in Videos, by Yicheng Feng et al.
-
Summary of Graph Adapter Of Eeg Foundation Models For Parameter Efficient Fine Tuning, by Toyotaro Suzumura et al.
-
Summary of Mixpe: Quantization and Hardware Co-design For Efficient Llm Inference, by Yu Zhang et al.
-
Summary of Sparse Patches Adversarial Attacks Via Extrapolating Point-wise Information, by Yaniv Nemcovsky et al.
-
Summary of Badsfl: Backdoor Attack Against Scaffold Federated Learning, by Xingshuo Han et al.
-
Summary of Learn From Foundation Model: Fruit Detection Model Without Manual Annotation, by Yanan Wang and Zhenghao Fei and Ruichen Li and Yibin Ying
-
Summary of Neural Network-based High-index Saddle Dynamics Method For Searching Saddle Points and Solution Landscape, by Yuankai Liu et al.
-
Summary of Video-text Dataset Construction From Multi-ai Feedback: Promoting Weak-to-strong Preference Learning For Video Large Language Models, by Hao Yi and Qingyang Li and Yulan Hu and Fuzheng Zhang and Di Zhang and Yong Liu
-
Summary of Batch Bayesian Optimization Via Expected Subspace Improvement, by Dawei Zhan et al.
-
Summary of Effective Non-random Extreme Learning Machine, by Daniela De Canditiis and Fabiano Veglianti
-
Summary of Efficient Pooling Of Predictions Via Kernel Embeddings, by Sam Allen et al.
-
Summary of Transparent Neighborhood Approximation For Text Classifier Explanation, by Yi Cai et al.
-
Summary of Even Sparser Graph Transformers, by Hamed Shirzad et al.
-
Summary of Unraveling Arithmetic in Large Language Models: the Role Of Algebraic Structures, by Fu-chieh Chang et al.
-
Summary of A Graph Neural Architecture Search Approach For Identifying Bots in Social Media, by Georgios Tzoumanekas et al.
-
Summary of Adaptive Methods Through the Lens Of Sdes: Theoretical Insights on the Role Of Noise, by Enea Monzio Compagnoni et al.
-
Summary of Stability Properties Of Gradient Flow Dynamics For the Symmetric Low-rank Matrix Factorization Problem, by Hesameddin Mohammadi et al.
-
Summary of Ensuring Fair Llm Serving Amid Diverse Applications, by Redwan Ibne Seraj Khan et al.
-
Summary of Pianist: Learning Partially Observable World Models with Llms For Multi-agent Decision Making, by Jonathan Light et al.
-
Summary of Efedllm: Efficient Llm Inference Based on Federated Learning, by Shengwen Ding and Chenhui Hu
-
Summary of M3: Mamba-assisted Multi-circuit Optimization Via Mbrl with Effective Scheduling, by Youngmin Oh et al.
-
Summary of Binary Search with Distributional Predictions, by Michael Dinitz et al.
-
Summary of Predicting Emergent Capabilities by Finetuning, By Charlie Snell et al.
-
Summary of Vicon: a Foundation Model For Multi-physics Fluid Dynamics Via Vision In-context Operator Networks, by Yadi Cao et al.
-
Summary of Soft-transformers For Continual Learning, by Haeyong Kang et al.
-
Summary of Boosting 3d Object Generation Through Pbr Materials, by Yitong Wang et al.
-
Summary of Cautious Optimizers: Improving Training with One Line Of Code, by Kaizhao Liang et al.
-
Summary of Exploring the Generalization Capabilities Of Aid-based Bi-level Optimization, by Congliang Chen et al.
-
Summary of Very Basics Of Tensors with Graphical Notations: Unfolding, Calculations, and Decompositions, by Tatsuya Yokota
-
Summary of Blendserve: Optimizing Offline Inference For Auto-regressive Large Models with Resource-aware Batching, by Yilong Zhao et al.
-
Summary of Ldacp: Long-delayed Ad Conversions Prediction Model For Bidding Strategy, by Peng Cui (1) et al.
-
Summary of Fun-ad: Fully Unsupervised Learning For Anomaly Detection with Noisy Training Data, by Jiin Im et al.