Paper List
We recommend you use the search box as this list is very long.
-
Summary of Uncertainty-aware Reward-free Exploration with General Function Approximation, by Junkai Zhang and Weitong Zhang and Dongruo Zhou and Quanquan Gu
-
Summary of Towards Scalable Exact Machine Unlearning Using Parameter-efficient Fine-tuning, by Somnath Basu Roy Chowdhury et al.
-
Summary of Reducing Fine-tuning Memory Overhead by Approximate and Memory-sharing Backpropagation, by Yuchen Yang et al.
-
Summary of Relaxing Continuous Constraints Of Equivariant Graph Neural Networks For Physical Dynamics Learning, by Zinan Zheng et al.
-
Summary of Privacy Preserving Machine Learning For Electronic Health Records Using Federated Learning and Differential Privacy, by Naif A. Ganadily et al.
-
Summary of Recall: Membership Inference Via Relative Conditional Log-likelihoods, by Roy Xie et al.
-
Summary of Evcl: Elastic Variational Continual Learning with Weight Consolidation, by Hunar Batra et al.
-
Summary of Learning with Noisy Ground Truth: From 2d Classification to 3d Reconstruction, by Yangdi Lu et al.
-
Summary of Bounding-box Inference For Error-aware Model-based Reinforcement Learning, by Erin J. Talvitie et al.
-
Summary of Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization, by Cheng-yu Hsieh et al.
-
Summary of Timeautodiff: Combining Autoencoder and Diffusion Model For Time Series Tabular Data Synthesizing, by Namjoon Suh et al.
-
Summary of Effect Of Random Learning Rate: Theoretical Analysis Of Sgd Dynamics in Non-convex Optimization Via Stationary Distribution, by Naoki Yoshida et al.
-
Summary of Meta-fl: a Novel Meta-learning Framework For Optimizing Heterogeneous Model Aggregation in Federated Learning, by Zahir Alsulaimawi
-
Summary of Combine and Conquer: a Meta-analysis on Data Shift and Out-of-distribution Detection, by Eduardo Dadalto et al.
-
Summary of Pivotal Auto-encoder Via Self-normalizing Relu, by Nelson Goldenstein et al.
-
Summary of Port: Preference Optimization on Reasoning Traces, by Salem Lahlou et al.
-
Summary of Detecting Abnormal Operations in Concentrated Solar Power Plants From Irregular Sequences Of Thermal Images, by Sukanya Patra et al.
-
Summary of Diffusion Spectral Representation For Reinforcement Learning, by Dmitry Shribak et al.
-
Summary of Monte Carlo Planning For Stochastic Control on Constrained Markov Decision Processes, by Larkin Liu et al.
-
Summary of An All-mlp Sequence Modeling Architecture That Excels at Copying, by Chenwei Cui et al.
-
Summary of Grapheval36k: Benchmarking Coding and Reasoning Capabilities Of Large Language Models on Graph Datasets, by Qiming Wu et al.
-
Summary of Evaluation and Comparison Of Emotionally Evocative Image Augmentation Methods, by Jan Ignatowicz et al.
-
Summary of Accelerating Matrix Diagonalization Through Decision Transformers with Epsilon-greedy Optimization, by Kshitij Bhatta et al.
-
Summary of Synergistic Deep Graph Clustering Network, by Benyu Wu et al.
-
Summary of Lamsum: Amplifying Voices Against Harassment Through Llm Guided Extractive Summarization Of User Incident Reports, by Garima Chhikara et al.
-
Summary of Intrinsic Dimension Correlation: Uncovering Nonlinear Connections in Multimodal Representations, by Lorenzo Basile et al.
-
Summary of Automatic Ai Model Selection For Wireless Systems: Online Learning Via Digital Twinning, by Qiushuo Hou et al.
-
Summary of Decentralized Transformers with Centralized Aggregation Are Sample-efficient Multi-agent World Models, by Yang Zhang et al.
-
Summary of The Effect Of Similarity Measures on Accurate Stability Estimates For Local Surrogate Models in Text-based Explainable Ai, by Christopher Burger et al.
-
Summary of Learning Abstract World Model For Value-preserving Planning with Options, by Rafael Rodriguez-sanchez and George Konidaris
-
Summary of Next Level Message-passing with Hierarchical Support Graphs, by Carlos Vonessen et al.
-
Summary of Injectivity Of Relu-layers: Tools From Frame Theory, by Daniel Haider and Martin Ehler and Peter Balazs
-
Summary of Real-time Speech Summarization For Medical Conversations, by Khai Le-duc et al.
-
Summary of Fast Tree-field Integrators: From Low Displacement Rank to Topological Transformers, by Krzysztof Choromanski et al.
-
Summary of Language Alignment Via Nash-learning and Adaptive Feedback, by Ari Azarafrooz et al.
-
Summary of Learning When the Concept Shifts: Confounding, Invariance, and Dimension Reduction, by Kulunu Dharmakeerthi et al.
-
Summary of Credit Attribution and Stable Compression, by Roi Livni et al.
-
Summary of Semantic Entropy Probes: Robust and Cheap Hallucination Detection in Llms, by Jannik Kossen et al.
-
Summary of Ruler: Improving Llm Controllability by Rule-based Data Recycling, by Ming Li et al.
-
Summary of Beyond Individual Facts: Investigating Categorical Knowledge Locality Of Taxonomy and Meronomy Concepts in Gpt Models, by Christopher Burger et al.
-
Summary of Towards Exact Computation Of Inductive Bias, by Akhilan Boopathy and William Yue and Jaedong Hwang and Abhiram Iyer and Ila Fiete
-
Summary of Optimizing Lanesegnet For Real-time Lane Topology Prediction in Autonomous Vehicles, by William Stevens et al.
-
Summary of Fair Clustering: Critique, Caveats, and Future Directions, by John Dickerson et al.
-
Summary of Datafreeshield: Defending Adversarial Attacks Without Training Data, by Hyeyoon Lee et al.
-
Summary of The Stochastic Occupation Kernel Method For System Identification, by Michael Wells et al.
-
Summary of Contextual Sprint Classification in Soccer Based on Deep Learning, by Hyunsung Kim et al.
-
Summary of Matching Problems to Solutions: An Explainable Way Of Solving Machine Learning Problems, by Lokman Saleh et al.
-
Summary of Flat Posterior Does Matter For Bayesian Model Averaging, by Sungjun Lim et al.
-
Summary of Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization, by Xingchen Wan et al.
-
Summary of Evaluating Large Vision-and-language Models on Children’s Mathematical Olympiads, by Anoop Cherian et al.
-
Summary of Ladder: a Model-agnostic Framework Boosting Llm-based Machine Translation to the Next Level, by Zhaopeng Feng et al.
-
Summary of Modeling Unknown Stochastic Dynamical System Subject to External Excitation, by Yuan Chen et al.
-
Summary of Multimodal Segmentation For Vocal Tract Modeling, by Rishi Jain et al.
-
Summary of The Perils Of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret, by Lukas Fluri et al.
-
Summary of Edge-llm: Enabling Efficient Large Language Model Adaptation on Edge Devices Via Layerwise Unified Compression and Adaptive Layer Tuning and Voting, by Zhongzhi Yu et al.
-
Summary of Icm Ensemble with Novel Betting Functions For Concept Drift, by Charalambos Eliades and Harris Papadopoulos
-
Summary of Rethinking the Diffusion Models For Numerical Tabular Data Imputation From the Perspective Of Wasserstein Gradient Flow, by Zhichao Chen et al.
-
Summary of Allmatch: Exploiting All Unlabeled Data For Semi-supervised Learning, by Zhiyu Wu et al.
-
Summary of Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration, by Zhongzhi Yu et al.
-
Summary of Continual Learning with Diffusion-based Generative Replay For Industrial Streaming Data, by Jiayi He et al.
-
Summary of What Matters in Transformers? Not All Attention Is Needed, by Shwai He et al.
-
Summary of Distributionally Robust Constrained Reinforcement Learning Under Strong Duality, by Zhengfei Zhang et al.
-
Summary of Privacy Implications Of Explainable Ai in Data-driven Systems, by Fatima Ezzeddine
-
Summary of Steering Without Side Effects: Improving Post-deployment Control Of Language Models, by Asa Cooper Stickland et al.
-
Summary of Multi-view Empowered Structural Graph Wordification For Language Models, by Zipeng Liu et al.
-
Summary of Few-shot Knowledge Graph Relational Reasoning Via Subgraph Adaptation, by Haochen Liu et al.
-
Summary of Unifying Unsupervised Graph-level Anomaly Detection and Out-of-distribution Detection: a Benchmark, by Yili Wang et al.
-
Summary of Rethinking Pruning Large Language Models: Benefits and Pitfalls Of Reconstruction Error Minimization, by Sungbin Shin et al.
-
Summary of Data Efficient Evaluation Of Large Language Models and Text-to-image Models Via Adaptive Sampling, by Cong Xu et al.
-
Summary of Geneverse: a Collection Of Open-source Multimodal Large Language Models For Genomic and Proteomic Research, by Tianyu Liu et al.
-
Summary of Unseen Object Reasoning with Shared Appearance Cues, by Paridhi Singh et al.
-
Summary of Sail: Self-improving Efficient Online Alignment Of Large Language Models, by Mucong Ding et al.
-
Summary of Dem: Distribution Edited Model For Training with Mixed Data Distributions, by Dhananjay Ram et al.
-
Summary of Robust Reinforcement Learning From Corrupted Human Feedback, by Alexander Bukharin et al.
-
Summary of Sketch-gnn: Scalable Graph Neural Networks with Sublinear Training Complexity, by Mucong Ding et al.
-
Summary of Pareto-optimal Learning From Preferences with Hidden Context, by Ryan Bahlous-boldi et al.
-
Summary of Catastrophic-risk-aware Reinforcement Learning with Extreme-value-theory-based Policy Gradients, by Parisa Davar et al.
-
Summary of Brownne: Brownian Nonlocal Neurons & Activation Functions, by Sriram Nagaraj and Truman Hickok
-
Summary of Mountaineer: Topology-driven Visual Analytics For Comparing Local Explanations, by Parikshit Solunke et al.
-
Summary of Physics Informed Machine Learning (piml) Methods For Estimating the Remaining Useful Lifetime (rul) Of Aircraft Engines, by Sriram Nagaraj and Truman Hickok
-
Summary of Shortcomings Of Llms For Low-resource Translation: Retrieval and Understanding Are Both the Problem, by Sara Court and Micha Elsner
-
Summary of Benchmarking Uncertainty Quantification Methods For Large Language Models with Lm-polygraph, by Roman Vashurin et al.
-
Summary of Testing the Feasibility Of Linear Programs with Bandit Feedback, by Aditya Gangrade et al.
-
Summary of Learning Spatio-temporal Patterns Of Polar Ice Layers with Physics-informed Graph Neural Network, by Zesheng Liu et al.
-
Summary of Advanced Multimodal Deep Learning Architecture For Image-text Matching, by Jinyin Wang et al.
-
Summary of Fine-grained Attention in Hierarchical Transformers For Tabular Time-series, by Raphael Azorin et al.
-
Summary of Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning, by Brandon Huang et al.
-
Summary of Masked Extended Attention For Zero-shot Virtual Try-on in the Wild, by Nadav Orzech et al.
-
Summary of Genotex: a Benchmark For Evaluating Llm-based Exploration Of Gene Expression Data in Alignment with Bioinformaticians, by Haoyang Liu et al.
-
Summary of Privacy Preserved Blood Glucose Level Cross-prediction: An Asynchronous Decentralized Federated Learning Approach, by Chengzhe Piao et al.
-
Summary of Feature Purified Transformer with Cross-level Feature Guiding Decoder For Multi-class Ood and Anomaly Detection, by Jerry Chun-wei Lin et al.
-
Summary of Navsim: Data-driven Non-reactive Autonomous Vehicle Simulation and Benchmarking, by Daniel Dauner et al.
-
Summary of Deep Vision-based Framework For Coastal Flood Prediction Under Climate Change Impacts and Shoreline Adaptations, by Areg Karapetyan et al.
-
Summary of Mmlu-sr: a Benchmark For Stress-testing Reasoning Capability Of Large Language Models, by Wentian Wang et al.
-
Summary of Improving Large Models with Small Models: Lower Costs and Better Performance, by Dong Chen et al.
-
Summary of Hyperbolic Sentence Representations For Solving Textual Entailment, by Igor Petrovski
-
Summary of On Giant’s Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion, by Chenghao Fan et al.
-
Summary of Twin-merging: Dynamic Integration Of Modular Expertise in Model Merging, by Zhenyi Lu et al.
-
Summary of Duplicate Detection with Genai, by Ian Ormesher