Paper List
We recommend you use the search box as this list is very long.
-
Summary of Linear Bellman Completeness Suffices For Efficient Online Reinforcement Learning with Few Actions, by Noah Golowich and Ankur Moitra
-
Summary of Diffusion Generative Modelling For Divide-and-conquer MCMC, by C. Trojan et al.
-
Summary of Blob: Bayesian Low-rank Adaptation by Backpropagation For Large Language Models, by Yibin Wang et al.
-
Summary of Is Efficient PAC Learning Possible with An Oracle That Responds ‘yes’ or ‘no’?, by Constantinos Daskalakis and Noah Golowich
-
Summary of Deep-reinforcement-learning-based AoI-aware Resource Allocation For RIS-aided IoV Networks, by Kangwei Qi et al.
-
Summary of Spot-mamba: Learning Long-range Dependency on Spatio-temporal Graphs with Selective State Spaces, by Jinhyeok Choi et al.
-
Summary of Excp: Extreme LLM Checkpoint Compression Via Weight-momentum Joint Shrinking, by Wenshuo Li et al.
-
Summary of Mint-1t: Scaling Open-source Multimodal Data by 10x: a Multimodal Dataset with One Trillion Tokens, by Anas Awadalla et al.
-
Summary of Statistical Learning Of Distributionally Robust Stochastic Control in Continuous State Spaces, by Shengbo Wang et al.
-
Summary of Enhancing and Assessing Instruction-following with Fine-grained Instruction Variants, by Jiuding Yang et al.
-
Summary of Management Decisions in Manufacturing Using Causal Machine Learning — to Rework, or Not to Rework?, by Philipp Schwarz et al.
-
Summary of Federated Active Learning Framework For Efficient Annotation Strategy in Skin-lesion Classification, by Zhipeng Deng et al.
-
Summary of Improved Algorithms For Contextual Dynamic Pricing, by Matilde Tullii et al.
-
Summary of Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments, by Han Zhou et al.
-
Summary of Cm2-net: Continual Cross-modal Mapping Network For Driver Action Recognition, by Ruoyu Wang et al.
-
Summary of They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias, by Salma Abdel Magid et al.
-
Summary of Sefraud: Graph-based Self-explainable Fraud Detection Via Interpretative Mask Learning, by Kaidi Li et al.
-
Summary of P-ta: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation Via Large Language Models, by Shuo Yang et al.
-
Summary of Distpred: a Distribution-free Probabilistic Inference Method For Regression and Forecasting, by Daojun Liang et al.
-
Summary of Are Small Language Models Ready to Compete with Large Language Models For Practical Applications?, by Neelabh Sinha et al.
-
Summary of Cross-domain Open-world Discovery, by Shuo Wen and Maria Brbic
-
Summary of Analysing the Behaviour Of Tree-based Neural Networks in Regression Tasks, by Peter Samoaa et al.
-
Summary of Calibrating Where It Matters: Constrained Temperature Scaling, by Stephen McKenna and Jacob Carse
-
Summary of Adversaries with Incentives: a Strategic Alternative to Adversarial Robustness, by Maayan Ehrenberg et al.
-
Summary of Leveraging Foundation Models For Multi-modal Federated Learning with Incomplete Modality, by Liwei Che et al.
-
Summary of Advancing Solar Flare Prediction Using Deep Learning with Active Region Patches, by Chetraj Pandey et al.
-
Summary of Investigating Annotator Bias in Large Language Models For Hate Speech Detection, by Amit Das et al.
-
Summary of Model Adaptation For Time Constrained Embodied Control, by Jaehyun Song et al.
-
Summary of How Neural Networks Learn the Support Is An Implicit Regularization Effect Of SGD, by Pierfrancesco Beneventano et al.
-
Summary of Reprompt: Planning by Automatic Prompt Engineering For Large Language Models Agents, by Weizhe Chen et al.
-
Summary of Active Search For Bifurcations, by Yorgos M. Psarellis et al.
-
Summary of Few-shot Recognition Via Stage-wise Retrieval-augmented Finetuning, by Tian Liu et al.
-
Summary of Distributed Stochastic Gradient Descent with Staleness: a Stochastic Delay Differential Equation Based Framework, by Siyuan Yu et al.
-
Summary of Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement, by Weimin Xiong et al.
-
Summary of Sugarcrepe++ Dataset: Vision-language Model Sensitivity to Semantic and Lexical Alterations, by Sri Harsha Dumpala et al.
-
Summary of Learning Iterative Reasoning Through Energy Diffusion, by Yilun Du et al.
-
Summary of Save It All: Enabling Full Parameter Tuning For Federated Large Language Models Via Cycle Block Gradient Descent, by Lin Wang et al.
-
Summary of Avatar: Optimizing LLM Agents For Tool Usage Via Contrastive Reasoning, by Shirley Wu et al.
-
Summary of Retraining with Predicted Hard Labels Provably Increases Model Accuracy, by Rudrajit Das et al.
-
Summary of Multimodal Needle in a Haystack: Benchmarking Long-context Capability Of Multimodal Large Language Models, by Hengyi Wang et al.
-
Summary of Qtip: Quantization with Trellises and Incoherence Processing, by Albert Tseng et al.
-
Summary of Probing the Decision Boundaries Of In-context Learning in Large Language Models, by Siyan Zhao et al.
-
Summary of The Benefits Of Power Regularization in Cooperative Reinforcement Learning, by Michelle Li and Michael Dennis
-
Summary of Relational Learning in Pre-trained Models: a Theory From Hypergraph Recovery Perspective, by Yang Chen et al.
-
Summary of New Solutions on LLM Acceleration, Optimization, and Application, by Yingbing Huang et al.
-
Summary of Multi-LLM QA with Embodied Exploration, by Bhrij Patel et al.
-
Summary of Bayesian Intervention Optimization For Causal Discovery, by Yuxuan Wang et al.
-
Summary of Investigating Video Reasoning Capability Of Large Language Models with Tropes in Movies, by Hung-ting Su et al.
-
Summary of Understanding Understanding: a Pragmatic Framework Motivated by Large Language Models, by Kevin Leyton-Brown and Yoav Shoham
-
Summary of Effective Generative AI: the Human-algorithm Centaur, by Soroush Saghafian et al.
-
Summary of Towards Efficient Target-level Machine Unlearning Based on Essential Graph, by Heng Xu et al.
-
Summary of Incorporating Uncertainty Quantification Into Travel Mode Choice Modeling: a Bayesian Neural Network (BNN) Approach and An Uncertainty-guided Active Survey Framework, by Shuwen Zheng et al.
-
Summary of Costa: Code-switched Speech Translation Using Aligned Speech-text Interleaving, by Bhavani Shankar et al.
-
Summary of Promoting Data and Model Privacy in Federated Learning Through Quantized LoRA, by Jianhao Zhu et al.
-
Summary of Weshap: Weak Supervision Source Evaluation with Shapley Values, by Naiqing Guan and Nick Koudas
-
Summary of Concept-skill Transferability-based Data Selection For Large Vision-language Models, by Jaewoo Lee et al.
-
Summary of Data Shapley in One Training Run, by Jiachen T. Wang et al.
-
Summary of Latent Communication in Artificial Neural Networks, by Luca Moschella
-
Summary of Universal Cross-lingual Text Classification, by Riya Savant et al.
-
Summary of Optimized Speculative Sampling For GPU Hardware Accelerators, by Dominik Wagner et al.
-
Summary of Curating Stopwords in Marathi: a TF-IDF Approach For Improved Text Analysis and Information Retrieval, by Rohan Chavan et al.
-
Summary of Evaluating the Performance Of Large Language Models Via Debates, by Behrad Moniri et al.
-
Summary of Kolmogorov Arnold Informed Neural Network: a Physics-informed Deep Learning Framework For Solving Forward and Inverse Problems Based on Kolmogorov Arnold Networks, by Yizheng Wang et al.
-
Summary of Guaranteed Sampling Flexibility For Low-tubal-rank Tensor Completion, by Bowen Su et al.
-
Summary of Improving Reward-conditioned Policies For Multi-armed Bandits Using Normalized Weight Functions, by Kai Xu et al.
-
Summary of Bayesian Networks and Machine Learning For COVID-19 Severity Explanation and Demographic Symptom Classification, by Oluwaseun T. Ajayi et al.
-
Summary of On the Effectiveness Of Supervision in Asymmetric Non-contrastive Learning, by Jeongheon Oh et al.
-
Summary of Improving Probabilistic Diffusion Models with Optimal Diagonal Covariance Matching, by Zijing Ou et al.
-
Summary of Cbgbench: Fill in the Blank Of Protein-molecule Complex Binding Graph, by Haitao Lin et al.
-
Summary of Exposing the Achilles’ Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning, by Joykirat Singh et al.
-
Summary of Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions, by Laiqiao Qin et al.
-
Summary of Enriching the Machine Learning Workloads in Bigbench, by Matthias Polag et al.
-
Summary of Geometric-informed Gflownets For Structure-based Drug Design, by Grayson Lee et al.
-
Summary of Global-local Graph Neural Networks For Node-classification, by Moshe Eliasof et al.
-
Summary of Graph Neural Reaction Diffusion Models, by Moshe Eliasof et al.
-
Summary of Deep Neural Networks with ReLU, Leaky ReLU, and Softplus Activation Provably Overcome the Curse Of Dimensionality For Space-time Solutions Of Semilinear Partial Differential Equations, by Julia Ackermann et al.
-
Summary of Linkage on Security, Privacy and Fairness in Federated Learning: New Balances and New Perspectives, by Linlin Wang et al.
-
Summary of Distilling Opinions at Scale: Incremental Opinion Summarization Using Xl-opsumm, by Sri Raghava Muddu et al.
-
Summary of Rwku: Benchmarking Real-world Knowledge Unlearning For Large Language Models, by Zhuoran Jin et al.
-
Summary of Velociti: Can Video-language Models Bind Semantic Concepts Through Time?, by Darshana Saravanan et al.
-
Summary of Dipper: Direct Preference Optimization to Accelerate Primitive-enabled Hierarchical Reinforcement Learning, by Utsav Singh et al.
-
Summary of Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters, by Eden Grad et al.
-
Summary of Breaking the Attention Bottleneck, by Kalle Hilsenbek
-
Summary of First-order Manifold Data Augmentation For Regression Learning, by Ilya Kaufman and Omri Azencot
-
Summary of Last-iterate Convergence Separation Between Extra-gradient and Optimism in Constrained Periodic Games, by Yi Feng et al.
-
Summary of Hifgl: a Hierarchical Framework For Cross-silo Cross-device Federated Graph Learning, by Zhuoning Guo et al.
-
Summary of The Implicit Bias Of Adam on Separable Data, by Chenyang Zhang et al.
-
Summary of A GPU-accelerated Large-scale Simulator For Transportation System Optimization Benchmarking, by Jun Zhang et al.
-
Summary of Unizero: Generalized and Efficient Planning with Scalable Latent World Models, by Yuan Pu, Yazhe Niu, Zhenjie Yang, Jiyuan Ren, Hongsheng Li, and Yu Liu
-
Summary of Color-filter: Conditional Loss Reduction Filtering For Targeted Language Model Pre-training, by David Brandfonbrener et al.
-
Summary of Scale Equivariant Graph Metanetworks, by Ioannis Kalogeropoulos et al.
-
Summary of Graph Neural Thompson Sampling, by Shuang Wu et al.
-
Summary of Calibrating Neural Networks’ Parameters Through Optimal Contraction in a Prediction Problem, by Valdes Gonzalo
-
Summary of Genmm: Geometrically and Temporally Consistent Multimodal Data Generation For Video and LiDAR, by Bharat Singh et al.
-
Summary of Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights, by Zhikai Chen et al.
-
Summary of A Comprehensive Survey Of Foundation Models in Medicine, by Wasif Khan et al.
-
Summary of Dpcore: Dynamic Prompt Coreset For Continual Test-time Adaptation, by Yunbei Zhang et al.
-
Summary of Adaptive Experimentation When You Can’t Experiment, by Yao Zhao et al.
-
Summary of Occam’s Razor For Self Supervised Learning: What Is Sufficient to Learn Good Representations?, by Mark Ibrahim et al.