Paper List
We recommend you use the search box as this list is very long.
-
Summary of Xgen-mm-vid (blip-3-video): You Only Need 32 Tokens to Represent a Video Even in Vlms, by Michael S. Ryoo et al.
-
Summary of Limit Theorems For Stochastic Gradient Descent with Infinite Variance, by Jose Blanchet et al.
-
Summary of Exploring How Deep Learning Decodes Anomalous Diffusion Via Grad-cam, by Jaeyong Bae et al.
-
Summary of 1024m at Smm4h 2024: Tasks 3, 5 & 6 — Ensembles Of Transformers and Large Language Models For Medical Text Classification, by Ram Mohan Rao Kadiyala et al.
-
Summary of Massimo: Public Queue Monitoring and Management Using Mass-spring Model, by Abhijeet Kumar et al.
-
Summary of Information-theoretic Minimax Regret Bounds For Reinforcement Learning Based on Duality, by Raghav Bongole et al.
-
Summary of Timemixer++: a General Time Series Pattern Machine For Universal Predictive Analysis, by Shiyu Wang et al.
-
Summary of Natural Galore: Accelerating Galore For Memory-efficient Llm Training and Fine-tuning, by Arijit Das
-
Summary of Treebon: Enhancing Inference-time Alignment with Speculative Tree-search and Best-of-n Sampling, by Jiahao Qiu et al.
-
Summary of Near-optimal Algorithm For Non-stationary Kernelized Bandits, by Shogo Iwazaki and Shion Takeno
-
Summary of On the Geometry Of Regularization in Adversarial Training: High-dimensional Asymptotics and Generalization Bounds, by Matteo Vilucchio et al.
-
Summary of Cartesianmoe: Boosting Knowledge Sharing Among Experts Via Cartesian Product Routing in Mixture-of-experts, by Zhenpeng Su et al.
-
Summary of Exdbn: Exact Learning Of Dynamic Bayesian Networks, by Pavel Rytir et al.
-
Summary of Ldadam: Adaptive Optimization From Low-dimensional Gradient Statistics, by Thomas Robert and Mher Safaryan and Ionut-vlad Modoranu and Dan Alistarh
-
Summary of Addressing Spectral Bias Of Deep Neural Networks by Multi-grade Deep Learning, By Ronglong Fang and Yuesheng Xu
-
Summary of Statistical Inference For Temporal Difference Learning with Linear Function Approximation, by Weichen Wu et al.
-
Summary of Interpreting Microbiome Relative Abundance Data Using Symbolic Regression, by Swagatam Haldar et al.
-
Summary of Seadag: Semi-autoregressive Diffusion For Conditional Directed Acyclic Graph Generation, by Xinyi Zhou et al.
-
Summary of Extracting Spatiotemporal Data From Gradients with Large Language Models, by Lele Zheng et al.
-
Summary of Mnist-nd: a Set Of Naturalistic Datasets to Benchmark Clustering Across Dimensions, by Polina Turishcheva et al.
-
Summary of Smart: Self-learning Meta-strategy Agent For Reasoning Tasks, by Rongxing Liu et al.
-
Summary of Beyond 2:4: Exploring V:n:m Sparsity For Efficient Transformer Inference on Gpus, by Kang Zhao et al.
-
Summary of Theoretical Insights Into Line Graph Transformation on Graph Learning, by Fan Yang and Xingyue Huang
-
Summary of Limtr: Time Series Motion Prediction For Diverse Road Users Through Multimodal Feature Integration, by Camiel Oerlemans et al.
-
Summary of Random Token Fusion For Multi-view Medical Diagnosis, by Jingyu Guo and Christos Matsoukas and Fredrik Strand and Kevin Smith
-
Summary of Explainability Of Highly Associated Fuzzy Churn Patterns in Binary Classification, by D.y.c. Wang et al.
-
Summary of Focus Where It Matters: Graph Selective State Focused Attention Networks, by Shikhar Vashistha et al.
-
Summary of Towards Optimal Adapter Placement For Efficient Transfer Learning, by Aleksandra I. Nowak et al.
-
Summary of Mesa-extrapolation: a Weave Position Encoding Method For Enhanced Extrapolation in Llms, by Xin Ma et al.
-
Summary of Enabling Asymmetric Knowledge Transfer in Multi-task Learning with Self-auxiliaries, by Olivier Graffeuille et al.
-
Summary of Distributed Learning For Uav Swarms, by Chen Hu et al.
-
Summary of Flickerfusion: Intra-trajectory Domain Generalizing Multi-agent Rl, by Woosung Koh et al.
-
Summary of Using Gpt Models For Qualitative and Quantitative News Analytics in the 2024 Us Presidental Election Process, by Bohdan M. Pavlyshenko
-
Summary of Diverse Policies Recovering Via Pointwise Mutual Information Weighted Imitation Learning, by Hanlin Yang et al.
-
Summary of Model Mimic Attack: Knowledge Distillation For Provably Transferable Adversarial Examples, by Kirill Lukyanov et al.
-
Summary of Grefel: Geometry-aware Reliable Facial Expression Learning Under Bias and Imbalanced Data Distribution, by Azmine Toushik Wasi and Taki Hasan Rafi and Raima Islam and Karlo Serbetar and Dong Kyu Chae
-
Summary of Karush-kuhn-tucker Condition-trained Neural Networks (kkt Nets), by Shreya Arvind et al.
-
Summary of Large Language Models For Cross-lingual Emotion Detection, by Ram Mohan Rao Kadiyala
-
Summary of Augmenting Legal Decision Support Systems with Llm-based Nli For Analyzing Social Media Evidence, by Ram Mohan Rao Kadiyala et al.
-
Summary of Robust Visual Representation Learning with Multi-modal Prior Knowledge For Image Classification Under Distribution Shift, by Hongkuan Zhou et al.
-
Summary of Multirc: Joint Learning For Time Series Anomaly Prediction and Detection with Multi-scale Reconstructive Contrast, by Shiyan Hu et al.
-
Summary of Calibration Of Ordinal Regression Networks, by Daehwan Kim et al.
-
Summary of Scalable Data Ablation Approximations For Language Models Through Modular Training and Merging, by Clara Na and Ian Magnusson and Ananya Harsh Jha and Tom Sherborne and Emma Strubell and Jesse Dodge and Pradeep Dasigi
-
Summary of Long Term Memory: the Foundation Of Ai Self-evolution, by Xun Jiang et al.
-
Summary of Federated Learning with Mmd-based Early Stopping For Adaptive Gnss Interference Classification, by Nishant S. Gaikwad and Lucas Heublein and Nisha L. Raichur and Tobias Feigl and Christopher Mutschler and Felix Ott
-
Summary of Rac: Efficient Llm Factuality Correction with Retrieval Augmentation, by Changmao Li and Jeffrey Flanigan
-
Summary of Enhancing Snn-based Spatio-temporal Learning: a Benchmark Dataset and Cross-modality Attention Model, by Shibo Zhou et al.
-
Summary of Residual Vector Quantization For Kv Cache Compression in Large Language Model, by Ankur Kumar
-
Summary of Solving Continual Offline Rl Through Selective Weights Activation on Aligned Spaces, by Jifeng Hu et al.
-
Summary of Estimating Individual Dose-response Curves Under Unobserved Confounders From Observational Data, by Shutong Chen and Yang Li
-
Summary of Offline Reinforcement Learning For Job-shop Scheduling Problems, by Imanol Echeverria et al.
-
Summary of Traffic Matrix Estimation Based on Denoising Diffusion Probabilistic Model, by Xinyu Yuan et al.
-
Summary of A Two-stage Learning-to-defer Approach For Multi-task Learning, by Yannis Montreuil et al.
-
Summary of S-cfe: Simple Counterfactual Explanations, by Shpresim Sadiku et al.
-
Summary of Object-centric Temporal Consistency Via Conditional Autoregressive Inductive Biases, by Cristian Meo et al.
-
Summary of Deepvigor+: Scalable and Accurate Semi-analytical Fault Resilience Analysis For Deep Neural Network, by Mohammad Hasan Ahmadilivani et al.
-
Summary of Optimal Query Allocation in Extractive Qa with Llms: a Learning-to-defer Framework with Theoretical Guarantees, by Yannis Montreuil et al.
-
Summary of Mislabeled Examples Detection Viewed As Probing Machine Learning Models: Concepts, Survey and Extensive Benchmark, by Thomas George et al.
-
Summary of Reducing Hallucinations in Vision-language Models Via Latent Space Steering, by Sheng Liu et al.
-
Summary of On the Vc Dimension Of Deep Group Convolutional Neural Networks, by Anna Sepliarskaia et al.
-
Summary of How to Find the Exact Pareto Front For Multi-objective Mdps?, by Yining Li et al.
-
Summary of Reward Maximization For Pure Exploration: Minimax Optimal Good Arm Identification For Nonparametric Multi-armed Bandits, by Brian Cho et al.
-
Summary of Stacking Small Language Models For Generalizability, by Laurence Liang
-
Summary of Pruning Foundation Models For High Accuracy Without Retraining, by Pu Zhao et al.
-
Summary of Generalized Probabilistic Attention Mechanism in Transformers, by Dongnyeong Heo and Heeyoul Choi
-
Summary of Language Models Are Symbolic Learners in Arithmetic, by Chunyuan Deng et al.
-
Summary of Multimodal Learning For Embryo Viability Prediction in Clinical Ivf, by Junsik Kim et al.
-
Summary of A Comprehensive Survey Of Direct Preference Optimization: Datasets, Theories, Variants, and Applications, by Wenyi Xiao et al.
-
Summary of All You Need Is An Improving Column: Enhancing Column Generation For Parallel Machine Scheduling Via Transformers, by Amira Hijazi et al.
-
Summary of On the Global Convergence Of Online Rlhf with Neural Parametrization, by Mudit Gaur et al.
-
Summary of Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation, by Anh Bui et al.
-
Summary of In-trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates, by Shicheng Liu et al.
-
Summary of Test-time Adaptation For Cross-modal Retrieval with Query Shift, by Haobin Li et al.
-
Summary of Deep Graph Attention Networks, by Jun Kato et al.
-
Summary of Improving Parallel Program Performance with Llm Optimizers Via Agent-system Interface, by Anjiang Wei et al.
-
Summary of Large Deviations and Improved Mean-squared Error Rates Of Nonlinear Sgd: Heavy-tailed Noise and Power Of Symmetry, by Aleksandar Armacki et al.
-
Summary of Linking Model Intervention to Causal Interpretation in Model Explanation, by Debo Cheng et al.
-
Summary of Understanding and Alleviating Memory Consumption in Rlhf For Llms, by Jin Zhou et al.
-
Summary of Accounting For Missing Covariates in Heterogeneous Treatment Estimation, by Khurram Yamin et al.
-
Summary of Accelerated Sub-image Search For Variable-size Patches Identification Based on Virtual Time Series Transformation and Segmentation, by Mogens Plessen
-
Summary of Power Plays: Unleashing Machine Learning Magic in Smart Grids, by Abdur Rashid et al.
-
Summary of Data Augmentation Via Diffusion Model to Enhance Ai Fairness, by Christina Hastings Blow et al.
-
Summary of Generative Models, Humans, Predictive Models: Who Is Worse at High-stakes Decision Making?, by Keri Mallari and Julius Adebayo and Kori Inkpen and Martin T. Wells and Albert Gordo and Sarah Tan
-
Summary of Multi-layer Feature Fusion with Cross-channel Attention-based U-net For Kidney Tumor Segmentation, by Fnu Neha et al.
-
Summary of A Bayesian Framework For Clustered Federated Learning, by Peng Wu et al.
-
Summary of Mitigating Forgetting in Llm Supervised Fine-tuning and Preference Learning, by Heshan Fernando et al.
-
Summary of Optimizing Backward Policies in Gflownets Via Trajectory Likelihood Maximization, by Timofei Gritsaev et al.
-
Summary of Reinforcement Learning For Dynamic Memory Allocation, by Arisrei Lim et al.
-
Summary of Structural Causality-based Generalizable Concept Discovery Models, by Sanchit Sinha et al.
-
Summary of Sea: State-exchange Attention For High-fidelity Physics Based Transformers, by Parsa Esmati et al.
-
Summary of Exploring Curriculum Learning For Vision-language Tasks: a Study on Small-scale Multimodal Training, by Rohan Saha et al.
-
Summary of M-rewardbench: Evaluating Reward Models in Multilingual Settings, by Srishti Gureja et al.
-
Summary of Mira: a Method Of Federated Multi-task Learning For Large Language Models, by Ahmed Elbakary et al.
-
Summary of Sdp4bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism For Llm Training, by Jinda Jia et al.
-
Summary of Grammatical Error Correction For Low-resource Languages: the Case Of Zarma, by Mamadou K. Keita et al.
-
Summary of Distributed Thompson Sampling Under Constrained Communication, by Saba Zerefa et al.