Paper List
-
Summary of Cliqueformer: Model-based Optimization with Structured Transformers, by Jakub Grudzien Kuba et al.
-
Summary of A Little Human Data Goes a Long Way, by Dhananjay Ashok and Jonathan May
-
Summary of Communication-efficient and Tensorized Federated Fine-tuning Of Large Language Models, by Sajjad Ghiasvand et al.
-
Summary of In-context KV-cache Eviction For LLMs Via Attention-gate, by Zihao Zeng et al.
-
Summary of Improving Instruction-following in Language Models Through Activation Steering, by Alessandro Stolfo et al.
-
Summary of Towards More Effective Table-to-text Generation: Assessing In-context Learning and Self-evaluation with Open-source Models, by Sahar Iravani et al.
-
Summary of AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative Reasoning, by Mohammad Reza Rezaei et al.
-
Summary of Scaling Laws For Multilingual Language Models, by Yifei He et al.
-
Summary of Fair Clustering For Data Summarization: Improved Approximation Algorithms and Complexity Insights, by Ameet Gadekar et al.
-
Summary of Credal Two-sample Tests Of Epistemic Uncertainty, by Siu Lun Chau et al.
-
Summary of SoK: On Finding Common Ground in Loss Landscapes Using Deep Model Merging Techniques, by Arham Khan et al.
-
Summary of Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging, by Jacob Morrison et al.
-
Summary of Multi-modal Graph Neural Networks For Localized Off-grid Weather Forecasting, by Qidong Yang et al.
-
Summary of Mechanistic Unlearning: Robust Knowledge Unlearning and Editing Via Mechanistic Localization, by Phillip Guo et al.
-
Summary of Syn2Real Domain Generalization For Underwater Mine-like Object Detection Using Side-scan Sonar, by Aayush Agrawal et al.
-
Summary of A Note on Shumailov et al. (2024): 'AI Models Collapse When Trained on Recursively Generated Data', by Ali Borji
-
Summary of Reinforcement Learning with Euclidean Data Augmentation For State-based Continuous Control, by Jinzhu Luo et al.
-
Summary of Flash Inference: Near Linear Time Inference For Long Convolution Sequence Models and Beyond, by Costin-Andrei Oncescu et al.
-
Summary of SSET: Swapping-sliding Explanation For Time Series Classifiers in Affect Detection, by Nazanin Fouladgar et al.
-
Summary of Double-bayesian Learning, by Stefan Jaeger
-
Summary of Hiding-in-plain-sight (HiPS) Attack on CLIP For Targetted Object Removal From Images, by Arka Daw et al.
-
Summary of LLM Chain Ensembles For Scalable and Accurate Data Annotation, by David Farr et al.
-
Summary of Context-scaling Versus Task-scaling in In-context Learning, by Amirhesam Abedsoltan et al.
-
Summary of JudgeBench: a Benchmark For Evaluating LLM-based Judges, by Sijun Tan et al.
-
Summary of Metal Price Spike Prediction Via a Neurosymbolic Ensemble Approach, by Nathaniel Lee et al.
-
Summary of Dual Prototype Evolving For Test-time Generalization Of Vision-language Models, by Ce Zhang et al.
-
Summary of Rethinking Misalignment in Vision-language Model Adaptation From a Causal Perspective, by Yanan Zhang et al.
-
Summary of AVID: Adapting Video Diffusion Models to World Models, by Marc Rigter et al.
-
Summary of GCM-Net: Graph-enhanced Cross-modal Infusion with a Metaheuristic-driven Network For Video Sentiment and Emotion Analysis, by Prasad Chaudhari et al.
-
Summary of Generative Reward Models, by Dakota Mahan et al.
-
Summary of Answering Questions in Stages: Prompt Chaining For Contract QA, by Adam Roegiest et al.
-
Summary of TextLap: Customizing Language Models For Text-to-layout Planning, by Jian Chen et al.
-
Summary of UniAutoML: a Human-centered Framework For Unified Discriminative and Generative AutoML with Large Language Models, by Jiayi Guo et al.
-
Summary of RecurFormer: Not All Transformer Heads Need Self-attention, by Ruiqing Yan et al.
-
Summary of Diversity Of Thought Elicits Stronger Reasoning Capabilities in Multi-agent Debate Frameworks, by Mahmood Hegazy
-
Summary of The Large Language Model GreekLegalRoBERTa, by Vasileios Saketos and Despina-Athanasia Pantazi and Manolis Koubarakis
-
Summary of Towards Homogeneous Lexical Tone Decoding From Heterogeneous Intracranial Recordings, by Di Wu et al.
-
Summary of ELF-Gym: Evaluating Large Language Models Generated Features For Tabular Prediction, by Yanlin Zhang et al.
-
Summary of IMAS: a Comprehensive Agentic Approach to Rural Healthcare Delivery, by Agasthya Gangavarapu and Ananya Gangavarapu
-
Summary of Language Model Preference Evaluation with Multiple Weak Evaluators, by Zhengyu Hu et al.
-
Summary of Skill Learning Using Process Mining For Large Language Model Plan Generation, by Andrei Cosmin Redis et al.
-
Summary of Beyond Right and Wrong: Mitigating Cold Start in Knowledge Tracing Using Large Language Model and Option Weight, by Jongwoo Kim et al.
-
Summary of Position Specific Scoring Is All You Need? Revisiting Protein Sequence Classification Tasks, by Sarwan Ali et al.
-
Summary of Explanation-preserving Augmentation For Semi-supervised Graph Representation Learning, by Zhuomin Chen et al.
-
Summary of New Paradigm Of Adversarial Training: Breaking Inherent Trade-off Between Accuracy and Robustness Via Dummy Classes, by Yanyun Wang et al.
-
Summary of Optimizing Multi-task Learning For Accurate Spacecraft Pose Estimation, by Francesco Evangelisti et al.
-
Summary of Context Matters: Leveraging Contextual Features For Time Series Forecasting, by Sameep Chattopadhyay et al.
-
Summary of Automatic Mapping Of Anatomical Landmarks From Free-text Using Large Language Models: Insights From Llama-2, by Mohamad Abdi et al.
-
Summary of Local Transfer Learning Gaussian Process Modeling, with Applications to Surrogate Modeling Of Expensive Computer Simulators, by Xinming Wang et al.
-
Summary of Machine Learning Approach to Brain Tumor Detection and Classification, by Alice Oh et al.
-
Summary of Neural-based Control For CubeSat Docking Maneuvers, by Matteo Stoisa et al.
-
Summary of Embedding An Ethical Mind: Aligning Text-to-image Synthesis Via Lightweight Value Optimization, by Xingqi Wang et al.
-
Summary of Optimizing 3D Geometry Reconstruction From Implicit Neural Representations, by Shen Fan and Przemyslaw Musialski
-
Summary of How Does Variance Shape the Regret in Contextual Bandits?, by Zeyu Jia et al.
-
Summary of Counterfactual Generative Modeling with Variational Causal Inference, by Yulun Wu et al.
-
Summary of CREAM: Consistency Regularized Self-rewarding Language Models, by Zhaoyang Wang et al.
-
Summary of Initialization Method For Factorization Machine Based on Low-rank Approximation For Constructing a Corrected Approximate Ising Model, by Yuya Seki et al.
-
Summary of StyleDistance: Stronger Content-independent Style Embeddings with Synthetic Parallel Examples, by Ajay Patel et al.
-
Summary of The Non-local Model Merging Problem: Permutation Symmetries and Variance Collapse, by Ekansh Sharma et al.
-
Summary of Meta-unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts, by Hongcheng Gao et al.
-
Summary of KcMF: a Knowledge-compliant Framework For Schema and Entity Matching with Fine-tuning-free LLMs, by Yongqin Xu et al.
-
Summary of End-to-end Planner Training For Language Modeling, by Nathan Cornille et al.
-
Summary of Data-driven Gyroscope Calibration, by Zeev Yampolsky and Itzik Klein
-
Summary of MING: a Functional Approach to Learning Molecular Generative Models, by Van Khoa Nguyen et al.
-
Summary of Is Complex Query Answering Really Complex?, by Cosimo Gregucci et al.
-
Summary of Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis Of SAEs, by Daniel J. Lee and Stefan Heimersheim
-
Summary of One Step Diffusion Via Shortcut Models, by Kevin Frans et al.
-
Summary of On the Role Of Activation Functions in EEG-to-text Decoder, by Zenon Lamprou et al.
-
Summary of Cocoon: Robust Multi-modal Perception with Uncertainty-aware Sensor Fusion, by Minkyoung Cho et al.
-
Summary of Expand and Compress: Exploring Tuning Principles For Continual Spatio-temporal Graph Forecasting, by Wei Chen et al.
-
Summary of Personalized Prediction Models For Changes in Knee Pain Among Patients with Osteoarthritis Participating in Supervised Exercise and Education, by M. Rafiei et al.
-
Summary of Self-supervised Learning Of Disentangled Representations For Multivariate Time-series, by Ching Chang et al.
-
Summary of The Bayesian Confidence (BACON) Estimator For Deep Neural Networks, by Patrick D. Kee et al.
-
Summary of Towards Graph Foundation Models: the Perspective Of Zero-shot Reasoning on Knowledge Graphs, by Kai Wang et al.
-
Summary of Weak-to-strong Generalization Beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal Reasoning, by Ruimeng Ye et al.
-
Summary of Low-rank Adversarial PGD Attack, by Dayana Savostianova et al.
-
Summary of Exploring Model Kinship For Merging Large Language Models, by Yedi Hu et al.
-
Summary of An Exact Finite-dimensional Explicit Feature Map For Kernel Functions, by Kamaledin Ghiasi-Shirazi et al.
-
Summary of Explainable Moral Values: a Neuro-symbolic Approach to Value Classification, by Nicolas Lazzari et al.
-
Summary of Constrained Posterior Sampling: Time Series Generation with Hard Constraints, by Sai Shankar Narasimhan et al.
-
Summary of A Numerical Study Of Chaotic Dynamics Of K-S Equation with FNOs, by Surbhi Khetrapal and Jaswin Kasi
-
Summary of Conjunction Subspaces Test For Conformal and Selective Classification, by Zengyou He and Zerun Li and Junjie Dong and Xinying Liu and Mudi Jiang and Lianyu Hu
-
Summary of Consistency Calibration: Improving Uncertainty Calibration Via Consistency Among Perturbed Neighbors, by Linwei Tao et al.
-
Summary of DAT: Improving Adversarial Robustness Via Generative Amplitude Mix-up in Frequency Domain, by Fengpeng Li et al.
-
Summary of TPFL: a Trustworthy Personalized Federated Learning Framework Via Subjective Logic, by Jinqian Chen et al.
-
Summary of MAX: Masked Autoencoder For X-ray Fluorescence in Geological Investigation, by An-Sheng Lee et al.
-
Summary of Federated Temporal Graph Clustering, by Zihao Zhou et al.
-
Summary of Towards Neural Scaling Laws For Time Series Foundation Models, by Qingren Yao et al.
-
Summary of Tracking Universal Features Through Fine-tuning and Model Merging, by Niels Horn and Desmond Elliott
-
Summary of Perseus: Leveraging Common Data Patterns with Curriculum Learning For More Robust Graph Neural Networks, by Kaiwen Xia et al.
-
Summary of ConLUX: Concept-based Local Unified Explanations, by Junhao Liu et al.
-
Summary of Loss Landscape Characterization Of Neural Networks Without Over-parametrization, by Rustem Islamov et al.
-
Summary of Training Neural Samplers with Reverse Diffusive KL Divergence, by Jiajun He et al.
-
Summary of Sharpness-aware Black-box Optimization, by Feiyang Ye et al.
-
Summary of HELM: Hierarchical Encoding For mRNA Language Modeling, by Mehdi Yazdani-Jahromi and Mangal Prakash and Tommaso Mansi and Artem Moskalev and Rui Liao
-
Summary of Challenges, Methods, Data — a Survey Of Machine Learning in Water Distribution Networks, by Valerie Vaquet et al.