Paper List
We recommend you use the search box as this list is very long.
-
Summary of Optimistic Information Directed Sampling, by Gergely Neu et al.
-
Summary of G-repsnet: a Fast and General Construction Of Equivariant Networks For Arbitrary Matrix Groups, by Sourya Basu et al.
-
Summary of Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?, by Nader Asadi et al.
-
Summary of Calibration Of Deep Learning Classification Models in Fnirs, by Zhihao Cao et al.
-
Summary of Smoothed Graph Contrastive Learning Via Seamless Proximity Integration, by Maysam Behmanesh et al.
-
Summary of Classification Under Strategic Self-selection, by Guy Horowitz et al.
-
Summary of Optimized Deployment Of Deep Neural Networks For Visual Pose Estimation on Nano-drones, by Matteo Risso et al.
-
Summary of When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination, by Martin Benfeghoul et al.
-
Summary of Spatiotemporal Observer Design For Predictive Learning Of High-dimensional Data, by Tongyi Liang and Han-xiong Li
-
Summary of Generative Modelling with Tensor Train Approximations Of Hamilton–jacobi–bellman Equations, by David Sommer et al.
-
Summary of Let’s Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models, by Shunyu Liu et al.
-
Summary of Efficient State Space Model Via Fast Tensor Convolution and Block Diagonalization, by Tongyi Liang and Han-xiong Li
-
Summary of Semi-supervised Counting Via Pixel-by-pixel Density Distribution Modelling, by Hui Lin and Zhiheng Ma and Rongrong Ji and Yaowei Wang and Zhou Su and Xiaopeng Hong and Deyu Meng
-
Summary of Seeing Is Believing: Mitigating Hallucination in Large Vision-language Models Via Clip-guided Decoding, by Ailin Deng et al.
-
Summary of Causal Graph Discovery with Retrieval-augmented Generation Based Large Language Models, by Yuzhe Zhang et al.
-
Summary of Representing Online Handwriting For Recognition in Large Vision-language Models, by Anastasiia Fadeeva et al.
-
Summary of Counterfactual Generation with Identifiability Guarantees, by Hanqi Yan et al.
-
Summary of Arabiangpt: Native Arabic Gpt-based Large Language Model, by Anis Koubaa et al.
-
Summary of On Minimal Depth in Neural Networks, by Juan L. Valerdi
-
Summary of Gptvq: the Blessing Of Dimensionality For Llm Quantization, by Mart Van Baalen et al.
-
Summary of Opensun3d: 1st Workshop Challenge on Open-vocabulary 3d Scene Understanding, by Francis Engelmann et al.
-
Summary of Understanding Oversmoothing in Diffusion-based Gnns From the Perspective Of Operator Semigroup Theory, by Weichen Zhao et al.
-
Summary of Towards Principled Task Grouping For Multi-task Learning, by Chenguang Wang et al.
-
Summary of The Surprising Effectiveness Of Skip-tuning in Diffusion Sampling, by Jiajun Ma et al.
-
Summary of Towards Efficient and Optimal Covariance-adaptive Algorithms For Combinatorial Semi-bandits, by Julien Zhou et al.
-
Summary of Second-order Fine-tuning Without Pain For Llms: a Hessian Informed Zeroth-order Optimizer, by Yanjun Zhao et al.
-
Summary of Attention-guided Masked Autoencoders For Learning Image Representations, by Leon Sick et al.
-
Summary of Unified View Of Grokking, Double Descent and Emergent Abilities: a Perspective From Circuits Competition, by Yufei Huang et al.
-
Summary of Break the Breakout: Reinventing Lm Defense Against Jailbreak Attacks with Self-refinement, by Heegyu Kim et al.
-
Summary of Advancing Parameter Efficiency in Fine-tuning Via Representation Editing, by Muling Wu et al.
-
Summary of Graphedit: Large Language Models For Graph Structure Learning, by Zirui Guo et al.
-
Summary of Parameter-free Algorithms For Performative Regret Minimization Under Decision-dependent Distributions, by Sungwoo Park et al.
-
Summary of Biomedical Entity Linking As Multiple Choice Question Answering, by Zhenxi Lin et al.
-
Summary of Fine-tuning Of Continuous-time Diffusion Models As Entropy-regularized Control, by Masatoshi Uehara et al.
-
Summary of Bidirectional Uncertainty-based Active Learning For Open Set Annotation, by Chen-chen Zong et al.
-
Summary of Statistical Agnostic Regression: a Machine Learning Method to Validate Regression Models, by Juan M Gorriz et al.
-
Summary of Chunkattention: Efficient Self-attention with Prefix-aware Kv Cache and Two-phase Partition, by Lu Ye et al.
-
Summary of Fixed Random Classifier Rearrangement For Continual Learning, by Shengyang Huang and Jianwen Mo
-
Summary of Which Model to Transfer? a Survey on Transferability Estimation, by Yuhe Ding et al.
-
Summary of Unsupervised Domain Adaptation For Brain Vessel Segmentation Through Transwarp Contrastive Learning, by Fengming Lin et al.
-
Summary of Gs-ema: Integrating Gradient Surgery Exponential Moving Average with Boundary-aware Contrastive Learning For Enhanced Domain Generalization in Aneurysm Segmentation, by Fengming Lin et al.
-
Summary of A Bargaining-based Approach For Feature Trading in Vertical Federated Learning, by Yue Cui et al.
-
Summary of Optimal Transport For Structure Learning Under Missing Data, by Vy Vo et al.
-
Summary of Learning Solution Operators Of Pdes Defined on Varying Domains Via Mionet, by Shanshan Xiao et al.
-
Summary of Trajectory-wise Iterative Reinforcement Learning Framework For Auto-bidding, by Haoming Li et al.
-
Summary of Sampling-based Distributed Training with Message Passing Neural Network, by Priyesh Kakka et al.
-
Summary of Remaining-data-free Machine Unlearning by Suppressing Sample Contribution, by Xinwen Cheng and Zhehao Huang and Wenxin Zhou and Zhengbao He and Ruikai Yang and Yingwen Wu and Xiaolin Huang
-
Summary of Mspipe: Efficient Temporal Gnn Training Via Staleness-aware Pipeline, by Guangming Sheng et al.
-
Summary of Physics-constrained Polynomial Chaos Expansion For Scientific Machine Learning and Uncertainty Quantification, by Himanshu Sharma et al.
-
Summary of Fine-tuning Clip Text Encoders with Two-step Paraphrasing, by Hyunjae Kim et al.
-
Summary of Accelerating Convergence Of Stein Variational Gradient Descent Via Deep Unfolding, by Yuya Kawamura and Satoshi Takabe
-
Summary of Multi-armed Bandits with Abstention, by Junwen Yang et al.
-
Summary of Improving Sentence Embeddings with Automatic Generation Of Training Data Using Few-shot Examples, by Soma Sato et al.
-
Summary of Deep Coupling Network For Multivariate Time Series Forecasting, by Kun Yi et al.
-
Summary of Puad: Frustratingly Simple Method For Robust Anomaly Detection, by Shota Sugawara et al.
-
Summary of The Cost Of Parallelizing Boosting, by Xin Lyu et al.
-
Summary of Convergence Analysis Of Blurring Mean Shift, by Ryoya Yamasaki et al.
-
Summary of Self-adaptive Reconstruction with Contrastive Learning For Unsupervised Sentence Embeddings, by Junlong Liu et al.
-
Summary of On the Duality Between Sharpness-aware Minimization and Adversarial Training, by Yihao Zhang et al.
-
Summary of Machine Unlearning Of Pre-trained Large Language Models, by Jin Yao et al.
-
Summary of Spatially-aware Transformer For Embodied Agents, by Junmo Cho et al.
-
Summary of Entity-level Factual Adaptiveness Of Fine-tuning Based Abstractive Summarization Models, by Jongyoon Song et al.
-
Summary of Has the Deep Neural Network Learned the Stochastic Process? An Evaluation Viewpoint, by Harshit Kumar et al.
-
Summary of Quantum Theory and Application Of Contextual Optimal Transport, by Nicola Mariella et al.
-
Summary of Tinybenchmarks: Evaluating Llms with Fewer Examples, by Felipe Maia Polo et al.
-
Summary of How Important Is Tokenization in French Medical Masked Language Models?, by Yanis Labrak et al.
-
Summary of Comparison Of Machine Learning Classification Algorithms and Application to the Framingham Heart Study, by Nabil Kahouadji
-
Summary of Towards Few-shot Adaptation Of Foundation Models Via Multitask Finetuning, by Zhuoyan Xu et al.
-
Summary of Divide-or-conquer? Which Part Should You Distill Your Llm?, by Zhuofeng Wu et al.
-
Summary of Unintended Impacts Of Llm Alignment on Global Representation, by Michael J. Ryan et al.
-
Summary of Consistency-guided Temperature Scaling Using Style and Content Information For Out-of-domain Calibration, by Wonjeong Choi et al.
-
Summary of Fiducial Focus Augmentation For Facial Landmark Detection, by Purbayan Kar et al.
-
Summary of Towards Probabilistically-sound Beam Search with Masked Language Models, by Creston Brooks et al.
-
Summary of Nonlinear Bayesian Optimal Experimental Design Using Logarithmic Sobolev Inequalities, by Fengyi Li et al.
-
Summary of Kieval: a Knowledge-grounded Interactive Evaluation Framework For Large Language Models, by Zhuohao Yu et al.
-
Summary of Interpreting Context Look-ups in Transformers: Investigating Attention-mlp Interactions, by Clement Neo et al.
-
Summary of Fine-tuning Large Language Models For Domain-specific Machine Translation, by Jiawei Zheng et al.
-
Summary of Don’t Just Say “i Don’t Know”! Self-aligning Large Language Models For Responding to Unknown Questions with Explanations, by Yang Deng et al.
-
Summary of Enhancing One-shot Federated Learning Through Data and Ensemble Co-boosting, by Rong Dai et al.
-
Summary of Cost-adaptive Recourse Recommendation by Adaptive Preference Elicitation, by Duy Nguyen et al.
-
Summary of Pemt: Multi-task Correlation Guided Mixture-of-experts Enables Parameter-efficient Transfer Learning, by Zhisheng Lin et al.
-
Summary of Attributionbench: How Hard Is Automatic Attribution Evaluation?, by Yifei Li et al.
-
Summary of Multimodal Transformer with a Low-computational-cost Guarantee, by Sungjin Park and Edward Choi
-
Summary of Chain-of-thought Unfaithfulness As Disguised Accuracy, by Oliver Bentham et al.
-
Summary of Stop Reasoning! When Multimodal Llm with Chain-of-thought Reasoning Meets Adversarial Image, by Zefeng Wang et al.
-
Summary of Tokenization Counts: the Impact Of Tokenization on Arithmetic in Frontier Llms, by Aaditya K. Singh et al.
-
Summary of Practical Insights Into Knowledge Distillation For Pre-trained Models, by Norah Alballa and Marco Canini
-
Summary of Federated Fairness Without Access to Sensitive Groups, by Afroditi Papadaki et al.
-
Summary of Boosting Gets Full Attention For Relational Learning, by Mathieu Guillame-bert and Richard Nock
-
Summary of Sok: Analyzing Adversarial Examples: a Framework to Study Adversary Knowledge, by Lucas Fenaux and Florian Kerschbaum
-
Summary of Re-examine Distantly Supervised Ner: a New Benchmark and a Simple Approach, by Yuepei Li et al.
-
Summary of Enhancing Power Quality Event Classification with Ai Transformer Models, by Ahmad Mohammad Saber et al.
-
Summary of Smoothness Adaptive Hypothesis Transfer Learning, by Haotian Lin et al.
-
Summary of In-context Learning Of a Linear Transformer Block: Benefits Of the Mlp Component and One-step Gd Initialization, by Ruiqi Zhang et al.
-
Summary of The Common Stability Mechanism Behind Most Self-supervised Learning Approaches, by Abhishek Jha et al.
-
Summary of Genception: Evaluate Vision Llms with Unlabeled Unimodal Data, by Lele Cao et al.
-
Summary of Unsupervised Domain Adaptation Within Deep Foundation Latent Spaces, by Dmitry Kangin et al.
-
Summary of Privacy-enhancing Collaborative Information Sharing Through Federated Learning — a Case Of the Insurance Industry, by Panyi Dong et al.
-
Summary of Optimizing Language Models For Human Preferences Is a Causal Inference Problem, by Victoria Lin et al.