Paper List

We recommend you use the search box as this list is very long.

Summary of Parametric Feature Transfer: One-shot Federated Learning with Foundation Models, by Mahdi Beitollahi et al.
Summary of Dfml: Decentralized Federated Mutual Learning, by Yasser H. Khalil et al.
Summary of Leveraging Large Language Models For Structure Learning in Prompted Weak Supervision, by Jinyan Su et al.
Summary of Multiverse: Exposing Large Language Model Alignment Problems in Diverse Worlds, by Xiaolong Jin et al.
Summary of Socially Aware Synthetic Data Generation For Suicidal Ideation Detection Using Large Language Models, by Hamideh Ghanadian et al.
Summary of Prompting Large Language Models For Zero-shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data, by Yinghao Zhu et al.
Summary of Tricy: Trigger-guided Data-to-text Generation with Intent Aware Attention-copy, by Vibhav Agarwal et al.
Summary of Measuring Moral Inconsistencies in Large Language Models, by Vamshi Krishna Bonagiri et al.
Summary of Cerm: Context-aware Literature-based Discovery Via Sentiment Analysis, by Julio Christian Young and Uchenna Akujuobi
Summary of Cftm: Continuous Time Fractional Topic Model, by Kei Nakagawa et al.
Summary of Systematic Literature Review: Computational Approaches For Humour Style Classification, by Mary Ogbuka Kenneth et al.
Summary of Towards Optimizing the Costs Of Llm Usage, by Shivanshu Shekhar et al.
Summary of Openmoe: An Early Effort on Open Mixture-of-experts Language Models, by Fuzhao Xue et al.
Summary of Rethinking Interpretability in the Era Of Large Language Models, by Chandan Singh et al.
Summary of Llm Voting: Human Choices and Ai Collective Decision Making, by Joshua C. Yang et al.
Summary of Enriched Physics-informed Neural Networks For Dynamic Poisson-nernst-planck Systems, by Xujia Huang et al.
Summary of Blackmamba: Mixture Of Experts For State-space Models, by Quentin Anthony et al.
Summary of Hiqa: a Hierarchical Contextual Augmentation Rag For Multi-documents Qa, by Xinyue Chen et al.
Summary of Disentangling the Roles Of Target-side Transfer and Regularization in Multilingual Machine Translation, by Yan Meng and Christof Monz
Summary of Hierarchical Multi-label Classification Of Online Vaccine Concerns, by Chloe Qinyu Zhu et al.
Summary of When Benchmarks Are Targets: Revealing the Sensitivity Of Large Language Model Leaderboards, by Norah Alzahrani et al.
Summary of Coa-gpt: Generative Pre-trained Transformers For Accelerated Course Of Action Development in Military Operations, by Vinicius G. Goecks et al.
Summary of Doublemldeep: Estimation Of Causal Effects with Multimodal Data, by Sven Klaassen et al.
Summary of Closing the Gap in Human Behavior Analysis: a Pipeline For Synthesizing Trimodal Data, by Christian Stippel et al.
Summary of Decoding Speculative Decoding, by Minghao Yan et al.
Summary of Adaptive Optimization For Prediction with Missing Data, by Dimitris Bertsimas et al.
Summary of Privacy-preserving Distributed Learning For Residential Short-term Load Forecasting, by Yi Dong et al.
Summary of Trustagent: Towards Safe and Trustworthy Llm-based Agents, by Wenyue Hua et al.
Summary of Understanding Adam Optimizer Via Online Learning Of Updates: Adam Is Ftrl in Disguise, by Kwangjun Ahn et al.
Summary of Natural Counterfactuals with Necessary Backtracking, by Guang-yuan Hao et al.
Summary of L2g2g: a Scalable Local-to-global Network Embedding with Graph Autoencoders, by Ruikang Ouyang et al.
Summary of Contingency Analysis Of a Grid Of Connected Evs For Primary Frequency Control Of An Industrial Microgrid Using Efficient Control Scheme, by J.n. Sabhahit et al.
Summary of Stochastic Two Points Method For Deep Model Zeroth-order Optimization, by Yijiang Pang et al.
Summary of Position Paper: Generalized Grammar Rules and Structure-based Generalization Beyond Classical Equivariance For Lexical Tasks and Transduction, by Mircea Petrache et al.
Summary of Detection Of Machine-generated Text: Literature Survey, by Dmytro Valiaiev
Summary of Time-varying Gaussian Process Bandits with Unknown Prior, by Juliusz Ziomek et al.
Summary of A Framework to Implement 1+n Multi-task Fine-tuning Pattern in Llms Using the Cgc-lora Algorithm, by Chao Song and Zhihao Ye and Qiqiang Lin and Qiuying Peng and Jun Wang
Summary of L-tuning: Synchronized Label Tuning For Prompt and Prefix in Llms, by Md. Kowsher et al.
Summary of Maximizing Data Efficiency For Cross-lingual Tts Adaptation by Self-supervised Representation Mixing and Embedding Initialization, By Wei-ping Huang et al.
Summary of Linguistic-based Mild Cognitive Impairment Detection Using Informative Loss, by Ali Pourramezan Fard et al.
Summary of Language-guided World Models: a Model-based Approach to Ai Control, by Alex Zhang et al.
Summary of Args: Alignment As Reward-guided Search, by Maxim Khanov et al.
Summary of Higen: Hierarchy-aware Sequence Generation For Hierarchical Text Classification, by Vidit Jain et al.
Summary of Lotr: Low Tensor Rank Weight Adaptation, by Daniel Bershatsky et al.
Summary of Emergence Of Heavy Tails in Homogenized Stochastic Gradient Descent, by Zhe Jiao et al.
Summary of Alert-transformer: Bridging Asynchronous and Synchronous Machine Learning For Real-time Event-based Spatio-temporal Data, by Carmen Martin-turrero et al.
Summary of A Probabilistic Model Behind Self-supervised Learning, by Alice Bizeul et al.
Summary of Query-efficient Correlation Clustering with Noisy Oracle, by Yuko Kuroki et al.
Summary of An Information Theoretic Approach to Machine Unlearning, by Jack Foster et al.
Summary of Counterfactual Concept Bottleneck Models, by Gabriele Dominici et al.
Summary of Xai For Skin Cancer Detection with Prototypes and Non-expert Supervision, by Miguel Correia et al.
Summary of Approximate Control For Continuous-time Pomdps, by Yannick Eich et al.
Summary of Conditioning Non-linear and Infinite-dimensional Diffusion Processes, by Elizabeth Louise Baker et al.
Summary of From Words to Molecules: a Survey Of Large Language Models in Chemistry, by Chang Liao et al.
Summary of A Survey Of Few-shot Learning on Graphs: From Meta-learning to Pre-training and Prompt Learning, by Xingtong Yu et al.
Summary of Mission Critical — Satellite Data Is a Distinct Modality in Machine Learning, by Esther Rolf et al.
Summary of Integrating Large Language Models in Causal Discovery: a Statistical Causal Approach, by Masayuki Takayama et al.
Summary of Self-attention Through Kernel-eigen Pair Sparse Variational Gaussian Processes, by Yingyi Chen et al.
Summary of Pre-training Protein Bi-level Representation Through Span Mask Strategy on 3d Protein Chains, by Jiale Zhao et al.
Summary of Connecting the Dots: Is Mode-connectedness the Key to Feasible Sample-based Inference in Bayesian Neural Networks?, by Emanuel Sommer et al.
Summary of Why Do Random Forests Work? Understanding Tree Ensembles As Self-regularizing Adaptive Smoothers, by Alicia Curth and Alan Jeffares and Mihaela Van Der Schaar
Summary of Mapping the Multiverse Of Latent Representations, by Jeremy Wayland et al.
Summary of Enhancing Stochastic Gradient Descent: a Unified Framework and Novel Acceleration Methods For Faster Convergence, by Yichuan Deng et al.
Summary of A Differentiable Partially Observable Generalized Linear Model with Forward-backward Message Passing, by Chengrui Li et al.
Summary of Can Mllms Perform Text-to-image In-context Learning?, by Yuchen Zeng et al.
Summary of Extremecast: Boosting Extreme Value Prediction For Global Weather Forecast, by Wanghan Xu et al.
Summary of A Unified Framework For Center-based Clustering Of Distributed Data, by Aleksandar Armacki et al.
Summary of Bi-cryptonets: Leveraging Different-level Privacy For Encrypted Inference, by Man-jie Yuan et al.
Summary of Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum, by Tin Sum Cheng and Aurelien Lucchi and Anastasis Kratsios and David Belius
Summary of Supervised Algorithmic Fairness in Distribution Shifts: a Survey, by Minglai Shao et al.
Summary of Kto: Model Alignment As Prospect Theoretic Optimization, by Kawin Ethayarajh et al.
Summary of Signsgd with Federated Defense: Harnessing Adversarial Attacks Through Gradient Sign Decoding, by Chanho Park et al.
Summary of Fundamental Properties Of Causal Entropy and Information Gain, by Francisco N. F. Q. Simoes et al.
Summary of Training-time Neuron Alignment Through Permutation Subspace For Improving Linear Mode Connectivity and Model Fusion, by Zexi Li et al.
Summary of Monotone, Bi-lipschitz, and Polyak-lojasiewicz Networks, by Ruigang Wang et al.
Summary of Shapelet-based Model-agnostic Counterfactual Local Explanations For Time Series Classification, by Qi Huang et al.
Summary of Core: Mitigating Catastrophic Forgetting in Continual Learning Through Cognitive Replay, by Jianshu Zhang et al.
Summary of Pfedmoe: Data-level Personalization with Mixture Of Experts For Model-heterogeneous Personalized Federated Learning, by Liping Yi et al.
Summary of Continual Learning For Large Language Models: a Survey, by Tongtong Wu et al.
Summary of Tesseract: Eliminating Experimental Bias in Malware Classification Across Space and Time (extended Version), by Zeliang Kan et al.
Summary of To the Max: Reinventing Reward in Reinforcement Learning, by Grigorii Veviurko et al.
Summary of On the Multi-modal Vulnerability Of Diffusion Models, by Dingcheng Yang et al.
Summary of Two-timescale Critic-actor For Average Reward Mdps with Function Approximation, by Prashansa Panda and Shalabh Bhatnagar
Summary of Learning Network Representations with Disentangled Graph Auto-encoder, by Di Fan et al.
Summary of Efficient Reinforcement Learning For Routing Jobs in Heterogeneous Queueing Systems, by Neharika Jali et al.
Summary of Limited Memory Online Gradient Descent For Kernelized Pairwise Learning with Dynamic Averaging, by Hilal Alquabeh et al.
Summary of Truncated Non-uniform Quantization For Distributed Sgd, by Guangfeng Yan et al.
Summary of Efficient Prompt Caching Via Embedding Similarity, by Hanlin Zhu et al.
Summary of Conditional Normalizing Flows For Active Learning Of Coarse-grained Molecular Representations, by Henrik Schopmans et al.
Summary of Few-shot Class-incremental Learning with Prior Knowledge, by Wenhao Jiang et al.
Summary of A Survey on Self-supervised Learning For Non-sequential Tabular Data, by Wei-yao Wang et al.
Summary of Neural Language Of Thought Models, by Yi-fu Wu et al.
Summary of Comparative Evaluation Of Weather Forecasting Using Machine Learning Models, by Md Saydur Rahman et al.
Summary of Efficient Causal Graph Discovery Using Large Language Models, by Thomas Jiralerspong et al.
Summary of Location Agnostic Adaptive Rain Precipitation Prediction Using Deep Learning, by Md Shazid Islam et al.
Summary of Hw-sw Optimization Of Dnns For Privacy-preserving People Counting on Low-resolution Infrared Arrays, by Matteo Risso et al.
Summary of Two Heads Are Better Than One: Boosting Graph Sparse Training Via Semantic and Topological Awareness, by Guibin Zhang et al.
Summary of Unveiling Delay Effects in Traffic Forecasting: a Perspective From Spatial-temporal Delay Differential Equations, by Qingqing Long et al.
Summary of Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training, by Sota Kudo et al.
Summary of Transformers Learn Nonlinear Features in Context: Nonconvex Mean-field Dynamics on the Attention Landscape, by Juno Kim and Taiji Suzuki