Paper List
We recommend you use the search box as this list is very long.
-
Summary of Parametric Feature Transfer: One-shot Federated Learning with Foundation Models, by Mahdi Beitollahi et al.
-
Summary of Dfml: Decentralized Federated Mutual Learning, by Yasser H. Khalil et al.
-
Summary of Multiverse: Exposing Large Language Model Alignment Problems in Diverse Worlds, by Xiaolong Jin et al.
-
Summary of Socially Aware Synthetic Data Generation For Suicidal Ideation Detection Using Large Language Models, by Hamideh Ghanadian et al.
-
Summary of Prompting Large Language Models For Zero-shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data, by Yinghao Zhu et al.
-
Summary of Tricy: Trigger-guided Data-to-text Generation with Intent Aware Attention-copy, by Vibhav Agarwal et al.
-
Summary of Measuring Moral Inconsistencies in Large Language Models, by Vamshi Krishna Bonagiri et al.
-
Summary of Cerm: Context-aware Literature-based Discovery Via Sentiment Analysis, by Julio Christian Young and Uchenna Akujuobi
-
Summary of Cftm: Continuous Time Fractional Topic Model, by Kei Nakagawa et al.
-
Summary of Systematic Literature Review: Computational Approaches For Humour Style Classification, by Mary Ogbuka Kenneth et al.
-
Summary of Towards Optimizing the Costs Of Llm Usage, by Shivanshu Shekhar et al.
-
Summary of Openmoe: An Early Effort on Open Mixture-of-experts Language Models, by Fuzhao Xue et al.
-
Summary of Rethinking Interpretability in the Era Of Large Language Models, by Chandan Singh et al.
-
Summary of Llm Voting: Human Choices and Ai Collective Decision Making, by Joshua C. Yang et al.
-
Summary of Enriched Physics-informed Neural Networks For Dynamic Poisson-nernst-planck Systems, by Xujia Huang et al.
-
Summary of Blackmamba: Mixture Of Experts For State-space Models, by Quentin Anthony et al.
-
Summary of Hiqa: a Hierarchical Contextual Augmentation Rag For Multi-documents Qa, by Xinyue Chen et al.
-
Summary of Disentangling the Roles Of Target-side Transfer and Regularization in Multilingual Machine Translation, by Yan Meng and Christof Monz
-
Summary of Hierarchical Multi-label Classification Of Online Vaccine Concerns, by Chloe Qinyu Zhu et al.
-
Summary of When Benchmarks Are Targets: Revealing the Sensitivity Of Large Language Model Leaderboards, by Norah Alzahrani et al.
-
Summary of Coa-gpt: Generative Pre-trained Transformers For Accelerated Course Of Action Development in Military Operations, by Vinicius G. Goecks et al.
-
Summary of Doublemldeep: Estimation Of Causal Effects with Multimodal Data, by Sven Klaassen et al.
-
Summary of Closing the Gap in Human Behavior Analysis: a Pipeline For Synthesizing Trimodal Data, by Christian Stippel et al.
-
Summary of Decoding Speculative Decoding, by Minghao Yan et al.
-
Summary of Adaptive Optimization For Prediction with Missing Data, by Dimitris Bertsimas et al.
-
Summary of Privacy-preserving Distributed Learning For Residential Short-term Load Forecasting, by Yi Dong et al.
-
Summary of Trustagent: Towards Safe and Trustworthy Llm-based Agents, by Wenyue Hua et al.
-
Summary of Understanding Adam Optimizer Via Online Learning Of Updates: Adam Is Ftrl in Disguise, by Kwangjun Ahn et al.
-
Summary of Natural Counterfactuals with Necessary Backtracking, by Guang-yuan Hao et al.
-
Summary of L2g2g: a Scalable Local-to-global Network Embedding with Graph Autoencoders, by Ruikang Ouyang et al.
-
Summary of Contingency Analysis Of a Grid Of Connected Evs For Primary Frequency Control Of An Industrial Microgrid Using Efficient Control Scheme, by J.n. Sabhahit et al.
-
Summary of Stochastic Two Points Method For Deep Model Zeroth-order Optimization, by Yijiang Pang et al.
-
Summary of Position Paper: Generalized Grammar Rules and Structure-based Generalization Beyond Classical Equivariance For Lexical Tasks and Transduction, by Mircea Petrache et al.
-
Summary of Detection Of Machine-generated Text: Literature Survey, by Dmytro Valiaiev
-
Summary of Time-varying Gaussian Process Bandits with Unknown Prior, by Juliusz Ziomek et al.
-
Summary of A Framework to Implement 1+n Multi-task Fine-tuning Pattern in Llms Using the Cgc-lora Algorithm, by Chao Song and Zhihao Ye and Qiqiang Lin and Qiuying Peng and Jun Wang
-
Summary of L-tuning: Synchronized Label Tuning For Prompt and Prefix in Llms, by Md. Kowsher et al.
-
Summary of Maximizing Data Efficiency For Cross-lingual Tts Adaptation by Self-supervised Representation Mixing and Embedding Initialization, By Wei-ping Huang et al.
-
Summary of Language-guided World Models: a Model-based Approach to Ai Control, by Alex Zhang et al.
-
Summary of Args: Alignment As Reward-guided Search, by Maxim Khanov et al.
-
Summary of Higen: Hierarchy-aware Sequence Generation For Hierarchical Text Classification, by Vidit Jain et al.
-
Summary of Lotr: Low Tensor Rank Weight Adaptation, by Daniel Bershatsky et al.
-
Summary of Emergence Of Heavy Tails in Homogenized Stochastic Gradient Descent, by Zhe Jiao et al.
-
Summary of Alert-transformer: Bridging Asynchronous and Synchronous Machine Learning For Real-time Event-based Spatio-temporal Data, by Carmen Martin-turrero et al.
-
Summary of A Probabilistic Model Behind Self-supervised Learning, by Alice Bizeul et al.
-
Summary of Query-efficient Correlation Clustering with Noisy Oracle, by Yuko Kuroki et al.
-
Summary of An Information Theoretic Approach to Machine Unlearning, by Jack Foster et al.
-
Summary of Counterfactual Concept Bottleneck Models, by Gabriele Dominici et al.
-
Summary of Xai For Skin Cancer Detection with Prototypes and Non-expert Supervision, by Miguel Correia et al.
-
Summary of Approximate Control For Continuous-time Pomdps, by Yannick Eich et al.
-
Summary of Conditioning Non-linear and Infinite-dimensional Diffusion Processes, by Elizabeth Louise Baker et al.
-
Summary of From Words to Molecules: a Survey Of Large Language Models in Chemistry, by Chang Liao et al.
-
Summary of Mission Critical — Satellite Data Is a Distinct Modality in Machine Learning, by Esther Rolf et al.
-
Summary of Integrating Large Language Models in Causal Discovery: a Statistical Causal Approach, by Masayuki Takayama et al.
-
Summary of Self-attention Through Kernel-eigen Pair Sparse Variational Gaussian Processes, by Yingyi Chen et al.
-
Summary of Pre-training Protein Bi-level Representation Through Span Mask Strategy on 3d Protein Chains, by Jiale Zhao et al.
-
Summary of Connecting the Dots: Is Mode-connectedness the Key to Feasible Sample-based Inference in Bayesian Neural Networks?, by Emanuel Sommer et al.
-
Summary of Why Do Random Forests Work? Understanding Tree Ensembles As Self-regularizing Adaptive Smoothers, by Alicia Curth and Alan Jeffares and Mihaela Van Der Schaar
-
Summary of Mapping the Multiverse Of Latent Representations, by Jeremy Wayland et al.
-
Summary of Enhancing Stochastic Gradient Descent: a Unified Framework and Novel Acceleration Methods For Faster Convergence, by Yichuan Deng et al.
-
Summary of A Differentiable Partially Observable Generalized Linear Model with Forward-backward Message Passing, by Chengrui Li et al.
-
Summary of Can Mllms Perform Text-to-image In-context Learning?, by Yuchen Zeng et al.
-
Summary of Extremecast: Boosting Extreme Value Prediction For Global Weather Forecast, by Wanghan Xu et al.
-
Summary of A Unified Framework For Center-based Clustering Of Distributed Data, by Aleksandar Armacki et al.
-
Summary of Bi-cryptonets: Leveraging Different-level Privacy For Encrypted Inference, by Man-jie Yuan et al.
-
Summary of Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum, by Tin Sum Cheng and Aurelien Lucchi and Anastasis Kratsios and David Belius
-
Summary of Supervised Algorithmic Fairness in Distribution Shifts: a Survey, by Minglai Shao et al.
-
Summary of Kto: Model Alignment As Prospect Theoretic Optimization, by Kawin Ethayarajh et al.
-
Summary of Signsgd with Federated Defense: Harnessing Adversarial Attacks Through Gradient Sign Decoding, by Chanho Park et al.
-
Summary of Fundamental Properties Of Causal Entropy and Information Gain, by Francisco N. F. Q. Simoes et al.
-
Summary of Training-time Neuron Alignment Through Permutation Subspace For Improving Linear Mode Connectivity and Model Fusion, by Zexi Li et al.
-
Summary of Monotone, Bi-lipschitz, and Polyak-lojasiewicz Networks, by Ruigang Wang et al.
-
Summary of Shapelet-based Model-agnostic Counterfactual Local Explanations For Time Series Classification, by Qi Huang et al.
-
Summary of Core: Mitigating Catastrophic Forgetting in Continual Learning Through Cognitive Replay, by Jianshu Zhang et al.
-
Summary of Pfedmoe: Data-level Personalization with Mixture Of Experts For Model-heterogeneous Personalized Federated Learning, by Liping Yi et al.
-
Summary of Continual Learning For Large Language Models: a Survey, by Tongtong Wu et al.
-
Summary of Tesseract: Eliminating Experimental Bias in Malware Classification Across Space and Time (extended Version), by Zeliang Kan et al.
-
Summary of To the Max: Reinventing Reward in Reinforcement Learning, by Grigorii Veviurko et al.
-
Summary of On the Multi-modal Vulnerability Of Diffusion Models, by Dingcheng Yang et al.
-
Summary of Two-timescale Critic-actor For Average Reward Mdps with Function Approximation, by Prashansa Panda and Shalabh Bhatnagar
-
Summary of Learning Network Representations with Disentangled Graph Auto-encoder, by Di Fan et al.
-
Summary of Efficient Reinforcement Learning For Routing Jobs in Heterogeneous Queueing Systems, by Neharika Jali et al.
-
Summary of Limited Memory Online Gradient Descent For Kernelized Pairwise Learning with Dynamic Averaging, by Hilal Alquabeh et al.
-
Summary of Truncated Non-uniform Quantization For Distributed Sgd, by Guangfeng Yan et al.
-
Summary of Efficient Prompt Caching Via Embedding Similarity, by Hanlin Zhu et al.
-
Summary of Conditional Normalizing Flows For Active Learning Of Coarse-grained Molecular Representations, by Henrik Schopmans et al.
-
Summary of A Survey on Self-supervised Learning For Non-sequential Tabular Data, by Wei-yao Wang et al.
-
Summary of Neural Language Of Thought Models, by Yi-fu Wu et al.
-
Summary of Comparative Evaluation Of Weather Forecasting Using Machine Learning Models, by Md Saydur Rahman et al.
-
Summary of Efficient Causal Graph Discovery Using Large Language Models, by Thomas Jiralerspong et al.
-
Summary of Location Agnostic Adaptive Rain Precipitation Prediction Using Deep Learning, by Md Shazid Islam et al.
-
Summary of Hw-sw Optimization Of Dnns For Privacy-preserving People Counting on Low-resolution Infrared Arrays, by Matteo Risso et al.
-
Summary of Two Heads Are Better Than One: Boosting Graph Sparse Training Via Semantic and Topological Awareness, by Guibin Zhang et al.
-
Summary of Unveiling Delay Effects in Traffic Forecasting: a Perspective From Spatial-temporal Delay Differential Equations, by Qingqing Long et al.
-
Summary of Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training, by Sota Kudo et al.
-
Summary of Transformers Learn Nonlinear Features in Context: Nonconvex Mean-field Dynamics on the Attention Landscape, by Juno Kim and Taiji Suzuki