Paper List
We recommend you use the search box as this list is very long.
-
Summary of Increasing Transformer Token Length with a Maximum Entropy Principle Method, by R. I. Cukier
-
Summary of Adapmoe: Adaptive Sensitivity-based Expert Gating and Management For Efficient Moe Inference, by Shuzhang Zhong et al.
-
Summary of Batgpt-chem: a Foundation Large Model For Retrosynthesis Prediction, by Yifei Yang et al.
-
Summary of Preference-optimized Pareto Set Learning For Blackbox Optimization, by Zhang Haishan et al.
-
Summary of Uniting Contrastive and Generative Learning For Event Sequences Models, by Aleksandr Yugay and Alexey Zaytsev
-
Summary of The Fairness-quality Trade-off in Clustering, by Rashida Hakim et al.
-
Summary of Unlocking the Power Of Lstm For Long Term Time Series Forecasting, by Yaxuan Kong et al.
-
Summary of Pinnde: Physics-informed Neural Networks For Solving Differential Equations, by Jason Matthews et al.
-
Summary of Exploiting Fine-grained Prototype Distribution For Boosting Unsupervised Class Incremental Learning, by Jiaming Liu et al.
-
Summary of Facial Wrinkle Segmentation For Cosmetic Dermatology: Pretraining with Texture Map-based Weak Supervision, by Junho Moon et al.
-
Summary of Efficient Exploration in Deep Reinforcement Learning: a Novel Bayesian Actor-critic Algorithm, by Nikolai Rozanov
-
Summary of Personalizing Reinforcement Learning From Human Feedback with Variational Preference Learning, by Sriyash Poddar et al.
-
Summary of Federated Frank-wolfe Algorithm, by Ali Dadras et al.
-
Summary of Masala: Model-agnostic Surrogate Explanations by Locality Adaptation, By Saif Anwar et al.
-
Summary of Tango: Clustering with Typicality-aware Nonlocal Mode-seeking and Graph-cut Optimization, by Haowen Ma et al.
-
Summary of Perturb-and-compare Approach For Detecting Out-of-distribution Samples in Constrained Access Environments, by Heeyoung Lee et al.
-
Summary of Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models, by Tianyu Zhang and Yuxiang Ren and Chengbin Hou and Hairong Lv and Xuegong Zhang
-
Summary of Learning Brave Assumption-based Argumentation Frameworks Via Asp, by Emanuele De Angelis (1) et al.
-
Summary of Robust Spectral Clustering with Rank Statistics, by Joshua Cape and Xianshi Yu and Jonquil Z. Liao
-
Summary of In-context Learning with Representations: Contextual Generalization Of Trained Transformers, by Tong Yang et al.
-
Summary of Multilingual Needle in a Haystack: Investigating Long-context Behavior Of Multilingual Large Language Models, by Amey Hengle et al.
-
Summary of Transformers to Ssms: Distilling Quadratic Knowledge to Subquadratic Models, by Aviv Bick et al.
-
Summary of A Population-to-individual Tuning Framework For Adapting Pretrained Lm to On-device User Intent Prediction, by Jiahui Gong et al.
-
Summary of Symplectic Neural Networks Based on Dynamical Systems, by Benjamin K Tapley
-
Summary of Machine Learning with Physics Knowledge For Prediction: a Survey, by Joe Watson et al.
-
Summary of Mitigating the Stability-plasticity Dilemma in Adaptive Train Scheduling with Curriculum-driven Continual Dqn Expansion, by Achref Jaziri et al.
-
Summary of Shortcircuit: Alphazero-driven Circuit Design, by Dimitrios Tsaras et al.
-
Summary of 3d-aware Instance Segmentation and Tracking in Egocentric Videos, by Yash Bhalgat et al.
-
Summary of Maple: Enhancing Review Generation with Multi-aspect Prompt Learning in Explainable Recommendation, by Ching-wen Yang et al.
-
Summary of New Spectral Imaging Biomarkers For Sepsis and Mortality in Intensive Care, by Silvia Seidlitz et al.
-
Summary of Gino-q: Learning An Asymptotically Optimal Index Policy For Restless Multi-armed Bandits, by Gongpu Chen et al.
-
Summary of Differential Private Stochastic Optimization with Heavy-tailed Data: Towards Optimal Rates, by Puning Zhao et al.
-
Summary of Performance Law Of Large Language Models, by Chuhan Wu et al.
-
Summary of Instruction-based Molecular Graph Generation with Unified Text-graph Diffusion Model, by Yuran Xiang et al.
-
Summary of Active Learning For Identifying Disaster-related Tweets: a Comparison with Keyword Filtering and Generic Fine-tuning, by David Hanny et al.
-
Summary of Data Augmentation Of Contrastive Learning Is Estimating Positive-incentive Noise, by Hongyuan Zhang et al.
-
Summary of Weakly Supervised Pretraining and Multi-annotator Supervised Finetuning For Facial Wrinkle Detection, by Ik Jun Moon et al.
-
Summary of Adaresnet: Enhancing Residual Networks with Dynamic Weight Adjustment For Improved Feature Integration, by Hong Su
-
Summary of Mask in the Mirror: Implicit Sparsification, by Tom Jacobs and Rebekka Burkholz
-
Summary of Unsupervised Machine Learning Hybrid Approach Integrating Linear Programming in Loss Function: a Robust Optimization Technique, by Andrew Kiruluta et al.
-
Summary of The Exploration-exploitation Dilemma Revisited: An Entropy Perspective, by Renye Yan et al.
-
Summary of Modegpt: Modular Decomposition For Large Language Model Compression, by Chi-heng Lin et al.
-
Summary of On the Necessity Of World Knowledge For Mitigating Missing Labels in Extreme Classification, by Jatin Prakash et al.
-
Summary of Attention Is a Smoothed Cubic Spline, by Zehua Lai et al.
-
Summary of Regularization For Adversarial Robust Learning, by Jie Wang and Rui Gao and Yao Xie
-
Summary of Contextual Bandits For Unbounded Context Distributions, by Puning Zhao et al.
-
Summary of Community-centric Graph Unlearning, by Yi Li et al.
-
Summary of Lightweather: Harnessing Absolute Positional Encoding to Efficient and Scalable Global Weather Forecasting, by Yisong Fu et al.
-
Summary of Confirmation Bias in Gaussian Mixture Models, by Amnon Balanov et al.
-
Summary of Towards Few-shot Learning in the Open World: a Review and Beyond, by Hui Xue et al.
-
Summary of Stransformer: a Modular Approach For Extracting Inter-sequential and Temporal Information For Time-series Forecasting, by Jiaheng Yin et al.
-
Summary of Strategic Demonstration Selection For Improved Fairness in Llm In-context Learning, by Jingyu Hu et al.
-
Summary of Baby Bear: Seeking a Just Right Rating Scale For Scalar Annotations, by Xu Han et al.
-
Summary of Sequential Federated Learning in Hierarchical Architecture on Non-iid Datasets, by Xingrun Yan et al.
-
Summary of Faster Adaptive Decentralized Learning Algorithms, by Feihu Huang and Jianyu Zhao
-
Summary of Structure-enhanced Contrastive Learning For Graph Clustering, by Xunlian Wu et al.
-
Summary of Altbi: Constructing Improved Outlier Detection Models Via Optimization Of Inlier-memorization Effect, by Seoyoung Cho et al.
-
Summary of Enhance Modality Robustness in Text-centric Multimodal Alignment with Adversarial Prompting, by Yun-da Tsai et al.
-
Summary of Unsupervised Composable Representations For Audio, by Giovanni Bindi et al.
-
Summary of Parameterized Physics-informed Neural Networks For Parameterized Pdes, by Woojin Cho et al.
-
Summary of Attention Is Not What You Need: Revisiting Multi-instance Learning For Whole Slide Image Classification, by Xin Liu et al.
-
Summary of Reparameterized Multi-resolution Convolutions For Long Sequence Modelling, by Harry Jake Cunningham et al.
-
Summary of Advances in Multiple Instance Learning For Whole Slide Image Analysis: Techniques, Challenges, and Future Directions, by Jun Wang et al.
-
Summary of Advancements in Molecular Property Prediction: a Survey Of Single and Multimodal Approaches, by Tanya Liyaqat et al.
-
Summary of Mitigating Noise Detriment in Differentially Private Federated Learning with Model Pre-training, by Huitong Jin et al.
-
Summary of Leveraging Invariant Principle For Heterophilic Graph Structure Distribution Shifts, by Jinluan Yang et al.
-
Summary of Directed Exploration in Reinforcement Learning From Linear Temporal Logic, by Marco Bagatella et al.
-
Summary of Ancestral Reinforcement Learning: Unifying Zeroth-order Optimization and Genetic Algorithms For Reinforcement Learning, by So Nakashima and Tetsuya J. Kobayashi
-
Summary of Out-of-distribution Generalization Via Composition: a Lens Through Induction Heads in Transformers, by Jiajun Song et al.
-
Summary of A Unified Framework For Interpretable Transformers Using Pdes and Information Theory, by Yukun Zhang
-
Summary of Fine-gained Air Quality Inference Based on Low-quality Sensing Data Using Self-supervised Learning, by Meng Xu et al.
-
Summary of Deep Limit Model-free Prediction in Regression, by Kejin Wu and Dimitris N. Politis
-
Summary of Sample-optimal Large-scale Optimal Subset Selection, by Zaile Li et al.
-
Summary of Byzantine-resilient Federated Learning Employing Normalized Gradients on Non-iid Datasets, by Shiyuan Zuo et al.
-
Summary of Seamless Integration: Sampling Strategies in Federated Learning Systems, by Tatjana Legler et al.
-
Summary of Say My Name: a Model’s Bias Discovery Framework, by Massimiliano Ciranni et al.
-
Summary of Addressing Heterogeneity in Federated Learning: Challenges and Solutions For a Shared Production Environment, by Tatjana Legler et al.
-
Summary of A Markov Random Field Multi-modal Variational Autoencoder, by Fouad Oubari et al.
-
Summary of Convolutional Conditional Neural Processes, by Wessel P. Bruinsma
-
Summary of Padetbench: Towards Benchmarking Physical Attacks Against Object Detection, by Jiawei Lian et al.
-
Summary of A Deep Neural Network Framework For Solving Forward and Inverse Problems in Delay Differential Equations, by Housen Wang et al.
-
Summary of Drl-based Resource Allocation For Motion Blur Resistant Federated Self-supervised Learning in Iov, by Xueying Gu et al.
-
Summary of Scalable and Certifiable Graph Unlearning: Overcoming the Approximation Error Barrier, by Lu Yi et al.
-
Summary of On the Improvement Of Generalization and Stability Of Forward-only Learning Via Neural Polarization, by Erik B. Terres-escudero et al.
-
Summary of Graph Classification with Gnns: Optimisation, Representation and Inductive Bias, by P. Krishna Kumar a and Harish G. Ramaswamy
-
Summary of Premap: a Unifying Preimage Approximation Framework For Neural Networks, by Xiyue Zhang et al.
-
Summary of A Benchmark Time Series Dataset For Semiconductor Fabrication Manufacturing Constructed Using Component-based Discrete-event Simulation Models, by Vamsi Krishna Pendyala et al.
-
Summary of Narrowing the Focus: Learned Optimizers For Pretrained Models, by Gus Kristiansen et al.
-
Summary of Learning Fair Invariant Representations Under Covariate and Correlation Shifts Simultaneously, by Dong Li et al.
-
Summary of Predicting Travel Demand Of a Bike Sharing System Using Graph Convolutional Neural Networks, by Ali Behroozi and Ali Edrisi
-
Summary of A Probabilistic Framework For Adapting to Changing and Recurring Concepts in Data Streams, by Ben Halstead et al.
-
Summary of Threshold Filtering Packing For Supervised Fine-tuning: Training Related Samples Within Packs, by Jiancheng Dong et al.
-
Summary of Improvement Of Bayesian Pinn Training Convergence in Solving Multi-scale Pdes with Noise, by Yilong Hou et al.
-
Summary of E-cgl: An Efficient Continual Graph Learner, by Jianhao Guo et al.
-
Summary of Mutual Information Multinomial Estimation, by Yanzhi Chen et al.
-
Summary of Detecting the Undetectable: Combining Kolmogorov-arnold Networks and Mlp For Ai-generated Image Detection, by Taharim Rahman Anon et al.