Paper List
We recommend you use the search box as this list is very long.
-
Summary of Beyond Position: the Emergence Of Wavelet-like Properties in Transformers, by Valeria Ruscio et al.
-
Summary of Pod-attention: Unlocking Full Prefill-decode Overlap For Faster Llm Inference, by Aditya K Kamath et al.
-
Summary of Training Free Guided Flow Matching with Optimal Control, by Luran Wang et al.
-
Summary of Petah: Parameter Efficient Task Adaptation For Hybrid Transformers in a Resource-limited Context, by Maximilian Augustin et al.
-
Summary of Beware Of Calibration Data For Pruning Large Language Models, by Yixin Ji et al.
-
Summary of Continual Learning on a Data Diet, by Elif Ceren Gok Yildirim et al.
-
Summary of Visage: Video Synthesis Using Action Graphs For Surgery, by Yousef Yeganeh et al.
-
Summary of Topology Meets Machine Learning: An Introduction Using the Euler Characteristic Transform, by Bastian Rieck
-
Summary of Learning Versatile Skills with Curriculum Masking, by Yao Tang et al.
-
Summary of Anomaly Resilient Temporal Qos Prediction Using Hypergraph Convoluted Transformer Network, by Suraj Kumar et al.
-
Summary of Escaping the Forest: Sparse Interpretable Neural Networks For Tabular Data, by Salvatore Raieli et al.
-
Summary of Faster Language Models with Better Multi-token Prediction Using Tensor Decomposition, by Artem Basharin et al.
-
Summary of Enhancing Federated Learning Convergence with Dynamic Data Queue and Data Entropy-driven Participant Selection, by Charuka Herath et al.
-
Summary of Small Singular Values Matter: a Random Matrix Analysis Of Transformer Models, by Max Staats et al.
-
Summary of A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression, by Tin Sum Cheng and Aurelien Lucchi and Anastasis Kratsios and David Belius
-
Summary of Att2cpc: Attention-guided Lossy Attribute Compression Of Point Clouds, by Kai Liu et al.
-
Summary of Is the Gpu Half-empty or Half-full? Practical Scheduling Techniques For Llms, by Ferdi Kossmann et al.
-
Summary of Optimal Streaming Algorithms For Multi-armed Bandits, by Tianyuan Jin et al.
-
Summary of The Probabilistic Tsetlin Machine: a Novel Approach to Uncertainty Quantification, by K. Darshana Abeyrathna et al.
-
Summary of Population Stratification For Prediction Of Mortality in Post-aki Patients, by Flavio S. Correa Da Silva et al.
-
Summary of Relaxed Equivariance Via Multitask Learning, by Ahmed A. Elhag et al.
-
Summary of Identifiable Representation and Model Learning For Latent Dynamic Systems, by Congxi Zhang et al.
-
Summary of Adarankgrad: Adaptive Gradient-rank and Moments For Memory-efficient Llms Training and Fine-tuning, by Yehonathan Refael et al.
-
Summary of Wagle: Strategic Weight Attribution For Effective and Modular Unlearning in Large Language Models, by Jinghan Jia et al.
-
Summary of Time and Frequency Synergy For Source-free Time-series Domain Adaptations, by Muhammad Tanzil Furqon et al.
-
Summary of Mobilesafetybench: Evaluating Safety Of Autonomous Agents in Mobile Device Control, by Juyong Lee et al.
-
Summary of Gdda: Semantic Ood Detection on Graphs Under Covariate Shift Via Score-based Diffusion Models, by Zhixia He and Chen Zhao and Minglai Shao and Yujie Lin and Dong Li and Qin Tian
-
Summary of Primal-dual Spectral Representation For Off-policy Evaluation, by Yang Hu et al.
-
Summary of Multimodal Information Bottleneck For Deep Reinforcement Learning with Multiple Sensors, by Bang You et al.
-
Summary of Predicting 30-day Hospital Readmission in Medicare Patients: Insights From An Lstm Deep Learning Model, by Xintao Li et al.
-
Summary of Disengcd: a Meta Multigraph-assisted Disentangled Graph Learning Framework For Cognitive Diagnosis, by Shangshang Yang et al.
-
Summary of Differentially Private Learning Needs Better Model Initialization and Self-distillation, by Ivoline C. Ngong et al.
-
Summary of Securing Federated Learning Against Novel and Classic Backdoor Threats During Foundation Model Integration, by Xiaohuan Bi et al.
-
Summary of Adversarial Domain Adaptation For Metal Cutting Sound Detection: Leveraging Abundant Lab Data For Scarce Industry Data, by Mir Imtiaz Mostafiz (1) et al.
-
Summary of A Kernel Perspective on Distillation-based Collaborative Learning, by Sejun Park et al.
-
Summary of Self-supervised Graph Neural Networks For Enhanced Feature Extraction in Heterogeneous Information Networks, by Jianjun Wei et al.
-
Summary of Incremental Learning Of Affordances Using Markov Logic Networks, by George Potter et al.
-
Summary of Bonsai: Gradient-free Graph Distillation For Node Classification, by Mridul Gupta and Samyak Jain and Vansh Ramani and Hariprasad Kodamana and Sayan Ranu
-
Summary of Entity-based Reinforcement Learning For Autonomous Cyber Defence, by Isaac Symes Thompson et al.
-
Summary of Is Smoothness the Key to Robustness? a Comparison Of Attention and Convolution Models Using a Novel Metric, by Baiyuan Chen
-
Summary of Scalable Random Feature Latent Variable Models, by Ying Li et al.
-
Summary of Towards Active Participant Centric Vertical Federated Learning: Some Representations May Be All You Need, by Jon Irureta et al.
-
Summary of Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes, by Dongwen Luo
-
Summary of Enhancing Robustness and Efficiency Of Least Square Twin Svm Via Granular Computing, by M. Tanveer et al.
-
Summary of Hierarchical Multi-agent Reinforcement Learning For Cyber Network Defense, by Aditya Vikram Singh et al.
-
Summary of Amusd: Asynchronous Multi-device Speculative Decoding For Llm Acceleration, by Bradley Mcdanel
-
Summary of Episodic Future Thinking Mechanism For Multi-agent Reinforcement Learning, by Dongsu Lee et al.
-
Summary of Cooperative Multi-agent Constrained Stochastic Linear Bandits, by Amirhossein Afsharrad et al.
-
Summary of Packetlstm: Dynamic Lstm Framework For Streaming Data with Varying Feature Space, by Rohit Agarwal et al.
-
Summary of Geometric Graph Neural Network Modeling Of Human Interactions in Crowded Environments, by Sara Honarvar et al.
-
Summary of End-to-end Optimization and Learning Of Fair Court Schedules, by My H Dinh and James Kotary and Lauryn P. Gouldin and William Yeoh and Ferdinando Fioretto
-
Summary of Interpreting Affine Recurrence Learning in Gpt-style Transformers, by Samarth Bhargav et al.
-
Summary of Guaranteeing Conservation Laws with Projection in Physics-informed Neural Networks, by Anthony Baez et al.
-
Summary of Detecting Adversarial Examples, by Furkan Mumcu et al.
-
Summary of Data Obfuscation Through Latent Space Projection (lsp) For Privacy-preserving Ai Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection, by Mahesh Vaijainthymala Krishnamoorthy
-
Summary of Scalable Implicit Graphon Learning, by Ali Azizpour et al.
-
Summary of Drop: Distributional and Regular Optimism and Pessimism For Reinforcement Learning, by Taisuke Kobayashi
-
Summary of Evolution with Opponent-learning Awareness, by Yann Bouteiller et al.
-
Summary of Do Robot Snakes Dream Like Electric Sheep? Investigating the Effects Of Architectural Inductive Biases on Hallucination, by Jerry Huang et al.
-
Summary of Which Client Is Reliable?: a Reliable and Personalized Prompt-based Federated Learning For Medical Image Question Answering, by He Zhu et al.
-
Summary of Mitigating Graph Covariate Shift Via Score-based Out-of-distribution Augmentation, by Bohan Wang et al.
-
Summary of Congestion Forecast For Trains with Railroad-graph-based Semi-supervised Learning Using Sparse Passenger Reports, by Soto Anno et al.
-
Summary of Can General-purpose Large Language Models Generalize to English-thai Machine Translation ?, by Jirat Chiaranaipanich et al.
-
Summary of Lino: Advancing Recursive Residual Decomposition Of Linear and Nonlinear Patterns For Robust Time Series Forecasting, by Guoqi Yu et al.
-
Summary of Lines: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging, by Ke Wang et al.
-
Summary of On Functional Dimension and Persistent Pseudodimension, by J. Elisenda Grigsby and Kathryn Lindsey
-
Summary of Hierarchical Upper Confidence Bounds For Constrained Online Learning, by Ali Baheri
-
Summary of Dhoroni: Exploring Bengali Climate Change and Environmental Views with a Multi-perspective News Dataset and Natural Language Processing, by Azmine Toushik Wasi et al.
-
Summary of Representation Shattering in Transformers: a Synthetic Study with Knowledge Editing, by Kento Nishi et al.
-
Summary of Fine-tuning Large Language Models to Appropriately Abstain with Semantic Entropy, by Benedict Aaron Tjandra et al.
-
Summary of Large Language Models Are In-context Preference Learners, by Chao Yu et al.
-
Summary of Sela: Tree-search Enhanced Llm Agents For Automated Machine Learning, by Yizhou Chi et al.
-
Summary of Lvsm: a Large View Synthesis Model with Minimal 3d Inductive Bias, by Haian Jin et al.
-
Summary of Inference with K-means, by Alfred K. Adzika and Prudence Djagba
-
Summary of Audio-driven Emotional 3d Talking-head Generation, by Wenqing Wang and Yun Fu
-
Summary of An Effective Theory Of Bias Amplification, by Arjun Subramonian et al.
-
Summary of Federated Brain Tumor Segmentation: An Extensive Benchmark, by Matthis Manthe (liris et al.
-
Summary of Automated Quality Control System For Canned Tuna Production Using Artificial Vision, by Sendey Vera et al.
-
Summary of Improving Insurance Catastrophic Data with Resampling and Gan Methods, by Norbert Dzadz and Maciej Romaniuk
-
Summary of Error Estimates Between Sgd with Momentum and Underdamped Langevin Diffusion, by Arnaud Guillin (lmbp) et al.
-
Summary of Literature Meets Data: a Synergistic Approach to Hypothesis Generation, by Haokun Liu et al.
-
Summary of Computing Optimal Regularizers For Online Linear Optimization, by Khashayar Gatmiry et al.
-
Summary of Xlstm-mixer: Multivariate Time Series Forecasting by Mixing Via Scalar Memories, By Maurice Kraus et al.
-
Summary of Bayes Without Underfitting: Fully Correlated Deep Learning Posteriors Via Alternating Projections, by Marco Miani et al.
-
Summary of Pyramid Vector Quantization For Llms, by Tycho F. A. Van Der Ouderaa et al.
-
Summary of Graph Neural Networks For Edge Signals: Orientation Equivariance and Invariance, by Dominik Fuchsgruber et al.
-
Summary of Isimed: a Framework For Self-supervised Learning Using Intrinsic Spatial Information in Medical Images, by Nabil Jabareen et al.
-
Summary of Sample-efficient Bayesian Optimisation Using Known Invariances, by Theodore Brown et al.
-
Summary of Learning Mathematical Rules with Large Language Models, by Antoine Gorceix et al.
-
Summary of Sample-efficient Geometry Reconstruction From Euclidean Distances Using Non-convex Optimization, by Ipsita Ghosh et al.
-
Summary of Lfme: a Simple Framework For Learning From Multiple Experts in Domain Generalization, by Liang Chen et al.
-
Summary of Deep Memory Search: a Metaheuristic Approach For Optimizing Heuristic Search, by Abdel-rahman Hedar and Alaa E. Abdel-hakim and Wael Deabes and Youseef Alotaibi and Kheir Eddine Bouazza
-
Summary of Optimizing Mixture-of-experts Inference Time Combining Model Deployment and Communication Scheduling, by Jialong Li et al.
-
Summary of Unstar: Unlearning with Self-taught Anti-sample Reasoning For Llms, by Yash Sinha et al.
-
Summary of Neuronal Competition Groups with Supervised Stdp For Spike-based Classification, by Gaspard Goupy and Pierre Tirilly and Ioan Marius Bilasco
-
Summary of Combinatorial Logistic Bandits, by Xutong Liu et al.
-
Summary of Optimal Design For Reward Modeling in Rlhf, by Antoine Scheid et al.
-
Summary of Human-llm Hybrid Text Answer Aggregation For Crowd Annotations, by Jiyi Li
-
Summary of Permutation Picture Of Graph Combinatorial Optimization Problems, by Yimeng Min