Paper List

We recommend you use the search box as this list is very long.

Summary of Mind the Gap Between Prototypes and Images in Cross-domain Finetuning, by Hongduan Tian et al.
Summary of Retrieval-reasoning Large Language Model-based Synthetic Clinical Trial Generation, by Zerui Xu et al.
Summary of Sac-glam: Improving Online Rl For Llm Agents with Soft Actor-critic and Hindsight Relabeling, by Loris Gaven et al.
Summary of Model Balancing Helps Low-data Training and Fine-tuning, by Zihang Liu et al.
Summary of Exotst: Exogenous-aware Temporal Sequence Transformer For Time Series Prediction, by Kshitij Tayal et al.
Summary of Daq: Density-aware Post-training Weight-only Quantization For Llms, by Yingsong Luo et al.
Summary of Potential-based Intrinsic Motivation: Preserving Optimality with Complex, Non-markovian Shaping Rewards, by Grant C. Forbes et al.
Summary of Abnormality Forecasting: Time Series Anomaly Prediction Via Future Context Modeling, by Sinong Zhao et al.
Summary of Divide-verify-refine: Can Llms Self-align with Complex Instructions?, by Xianren Zhang et al.
Summary of Global Censored Quantile Random Forest, by Siyu Zhou et al.
Summary of Enhancing Llm Agents For Code Generation with Possibility and Pass-rate Prioritized Experience Replay, by Yuyang Chen et al.
Summary of Off-dynamics Conditional Diffusion Planners, by Wen Zheng Terence Ng et al.
Summary of Causally-aware Unsupervised Feature Selection Learning, by Zongxin Shen et al.
Summary of Devil in the Tail: a Multi-modal Framework For Drug-drug Interaction Prediction in Long Tail Distinction, by Liangwei Nathan Zheng et al.
Summary of Transfer Learning on Multi-dimensional Data: a Novel Approach to Neural Network-based Surrogate Modeling, by Adrienne M. Propp et al.
Summary of Dual Action Policy For Robust Sim-to-real Reinforcement Learning, by Ng Wen Zheng Terence et al.
Summary of Irregularity-informed Time Series Analysis: Adaptive Modelling Of Spatial and Temporal Dynamics, by Liangwei Nathan Zheng et al.
Summary of Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture Of Experts, by Fanqi Yan et al.
Summary of Optimizing Yolov5s Object Detection Through Knowledge Distillation Algorithm, by Guanming Huang et al.
Summary of Catch: Channel-aware Multivariate Time Series Anomaly Detection Via Frequency Patching, by Xingjian Wu et al.
Summary of Game Theory Meets Statistical Mechanics in Deep Learning Design, by Djamel Bouchaffra et al.
Summary of Stress Assessment with Convolutional Neural Network Using Ppg Signals, by Yasin Hasanpoor et al.
Summary of Ai-aided Kalman Filters, by Nir Shlezinger et al.
Summary of Bias Similarity Across Large Language Models, by Hyejun Jeong et al.
Summary of Moe-pruner: Pruning Mixture-of-experts Large Language Model Using the Hints From Its Router, by Yanyue Xie et al.
Summary of Geometric Inductive Biases Of Deep Networks: the Role Of Data and Architecture, by Sajad Movahedi et al.
Summary of A Survey on Deep Tabular Learning, by Shriyank Somvanshi et al.
Summary of Deep Optimal Sensor Placement For Black Box Stochastic Simulations, by Paula Cordero-encinar et al.
Summary of Testing Causal Explanations: a Case Study For Understanding the Effect Of Interventions on Chronic Kidney Disease, by Panayiotis Petousis et al.
Summary of Legallens Shared Task 2024: Legal Violation Identification in Unstructured Text, by Ben Hagag et al.
Summary of Taking Off the Rose-tinted Glasses: a Critical Look at Adversarial Ml Through the Lens Of Evasion Attacks, by Kevin Eykholt and Farhan Ahmed and Pratik Vaishnavi and Amir Rahmati
Summary of Comparative Performance Of Collaborative Bandit Algorithms: Effect Of Sparsity and Exploration Intensity, by Eren Ozbay
Summary of Bridging Large Language Models and Graph Structure Learning Models For Robust Representation Learning, by Guangxin Su et al.
Summary of The Persian Rug: Solving Toy Models Of Superposition Using Large-scale Symmetries, by Aditya Cowsik et al.
Summary of Scaling Laws For Post Training Quantized Large Language Models, by Zifei Xu et al.
Summary of What Do Llms Need to Understand Graphs: a Survey Of Parametric Representation Of Graphs, by Dongqi Fu et al.
Summary of Preference Optimization with Multi-sample Comparisons, by Chaoqi Wang et al.
Summary of Fragnet: a Graph Neural Network For Molecular Property Prediction with Four Layers Of Interpretability, by Gihan Panapitiya et al.
Summary of When to Trust Your Data: Enhancing Dyna-style Model-based Reinforcement Learning with Data Filter, by Yansong Li et al.
Summary of Table-llm-specialist: Language Model Specialists For Tables Using Iterative Generator-validator Fine-tuning, by Junjie Xing et al.
Summary of Nssi-net: a Multi-concept Gan For Non-suicidal Self-injury Detection Using High-dimensional Eeg in a Semi-supervised Framework, by Zhen Liang et al.
Summary of Reclaiming the Source Of Programmatic Policies: Programmatic Versus Latent Spaces, by Tales H. Carvalho et al.
Summary of Expected Sliced Transport Plans, by Xinran Liu et al.
Summary of Improving Long-text Alignment For Text-to-image Diffusion Models, by Luping Liu et al.
Summary of Bayesian Experimental Design Via Contrastive Diffusions, by Jacopo Iollo et al.
Summary of Mitigating Suboptimality Of Deterministic Policy Gradients in Complex Q-functions, by Ayush Jain et al.
Summary of A Hitchhiker’s Guide to Scaling Law Estimation, by Leshem Choshen et al.
Summary of A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference, by Yuan Li et al.
Summary of Moh: Multi-head Attention As Mixture-of-head Attention, by Peng Jin et al.
Summary of Neural Metamorphosis, by Xingyi Yang et al.
Summary of Simulation-based Inference with Scattering Representations: Scattering Is All You Need, by Kiyam Lin et al.
Summary of Flare: Faithful Logic-aided Reasoning and Exploration, by Erik Arakelyan et al.
Summary of A Scalable Communication Protocol For Networks Of Large Language Models, by Samuele Marro et al.
Summary of Spatial-temporal Bearing Fault Detection Using Graph Attention Networks and Lstm, by Moirangthem Tiken Singh et al.
Summary of Nrformer: Nationwide Nuclear Radiation Forecasting with Spatio-temporal Transformer, by Tengfei Lyu et al.
Summary of A Complete Decomposition Of Kl Error Using Refined Information and Mode Interaction Selection, by James Enouen and Mahito Sugiyama
Summary of Integrating Artificial Intelligence Models and Synthetic Image Data For Enhanced Asset Inspection and Defect Identification, by Reddy Mandati et al.
Summary of The Fair Language Model Paradox, by Andrea Pinto and Tomer Galanti and Randall Balestriero
Summary of Ddil: Improved Diffusion Distillation with Imitation Learning, by Risheek Garrepalli et al.
Summary of Disp-llm: Dimension-independent Structural Pruning For Large Language Models, by Shangqian Gao and Chi-heng Lin and Ting Hua and Tang Zheng and Yilin Shen and Hongxia Jin and Yen-chang Hsu
Summary of Age-of-gradient Updates For Federated Learning Over Random Access Channels, by Yu Heng Wu et al.
Summary of Parametric Model Reduction Of Mean-field and Stochastic Systems Via Higher-order Action Matching, by Jules Berman et al.
Summary of Beyond Labels: a Self-supervised Framework with Masked Autoencoders and Random Cropping For Breast Cancer Subtype Classification, by Annalisa Chiocchetti et al.
Summary of Improve Value Estimation Of Q Function and Reshape Reward with Monte Carlo Tree Search, by Jiamian Li
Summary of Rs-moco: a Deep Learning-based Topology-preserving Image Registration Method For Cardiac T1 Mapping, by Chiyi Huang et al.
Summary of Understanding Likelihood Over-optimisation in Direct Alignment Algorithms, by Zhengyan Shi et al.
Summary of Llm-mixer: Multiscale Mixing in Llms For Time Series Forecasting, by Md Kowsher et al.
Summary of State-space Models Can Learn In-context by Gradient Descent, By Neeraj Mohan Sushma et al.
Summary of Blendrl: a Framework For Merging Symbolic and Neural Policy Learning, by Hikaru Shindo et al.
Summary of On the Potential Of Optimal Transport in Geospatial Data Science, by Nina Wiedemann et al.
Summary of Zero-shot Model-based Reinforcement Learning Using Large Language Models, by Abdelhakim Benechehab et al.
Summary of Dyspec: Faster Speculative Decoding with Dynamic Token Tree Structure, by Yunfan Xiong et al.
Summary of Losam: Local Search in Additive Noise Models with Mixed Mechanisms and General Noise For Global Causal Discovery, by Sujai Hiremath et al.
Summary of Analyzing (in)abilities Of Saes Via Formal Languages, by Abhinav Menon et al.
Summary of Ecgn: a Cluster-aware Approach to Graph Neural Networks For Imbalanced Classification, by Bishal Thapaliya et al.
Summary of Encoding Architecture Algebra, by Stephane Bersier et al.
Summary of Layer-wise Importance Matters: Less Memory For Better Performance in Parameter-efficient Fine-tuning Of Large Language Models, by Kai Yao et al.
Summary of On the Training Convergence Of Transformers For In-context Classification Of Gaussian Mixtures, by Wei Shen et al.
Summary of Mllm Can See? Dynamic Correction Decoding For Hallucination Mitigation, by Chenxi Wang et al.
Summary of Language Models Encode Numbers Using Digit Representations in Base 10, by Amit Arnold Levy et al.
Summary of Selection-p: Self-supervised Task-agnostic Prompt Compression For Faithfulness and Transferability, by Tsz Ting Chung et al.
Summary of Foundts: Comprehensive and Unified Benchmarking Of Foundation Models For Time Series Forecasting, by Zhe Li et al.
Summary of Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws, by Yiding Jiang et al.
Summary of Can Sparse Autoencoders Make Sense Of Latent Representations?, by Viktoria Schuster
Summary of How Transformers Get Rich: Approximation and Dynamics Analysis, by Mingze Wang et al.
Summary of Poisson-dirac Neural Networks For Modeling Coupled Dynamical Systems Across Domains, by Razmik Arman Khosrovian et al.
Summary of Advancing Training Efficiency Of Deep Spiking Neural Networks Through Rate-based Backpropagation, by Chengting Yu et al.
Summary of On Rank-dependent Generalisation Error Bounds For Transformers, by Lan V. Truong
Summary of Offline Model-based Optimization by Learning to Rank, By Rong-xi Tan et al.
Summary of Network Representation Learning For Biophysical Neural Network Analysis, by Youngmok Ha et al.
Summary of Efficiera Residual Networks: Hardware-friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical Imagenet Accuracy, by Shuntaro Takahashi and Takuya Wakisaka and Hiroyuki Tokunaga
Summary of Loko: Low-rank Kalman Optimizer For Online Fine-tuning Of Large Models, by Hossein Abdi et al.
Summary of Data Quality Control in Federated Instruction-tuning Of Large Language Models, by Yaxin Du and Rui Ye and Fengting Yuchi and Wanru Zhao and Jingjing Qu and Yanfeng Wang and Siheng Chen
Summary of Why Go Full? Elevating Federated Learning Through Partial Network Updates, by Haolin Wang et al.
Summary of The Best Of Both Worlds: on the Dilemma Of Out-of-distribution Detection, by Qingyang Zhang et al.
Summary of Machine Learning Via Rough Mereology, by Lech T. Polkowski
Summary of On-the-fly Modulation For Balanced Multimodal Learning, by Yake Wei et al.
Summary of Baseflow Identification Via Explainable Ai with Kolmogorov-arnold Networks, by Chuyang Liu et al.
Summary of Paste: Improving the Efficiency Of Visual Anomaly Detection at the Edge, by Manuel Barusco et al.
Summary of Black-box Uncertainty Quantification Method For Llm-as-a-judge, by Nico Wagner et al.