Paper List
We recommend you use the search box as this list is very long.
-
Summary of Mind the Gap Between Prototypes and Images in Cross-domain Finetuning, by Hongduan Tian et al.
-
Summary of Retrieval-reasoning Large Language Model-based Synthetic Clinical Trial Generation, by Zerui Xu et al.
-
Summary of Sac-glam: Improving Online Rl For Llm Agents with Soft Actor-critic and Hindsight Relabeling, by Loris Gaven et al.
-
Summary of Model Balancing Helps Low-data Training and Fine-tuning, by Zihang Liu et al.
-
Summary of Exotst: Exogenous-aware Temporal Sequence Transformer For Time Series Prediction, by Kshitij Tayal et al.
-
Summary of Daq: Density-aware Post-training Weight-only Quantization For Llms, by Yingsong Luo et al.
-
Summary of Potential-based Intrinsic Motivation: Preserving Optimality with Complex, Non-markovian Shaping Rewards, by Grant C. Forbes et al.
-
Summary of Abnormality Forecasting: Time Series Anomaly Prediction Via Future Context Modeling, by Sinong Zhao et al.
-
Summary of Divide-verify-refine: Can Llms Self-align with Complex Instructions?, by Xianren Zhang et al.
-
Summary of Global Censored Quantile Random Forest, by Siyu Zhou et al.
-
Summary of Enhancing Llm Agents For Code Generation with Possibility and Pass-rate Prioritized Experience Replay, by Yuyang Chen et al.
-
Summary of Off-dynamics Conditional Diffusion Planners, by Wen Zheng Terence Ng et al.
-
Summary of Causally-aware Unsupervised Feature Selection Learning, by Zongxin Shen et al.
-
Summary of Devil in the Tail: a Multi-modal Framework For Drug-drug Interaction Prediction in Long Tail Distinction, by Liangwei Nathan Zheng et al.
-
Summary of Transfer Learning on Multi-dimensional Data: a Novel Approach to Neural Network-based Surrogate Modeling, by Adrienne M. Propp et al.
-
Summary of Dual Action Policy For Robust Sim-to-real Reinforcement Learning, by Ng Wen Zheng Terence et al.
-
Summary of Irregularity-informed Time Series Analysis: Adaptive Modelling Of Spatial and Temporal Dynamics, by Liangwei Nathan Zheng et al.
-
Summary of Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture Of Experts, by Fanqi Yan et al.
-
Summary of Optimizing Yolov5s Object Detection Through Knowledge Distillation Algorithm, by Guanming Huang et al.
-
Summary of Catch: Channel-aware Multivariate Time Series Anomaly Detection Via Frequency Patching, by Xingjian Wu et al.
-
Summary of Game Theory Meets Statistical Mechanics in Deep Learning Design, by Djamel Bouchaffra et al.
-
Summary of Stress Assessment with Convolutional Neural Network Using Ppg Signals, by Yasin Hasanpoor et al.
-
Summary of Ai-aided Kalman Filters, by Nir Shlezinger et al.
-
Summary of Bias Similarity Across Large Language Models, by Hyejun Jeong et al.
-
Summary of Moe-pruner: Pruning Mixture-of-experts Large Language Model Using the Hints From Its Router, by Yanyue Xie et al.
-
Summary of Geometric Inductive Biases Of Deep Networks: the Role Of Data and Architecture, by Sajad Movahedi et al.
-
Summary of A Survey on Deep Tabular Learning, by Shriyank Somvanshi et al.
-
Summary of Deep Optimal Sensor Placement For Black Box Stochastic Simulations, by Paula Cordero-encinar et al.
-
Summary of Testing Causal Explanations: a Case Study For Understanding the Effect Of Interventions on Chronic Kidney Disease, by Panayiotis Petousis et al.
-
Summary of Legallens Shared Task 2024: Legal Violation Identification in Unstructured Text, by Ben Hagag et al.
-
Summary of Taking Off the Rose-tinted Glasses: a Critical Look at Adversarial Ml Through the Lens Of Evasion Attacks, by Kevin Eykholt and Farhan Ahmed and Pratik Vaishnavi and Amir Rahmati
-
Summary of Comparative Performance Of Collaborative Bandit Algorithms: Effect Of Sparsity and Exploration Intensity, by Eren Ozbay
-
Summary of Bridging Large Language Models and Graph Structure Learning Models For Robust Representation Learning, by Guangxin Su et al.
-
Summary of The Persian Rug: Solving Toy Models Of Superposition Using Large-scale Symmetries, by Aditya Cowsik et al.
-
Summary of Scaling Laws For Post Training Quantized Large Language Models, by Zifei Xu et al.
-
Summary of What Do Llms Need to Understand Graphs: a Survey Of Parametric Representation Of Graphs, by Dongqi Fu et al.
-
Summary of Preference Optimization with Multi-sample Comparisons, by Chaoqi Wang et al.
-
Summary of Fragnet: a Graph Neural Network For Molecular Property Prediction with Four Layers Of Interpretability, by Gihan Panapitiya et al.
-
Summary of When to Trust Your Data: Enhancing Dyna-style Model-based Reinforcement Learning with Data Filter, by Yansong Li et al.
-
Summary of Table-llm-specialist: Language Model Specialists For Tables Using Iterative Generator-validator Fine-tuning, by Junjie Xing et al.
-
Summary of Nssi-net: a Multi-concept Gan For Non-suicidal Self-injury Detection Using High-dimensional Eeg in a Semi-supervised Framework, by Zhen Liang et al.
-
Summary of Reclaiming the Source Of Programmatic Policies: Programmatic Versus Latent Spaces, by Tales H. Carvalho et al.
-
Summary of Expected Sliced Transport Plans, by Xinran Liu et al.
-
Summary of Improving Long-text Alignment For Text-to-image Diffusion Models, by Luping Liu et al.
-
Summary of Bayesian Experimental Design Via Contrastive Diffusions, by Jacopo Iollo et al.
-
Summary of Mitigating Suboptimality Of Deterministic Policy Gradients in Complex Q-functions, by Ayush Jain et al.
-
Summary of A Hitchhiker’s Guide to Scaling Law Estimation, by Leshem Choshen et al.
-
Summary of A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference, by Yuan Li et al.
-
Summary of Moh: Multi-head Attention As Mixture-of-head Attention, by Peng Jin et al.
-
Summary of Neural Metamorphosis, by Xingyi Yang et al.
-
Summary of Simulation-based Inference with Scattering Representations: Scattering Is All You Need, by Kiyam Lin et al.
-
Summary of Flare: Faithful Logic-aided Reasoning and Exploration, by Erik Arakelyan et al.
-
Summary of A Scalable Communication Protocol For Networks Of Large Language Models, by Samuele Marro et al.
-
Summary of Spatial-temporal Bearing Fault Detection Using Graph Attention Networks and Lstm, by Moirangthem Tiken Singh et al.
-
Summary of Nrformer: Nationwide Nuclear Radiation Forecasting with Spatio-temporal Transformer, by Tengfei Lyu et al.
-
Summary of A Complete Decomposition Of Kl Error Using Refined Information and Mode Interaction Selection, by James Enouen and Mahito Sugiyama
-
Summary of Integrating Artificial Intelligence Models and Synthetic Image Data For Enhanced Asset Inspection and Defect Identification, by Reddy Mandati et al.
-
Summary of The Fair Language Model Paradox, by Andrea Pinto and Tomer Galanti and Randall Balestriero
-
Summary of Ddil: Improved Diffusion Distillation with Imitation Learning, by Risheek Garrepalli et al.
-
Summary of Age-of-gradient Updates For Federated Learning Over Random Access Channels, by Yu Heng Wu et al.
-
Summary of Parametric Model Reduction Of Mean-field and Stochastic Systems Via Higher-order Action Matching, by Jules Berman et al.
-
Summary of Beyond Labels: a Self-supervised Framework with Masked Autoencoders and Random Cropping For Breast Cancer Subtype Classification, by Annalisa Chiocchetti et al.
-
Summary of Improve Value Estimation Of Q Function and Reshape Reward with Monte Carlo Tree Search, by Jiamian Li
-
Summary of Rs-moco: a Deep Learning-based Topology-preserving Image Registration Method For Cardiac T1 Mapping, by Chiyi Huang et al.
-
Summary of Understanding Likelihood Over-optimisation in Direct Alignment Algorithms, by Zhengyan Shi et al.
-
Summary of Llm-mixer: Multiscale Mixing in Llms For Time Series Forecasting, by Md Kowsher et al.
-
Summary of State-space Models Can Learn In-context by Gradient Descent, By Neeraj Mohan Sushma et al.
-
Summary of Blendrl: a Framework For Merging Symbolic and Neural Policy Learning, by Hikaru Shindo et al.
-
Summary of On the Potential Of Optimal Transport in Geospatial Data Science, by Nina Wiedemann et al.
-
Summary of Zero-shot Model-based Reinforcement Learning Using Large Language Models, by Abdelhakim Benechehab et al.
-
Summary of Dyspec: Faster Speculative Decoding with Dynamic Token Tree Structure, by Yunfan Xiong et al.
-
Summary of Losam: Local Search in Additive Noise Models with Mixed Mechanisms and General Noise For Global Causal Discovery, by Sujai Hiremath et al.
-
Summary of Analyzing (in)abilities Of Saes Via Formal Languages, by Abhinav Menon et al.
-
Summary of Ecgn: a Cluster-aware Approach to Graph Neural Networks For Imbalanced Classification, by Bishal Thapaliya et al.
-
Summary of Encoding Architecture Algebra, by Stephane Bersier et al.
-
Summary of Layer-wise Importance Matters: Less Memory For Better Performance in Parameter-efficient Fine-tuning Of Large Language Models, by Kai Yao et al.
-
Summary of On the Training Convergence Of Transformers For In-context Classification Of Gaussian Mixtures, by Wei Shen et al.
-
Summary of Mllm Can See? Dynamic Correction Decoding For Hallucination Mitigation, by Chenxi Wang et al.
-
Summary of Language Models Encode Numbers Using Digit Representations in Base 10, by Amit Arnold Levy et al.
-
Summary of Selection-p: Self-supervised Task-agnostic Prompt Compression For Faithfulness and Transferability, by Tsz Ting Chung et al.
-
Summary of Foundts: Comprehensive and Unified Benchmarking Of Foundation Models For Time Series Forecasting, by Zhe Li et al.
-
Summary of Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws, by Yiding Jiang et al.
-
Summary of Can Sparse Autoencoders Make Sense Of Latent Representations?, by Viktoria Schuster
-
Summary of How Transformers Get Rich: Approximation and Dynamics Analysis, by Mingze Wang et al.
-
Summary of Poisson-dirac Neural Networks For Modeling Coupled Dynamical Systems Across Domains, by Razmik Arman Khosrovian et al.
-
Summary of Advancing Training Efficiency Of Deep Spiking Neural Networks Through Rate-based Backpropagation, by Chengting Yu et al.
-
Summary of On Rank-dependent Generalisation Error Bounds For Transformers, by Lan V. Truong
-
Summary of Offline Model-based Optimization by Learning to Rank, By Rong-xi Tan et al.
-
Summary of Network Representation Learning For Biophysical Neural Network Analysis, by Youngmok Ha et al.
-
Summary of Efficiera Residual Networks: Hardware-friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical Imagenet Accuracy, by Shuntaro Takahashi and Takuya Wakisaka and Hiroyuki Tokunaga
-
Summary of Loko: Low-rank Kalman Optimizer For Online Fine-tuning Of Large Models, by Hossein Abdi et al.
-
Summary of Data Quality Control in Federated Instruction-tuning Of Large Language Models, by Yaxin Du and Rui Ye and Fengting Yuchi and Wanru Zhao and Jingjing Qu and Yanfeng Wang and Siheng Chen
-
Summary of Why Go Full? Elevating Federated Learning Through Partial Network Updates, by Haolin Wang et al.
-
Summary of The Best Of Both Worlds: on the Dilemma Of Out-of-distribution Detection, by Qingyang Zhang et al.
-
Summary of Machine Learning Via Rough Mereology, by Lech T. Polkowski
-
Summary of On-the-fly Modulation For Balanced Multimodal Learning, by Yake Wei et al.
-
Summary of Baseflow Identification Via Explainable Ai with Kolmogorov-arnold Networks, by Chuyang Liu et al.
-
Summary of Paste: Improving the Efficiency Of Visual Anomaly Detection at the Edge, by Manuel Barusco et al.
-
Summary of Black-box Uncertainty Quantification Method For Llm-as-a-judge, by Nico Wagner et al.