Paper List
We recommend you use the search box as this list is very long.
-
Summary of A Hopfieldian View-based Interpretation For Chain-of-thought Reasoning, by Lijie Hu et al.
-
Summary of Self-supervised Time-series Anomaly Detection Using Learnable Data Augmentation, by Kukjin Choi et al.
-
Summary of Investigating Data Usage For Inductive Conformal Predictors, by Yizirui Fang and Anthony Bellotti
-
Summary of Sagdfn: a Scalable Adaptive Graph Diffusion Forecasting Network For Multivariate Time Series Forecasting, by Yue Jiang et al.
-
Summary of Soft Prompting For Unlearning in Large Language Models, by Karuna Bhaila et al.
-
Summary of Not All Prompts Are Made Equal: Prompt-based Pruning Of Text-to-image Diffusion Models, by Alireza Ganjdanesh et al.
-
Summary of Fawn: Floor-and-walls Normal Regularization For Direct Neural Tsdf Reconstruction, by Anna Sokolova et al.
-
Summary of Learning Molecular Representation in a Cell, by Gang Liu et al.
-
Summary of A Scalable and Effective Alternative to Graph Transformers, by Kaan Sancak et al.
-
Summary of Not Eliminate but Aggregate: Post-hoc Control Over Mixture-of-experts to Address Shortcut Shifts in Natural Language Understanding, by Ukyo Honda et al.
-
Summary of Entropic Regression Dmd (erdmd) Discovers Informative Sparse and Nonuniformly Time Delayed Models, by Christopher W. Curtis et al.
-
Summary of Stnagnn: Spatiotemporal Node Attention Graph Neural Network For Task-based Fmri Analysis, by Jiyao Wang et al.
-
Summary of Dtgb: a Comprehensive Benchmark For Dynamic Text-attributed Graphs, by Jiasheng Zhang et al.
-
Summary of Multi-dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint, by Xinglong Sun et al.
-
Summary of Cuqds: Conformal Uncertainty Quantification Under Distribution Shift For Trajectory Prediction, by Huiqun Huang et al.
-
Summary of Is Poisoning a Real Threat to Llm Alignment? Maybe More So Than You Think, by Pankayaraj Pathmanathan et al.
-
Summary of Uncertainty Modeling For Fine-tuned Implicit Functions, by Anna Susmelj et al.
-
Summary of End-to-end Text-to-sql Generation Within An Analytics Insight Engine, by Karime Maamari et al.
-
Summary of Enhancing Text Classification Through Llm-driven Active Learning and Human Annotation, by Hamidreza Rouzegar et al.
-
Summary of Adding Conditional Control to Diffusion Models with Reinforcement Learning, by Yulai Zhao et al.
-
Summary of Efficient Sequential Decision Making with Large Language Models, by Dingyang Chen et al.
-
Summary of Deploying Scalable Traffic Prediction Models For Efficient Management in Real-world Large Transportation Networks During Hurricane Evacuations, by Qinhua Jiang et al.
-
Summary of Cot Flow: Learning Optimal-transport Image Sampling and Editing by Contrastive Pairs, By Xinrui Zu et al.
-
Summary of Graph Knowledge Distillation to Mixture Of Experts, by Pavel Rumiantsev and Mark Coates
-
Summary of Interpretable Modulated Differentiable Stft and Physics-informed Balanced Spectrum Metric For Freight Train Wheelset Bearing Cross-machine Transfer Fault Diagnosis Under Speed Fluctuations, by Chao He and Hongmei Shi and Ruixin Li and Jianbo Li and Zujun Yu
-
Summary of Rethinking Spatio-temporal Transformer For Traffic Prediction:multi-level Multi-view Augmented Learning Framework, by Jiaqi Lin and Qianqian Ren
-
Summary of Job-sdf: a Multi-granularity Dataset For Job Skill Demand Forecasting and Benchmarking, by Xi Chen et al.
-
Summary of Flexcare: Leveraging Cross-task Synergy For Flexible Multimodal Healthcare Prediction, by Muhao Xu et al.
-
Summary of From Crowdsourced Data to High-quality Benchmarks: Arena-hard and Benchbuilder Pipeline, by Tianle Li et al.
-
Summary of Long-time Asymptotics Of Noisy Svgd Outside the Population Limit, by Victor Priser (s2a et al.
-
Summary of Bridging Design Gaps: a Parametric Data Completion Approach with Graph Guided Diffusion Models, by Rui Zhou et al.
-
Summary of Transcoders Find Interpretable Llm Feature Circuits, by Jacob Dunefsky and Philippe Chlenski and Neel Nanda
-
Summary of Gaugllm: Improving Graph Contrastive Learning For Text-attributed Graphs with Large Language Models, by Yi Fang et al.
-
Summary of Dialogue Action Tokens: Steering Language Models in Goal-directed Dialogue with a Multi-turn Planner, by Kenneth Li et al.
-
Summary of Decomposed Evaluations Of Geographic Disparities in Text-to-image Models, by Abhishek Sureddy et al.
-
Summary of The Benefits and Risks Of Transductive Approaches For Ai Fairness, by Muhammed Razzak et al.
-
Summary of Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization, by Seungwoo Son et al.
-
Summary of Lilium: Ebay’s Large Language Models For E-commerce, by Christian Herold and Michael Kozielski and Leonid Ekimov and Pavel Petrushkov and Pierre-yves Vandenbussche and Shahram Khadivi
-
Summary of Sparsity-constraint Optimization Via Splicing Iteration, by Zezhi Wang et al.
-
Summary of Large Scale Transfer Learning For Tabular Data Via Language Modeling, by Josh Gardner et al.
-
Summary of Self-moe: Towards Compositional Large Language Models with Self-specialized Experts, by Junmo Kang et al.
-
Summary of Datacomp-lm: in Search Of the Next Generation Of Training Sets For Language Models, by Jeffrey Li et al.
-
Summary of Computationally Efficient Rl Under Linear Bellman Completeness For Deterministic Dynamics, by Runzhe Wu et al.
-
Summary of Stochastic Neural Network Symmetrisation in Markov Categories, by Rob Cornish
-
Summary of Iterative Length-regularized Direct Preference Optimization: a Case Study on Improving 7b Language Models to Gpt-4 Level, by Jie Liu et al.
-
Summary of Spectral Introspection Identifies Group Training Dynamics in Deep Neural Networks For Neuroimaging, by Bradley T. Baker et al.
-
Summary of Wpo: Enhancing Rlhf with Weighted Preference Optimization, by Wenxuan Zhou et al.
-
Summary of Learning Sum Of Diverse Features: Computational Hardness and Efficient Gradient-based Training For Ridge Combinations, by Kazusato Oko et al.
-
Summary of Mmdu: a Multi-turn Multi-image Dialog Understanding Benchmark and Instruction-tuning Dataset For Lvlms, by Ziyu Liu et al.
-
Summary of Mdpo: Conditional Preference Optimization For Multimodal Large Language Models, by Fei Wang et al.
-
Summary of The Earlybird Gets the Worm: Heuristically Accelerating Earlybird Convergence, by Adithya Vasudev
-
Summary of Matrix-free Jacobian Chaining, by Uwe Naumann
-
Summary of Applications Of Explainable Artificial Intelligence in Earth System Science, by Feini Huang et al.
-
Summary of Unraveling the Mechanics Of Learning-based Demonstration Selection For In-context Learning, by Hui Liu et al.
-
Summary of Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns, by Haoren Zhu et al.
-
Summary of Digirl: Training In-the-wild Device-control Agents with Autonomous Reinforcement Learning, by Hao Bai et al.
-
Summary of A Benchmark For Maximum Cut: Towards Standardization Of the Evaluation Of Learned Heuristics For Combinatorial Optimization, by Ankur Nath et al.
-
Summary of Towards Better Benchmark Datasets For Inductive Knowledge Graph Completion, by Harry Shomer et al.
-
Summary of Mixture-of-subspaces in Low-rank Adaptation, by Taiqiang Wu et al.
-
Summary of A Notion Of Complexity For Theory Of Mind Via Discrete World Models, by X. Angelo Huang et al.
-
Summary of Initial Investigation Of Kolmogorov-arnold Networks (kans) As Feature Extractors For Imu Based Human Activity Recognition, by Mengxi Liu et al.
-
Summary of The Role Of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation, by Noah Golowich and Ankur Moitra
-
Summary of Score-fpinn: Fractional Score-based Physics-informed Neural Networks For High-dimensional Fokker-planck-levy Equations, by Zheyuan Hu et al.
-
Summary of Edge Classification on Graphs: New Directions in Topological Imbalance, by Xueqi Cheng et al.
-
Summary of Optimizing Instructions and Demonstrations For Multi-stage Language Model Programs, by Krista Opsahl-ong et al.
-
Summary of Unveiling Multiple Descents in Unsupervised Autoencoders, by Kobi Rahimi et al.
-
Summary of Nemotron-4 340b Technical Report, by Nvidia: Bo Adler et al.
-
Summary of Measuring Memorization in Rlhf For Code Completion, by Aneesh Pappu et al.
-
Summary of Scalable Expressiveness Through Preprocessed Graph Perturbations, by Danial Saber and Amirali Salehi-abari
-
Summary of Refusal in Language Models Is Mediated by a Single Direction, By Andy Arditi et al.
-
Summary of Zero-shot Generalization During Instruction Tuning: Insights From Similarity and Granularity, by Bingxiang He et al.
-
Summary of To Clip or Not to Clip: the Dynamics Of Sgd with Gradient Clipping in High-dimensions, by Noah Marshall et al.
-
Summary of Transcendence: Generative Models Can Outperform the Experts That Train Them, by Edwin Zhang et al.
-
Summary of A Semantic-aware Layer-freezing Approach to Computation-efficient Fine-tuning Of Language Models, by Jian Gu et al.
-
Summary of Joint Linked Component Analysis For Multiview Data, by Lin Xiao et al.
-
Summary of Optimal Transport-assisted Risk-sensitive Q-learning, by Zahra Shahrooei and Ali Baheri
-
Summary of Compact Proofs Of Model Performance Via Mechanistic Interpretability, by Jason Gross et al.
-
Summary of Split, Unlearn, Merge: Leveraging Data Attributes For More Effective Unlearning in Llms, by Swanand Ravindra Kadhe et al.
-
Summary of Cell Your Model: Contrastive Explanations For Large Language Models, by Ronny Luss et al.
-
Summary of Efficient Discovery Of Significant Patterns with Few-shot Resampling, by Leonardo Pellegrina and Fabio Vandin
-
Summary of Physics-constrained Learning For Pde Systems with Uncertainty Quantified Port-hamiltonian Models, by Kaiyuan Tan et al.
-
Summary of Just How Flexible Are Neural Networks in Practice?, by Ravid Shwartz-ziv and Micah Goldblum and Arpit Bansal and C. Bayan Bruss and Yann Lecun and Andrew Gordon Wilson
-
Summary of Constrained Reinforcement Learning with Average Reward Objective: Model-based and Model-free Algorithms, by Vaneet Aggarwal et al.
-
Summary of Active Clustering with Bandit Feedback, by Victor Thuot (mistea) et al.
-
Summary of Teleporter Theory: a General and Simple Approach For Modeling Cross-world Counterfactual Causality, by Jiangmeng Li et al.
-
Summary of Revisiting Spurious Correlation in Domain Generalization, by Bin Qin et al.
-
Summary of Analysing Zero-shot Temporal Relation Extraction on Clinical Notes Using Temporal Consistency, by Vasiliki Kougia et al.
-
Summary of Explainable Artificial Intelligence and Multicollinearity : a Mini Review Of Current Approaches, by Ahmed M Salih
-
Summary of Fullcert: Deterministic End-to-end Certification For Training and Inference Of Neural Networks, by Tobias Lorenz et al.
-
Summary of Do Parameters Reveal More Than Loss For Membership Inference?, by Anshuman Suri et al.
-
Summary of An Imitative Reinforcement Learning Framework For Autonomous Dogfight, by Siyuan Li et al.
-
Summary of On Gnn Explanability with Activation Rules, by Luca Veyrin-forrer et al.
-
Summary of Standardizing Structural Causal Models, by Weronika Ormaniec et al.
-
Summary of Words in Motion: Extracting Interpretable Control Vectors For Motion Transformers, by Omer Sahin Tas and Royden Wagner
-
Summary of Long Code Arena: a Set Of Benchmarks For Long-context Code Models, by Egor Bogomolov et al.