Paper List

We recommend you use the search box as this list is very long.

Summary of Measuring Memorization in Language Models Via Probabilistic Extraction, by Jamie Hayes et al.
Summary of Trade: Transfer Of Distributions Between External Conditions with Normalizing Flows, by Stefan Wahl et al.
Summary of Dmt-hi: Moe-based Hyperbolic Interpretable Deep Manifold Transformation For Unspervised Dimensionality Reduction, by Zelin Zang et al.
Summary of Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models, by Shuchen Meng et al.
Summary of Shap Zero Explains Genomic Models with Near-zero Marginal Cost For Future Queried Sequences, by Darin Tsui and Aryan Musharaf and Yigit Efe Erginbas and Justin Singh Kang and Amirali Aghazadeh
Summary of Chestnut: a Qos Dataset For Mobile Edge Environments, by Guobing Zou et al.
Summary of Spatioformer: a Geo-encoded Transformer For Large-scale Plant Species Richness Prediction, by Yiqing Guo et al.
Summary of A Survey Of Deep Graph Learning Under Distribution Shifts: From Graph Out-of-distribution Generalization to Adaptation, by Kexin Zhang et al.
Summary of Ripple: Accelerating Llm Inference on Smartphones with Correlation-aware Neuron Management, by Tuowei Wang et al.
Summary of Coordinated Reply Attacks in Influence Operations: Characterization and Detection, by Manita Pote et al.
Summary of Applying Sparse Autoencoders to Unlearn Knowledge in Language Models, by Eoin Farrell et al.
Summary of Golden Ratio-based Sufficient Dimension Reduction, by Wenjing Yang and Yuhong Yang
Summary of Flow Generator Matching, by Zemin Huang and Zhengyang Geng and Weijian Luo and Guo-jun Qi
Summary of A Stock Price Prediction Approach Based on Time Series Decomposition and Multi-scale Cnn Using Ohlct Images, by Zhiyuan Pei et al.
Summary of Coat: Compressing Optimizer States and Activation For Memory-efficient Fp8 Training, by Haocheng Xi et al.
Summary of A Prescriptive Theory For Brain-like Inference, by Hadi Vafaii et al.
Summary of Two Are Better Than One: Context Window Extension with Multi-grained Self-injection, by Wei Han et al.
Summary of Simpler Diffusion (sid2): 1.5 Fid on Imagenet512 with Pixel-space Diffusion, by Emiel Hoogeboom et al.
Summary of Interpreting Neural Networks Through Mahalanobis Distance, by Alan Oursland
Summary of Febim: Efficient and Compact Bayesian Inference Engine Empowered with Ferroelectric In-memory Computing, by Chao Li et al.
Summary of Capsule Endoscopy Multi-classification Via Gated Attention and Wavelet Transformations, by Lakshmi Srinivas Panchananam et al.
Summary of Bitpipe: Bidirectional Interleaved Pipeline Parallelism For Accelerating Large Models Training, by Houming Wu et al.
Summary of Notes on the Mathematical Structure Of Gpt Llm Architectures, by Spencer Becker-kahn
Summary of Visual Text Matters: Improving Text-kvqa with Visual Text Entity Knowledge-aware Large Multimodal Assistant, by Abhirama Subramanyam Penamakuri et al.
Summary of Initialization Matters: on the Benign Overfitting Of Two-layer Relu Cnn with Fully Trainable Layers, by Shuning Shang et al.
Summary of Structured Diffusion Models with Mixture Of Gaussians As Prior Distribution, by Nanshan Jia et al.
Summary of Learning Coupled Subspaces For Multi-condition Spike Data, by Yididiya Y. Nadew et al.
Summary of Adversarial Attacks on Large Language Models Using Regularized Relaxation, by Samuel Jacob Chacko et al.
Summary of Indication Finding: a Novel Use Case For Representation Learning, by Maren Eckhoff et al.
Summary of Perturbation-based Graph Active Learning For Weakly-supervised Belief Representation Learning, by Dachun Sun et al.
Summary of No Argument Left Behind: Overlapping Chunks For Faster Processing Of Arbitrarily Long Legal Texts, by Israel Fama et al.
Summary of Enriching Gnns with Text Contextual Representations For Detecting Disinformation Campaigns on Social Media, by Bruno Croso Cunha Da Silva et al.
Summary of Team: Topological Evolution-aware Framework For Traffic Forecasting–extended Version, by Duc Kieu et al.
Summary of Map: Multi-human-value Alignment Palette, by Xinran Wang et al.
Summary of Inference Time Llm Alignment in Single and Multidomain Preference Spectrum, by Sadat Shahriar et al.
Summary of Binary Classification: Is Boosting Stronger Than Bagging?, by Dimitris Bertsimas and Vasiliki Stoumpou
Summary of Predicting Liquidity Coverage Ratio with Gated Recurrent Units: a Deep Learning Model For Risk Management, by Zhen Xu et al.
Summary of Equitable Federated Learning with Activation Clustering, by Antesh Upadhyay and Abolfazl Hashemi
Summary of No Free Lunch: Fundamental Limits Of Learning Non-hallucinating Generative Models, by Changlong Wu et al.
Summary of Peptide-gpt: Generative Design Of Peptides Using Generative Pre-trained Transformers and Bio-informatic Supervision, by Aayush Shah and Chakradhar Guntuboina and Amir Barati Farimani
Summary of Can Stories Help Llms Reason? Curating Information Space Through Narrative, by Vahid Sadiri Javadi et al.
Summary of Hierarchical Mixture Of Experts: Generalizable Learning For High-level Synthesis, by Weikai Li et al.
Summary of Humanizing the Machine: Proxy Attacks to Mislead Llm Detectors, by Tianchun Wang et al.
Summary of Dual Space Training For Gans: a Pathway to Efficient and Creative Generative Models, by Beka Modrekiladze
Summary of Large Language Models For Financial Aid in Financial Time-series Forecasting, by Md Khairul Islam et al.
Summary of Mixture Of Parrots: Experts Improve Memorization More Than Reasoning, by Samy Jelassi et al.
Summary of Less Discriminatory Alternative and Interpretable Xgboost Framework For Binary Classification, by Andrew Pangia et al.
Summary of Newton Losses: Using Curvature Information For Learning with Differentiable Algorithms, by Felix Petersen et al.
Summary of An Investigation on Machine Learning Predictive Accuracy Improvement and Uncertainty Reduction Using Vae-based Data Augmentation, by Farah Alsafadi et al.
Summary of Target Strangeness: a Novel Conformal Prediction Difficulty Estimator, by Alexis Bose et al.
Summary of Fastsurvival: Hidden Computational Blessings in Training Cox Proportional Hazards Models, by Jiachang Liu et al.
Summary of Provable Tempered Overfitting Of Minimal Nets and Typical Nets, by Itamar Harel et al.
Summary of Inherently Interpretable Tree Ensemble Learning, by Zebin Yang et al.
Summary of Tesseraq: Ultra Low-bit Llm Post-training Quantization with Block Reconstruction, by Yuhang Li et al.
Summary of Conditional Diffusions For Amortized Neural Posterior Estimation, by Tianyu Chen et al.
Summary of Lanfl: Differentially Private Federated Learning with Large Language Models Using Synthetic Samples, by Huiyu Wu et al.
Summary of Bio2token: All-atom Tokenization Of Any Biomolecular Structure with Mamba, by Andrew Liu et al.
Summary of Llm Tree Search, by Dylan Wilson
Summary of Read-me: Refactorizing Llms As Router-decoupled Mixture Of Experts with System Co-design, by Ruisi Cai et al.
Summary of Research on Key Technologies For Cross-cloud Federated Training Of Large Language Models, by Haowei Yang et al.
Summary of A Spectral Method For Multi-view Subspace Learning Using the Product Of Projections, by Renat Sergazinov et al.
Summary of Maximum a Posteriori Inference For Factor Graphs Via Benders’ Decomposition, by Harsh Vardhan Dubey et al.
Summary of Context-aware Trajectory Anomaly Detection, by Haoji Hu et al.
Summary of Diff-instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences, by Weijian Luo
Summary of Using Parametric Pinns For Predicting Internal and External Turbulent Flows, by Shinjan Ghosh et al.
Summary of Missnodag: Differentiable Cyclic Causal Graph Learning From Incomplete Data, by Muralikrishnna G. Sethuraman et al.
Summary of A Random Matrix Theory Perspective on the Spectrum Of Learned Features and Asymptotic Generalization Capabilities, by Yatin Dandi et al.
Summary of Adjusted Overfitting Regression, by Dylan Wilson
Summary of Dynamic Vocabulary Pruning in Early-exit Llms, by Jort Vincenti et al.
Summary of Learning Structured Compressed Sensing with Automatic Resource Allocation, by Han Wang et al.
Summary of Stable Consistency Tuning: Understanding and Improving Consistency Models, by Fu-yun Wang et al.
Summary of On the Crucial Role Of Initialization For Matrix Factorization, by Bingcong Li et al.
Summary of Context Is Key: a Benchmark For Forecasting with Essential Textual Information, by Andrew Robert Williams et al.
Summary of Unbounded: a Generative Infinite Game Of Character Life Simulation, by Jialu Li et al.
Summary of Ferret-ui 2: Mastering Universal User Interface Understanding Across Platforms, by Zhangheng Li et al.
Summary of Camel-bench: a Comprehensive Arabic Lmm Benchmark, by Sara Ghaboura et al.
Summary of Deep Insights Into Cognitive Decline: a Survey Of Leveraging Non-intrusive Modalities with Deep Learning Techniques, by David Ortiz-perez et al.
Summary of Pixelgaussian: Generalizable 3d Gaussian Reconstruction From Arbitrary Views, by Xin Fei et al.
Summary of Vehiclesdf: a 3d Generative Model For Constrained Engineering Design Via Surrogate Modeling, by Hayata Morita et al.
Summary of Deterministic Fokker-planck Transport — with Applications to Sampling, Variational Inference, Kernel Mean Embeddings & Sequential Monte Carlo, by Ilja Klebanov
Summary of Make Llms Better Zero-shot Reasoners: Structure-orientated Autonomous Reasoning, by Pengfei He et al.
Summary of Whither Bias Goes, I Will Go: An Integrative, Systematic Review Of Algorithmic Bias Mitigation, by Louis Hickman et al.
Summary of Heterogeneous Random Forest, by Ye-eun Kim et al.
Summary of Hierarchical Multimodal Llms with Semantic Space Alignment For Enhanced Time Series Classification, by Xiaoyu Tao et al.
Summary of Baton: Enhancing Batch-wise Inference Efficiency For Large Language Models Via Dynamic Re-batching, by Peizhuang Cong et al.
Summary of Exploiting Interpretable Capabilities with Concept-enhanced Diffusion and Prototype Networks, by Alba Carballo-castro et al.
Summary of Retrieval-augmented Diffusion Models For Time Series Forecasting, by Jingwei Liu et al.
Summary of Does Differential Privacy Impact Bias in Pretrained Nlp Models?, by Md. Khairul Islam et al.
Summary of Citywide Electric Vehicle Charging Demand Prediction Approach Considering Urban Region and Dynamic Influences, by Haoxuan Kuang et al.
Summary of A Little Help Goes a Long Way: Efficient Llm Training by Leveraging Small Lms, By Ankit Singh Rawat et al.
Summary of Denoising Diffusion Probabilistic Models Are Optimally Adaptive to Unknown Low Dimensionality, by Zhihan Huang et al.
Summary of Warp-lca: Efficient Convolutional Sparse Coding with Locally Competitive Algorithm, by Geoffrey Kasenbacher et al.
Summary of Fast Constrained Sampling in Pre-trained Diffusion Models, by Alexandros Graikos et al.
Summary of A Combinatorial Approach to Neural Emergent Communication, by Zheyuan Zhang
Summary of From Imitation to Introspection: Probing Self-consciousness in Language Models, by Sirui Chen et al.
Summary of High-dimensional Analysis Of Knowledge Distillation: Weak-to-strong Generalization and Scaling Laws, by M. Emrullah Ildiz et al.
Summary of Learning to Explore with Lagrangians For Bandits Under Unknown Linear Constraints, by Udvas Das and Debabrota Basu
Summary of From Efficiency to Equity: Measuring Fairness in Preference Learning, by Shreeyash Gowaikar et al.
Summary of Probabilistic Language-image Pre-training, by Sanghyuk Chun and Wonjae Kim and Song Park and Sangdoo Yun
Summary of Fedspd: a Soft-clustering Approach For Personalized Decentralized Federated Learning, by I-cheng Lin et al.