Paper List
We recommend you use the search box as this list is very long.
-
Summary of Measuring Memorization in Language Models Via Probabilistic Extraction, by Jamie Hayes et al.
-
Summary of Trade: Transfer Of Distributions Between External Conditions with Normalizing Flows, by Stefan Wahl et al.
-
Summary of Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models, by Shuchen Meng et al.
-
Summary of Shap Zero Explains Genomic Models with Near-zero Marginal Cost For Future Queried Sequences, by Darin Tsui and Aryan Musharaf and Yigit Efe Erginbas and Justin Singh Kang and Amirali Aghazadeh
-
Summary of Chestnut: a Qos Dataset For Mobile Edge Environments, by Guobing Zou et al.
-
Summary of Spatioformer: a Geo-encoded Transformer For Large-scale Plant Species Richness Prediction, by Yiqing Guo et al.
-
Summary of A Survey Of Deep Graph Learning Under Distribution Shifts: From Graph Out-of-distribution Generalization to Adaptation, by Kexin Zhang et al.
-
Summary of Ripple: Accelerating Llm Inference on Smartphones with Correlation-aware Neuron Management, by Tuowei Wang et al.
-
Summary of Coordinated Reply Attacks in Influence Operations: Characterization and Detection, by Manita Pote et al.
-
Summary of Applying Sparse Autoencoders to Unlearn Knowledge in Language Models, by Eoin Farrell et al.
-
Summary of Golden Ratio-based Sufficient Dimension Reduction, by Wenjing Yang and Yuhong Yang
-
Summary of Flow Generator Matching, by Zemin Huang and Zhengyang Geng and Weijian Luo and Guo-jun Qi
-
Summary of A Stock Price Prediction Approach Based on Time Series Decomposition and Multi-scale Cnn Using Ohlct Images, by Zhiyuan Pei et al.
-
Summary of Coat: Compressing Optimizer States and Activation For Memory-efficient Fp8 Training, by Haocheng Xi et al.
-
Summary of A Prescriptive Theory For Brain-like Inference, by Hadi Vafaii et al.
-
Summary of Two Are Better Than One: Context Window Extension with Multi-grained Self-injection, by Wei Han et al.
-
Summary of Simpler Diffusion (sid2): 1.5 Fid on Imagenet512 with Pixel-space Diffusion, by Emiel Hoogeboom et al.
-
Summary of Interpreting Neural Networks Through Mahalanobis Distance, by Alan Oursland
-
Summary of Febim: Efficient and Compact Bayesian Inference Engine Empowered with Ferroelectric In-memory Computing, by Chao Li et al.
-
Summary of Capsule Endoscopy Multi-classification Via Gated Attention and Wavelet Transformations, by Lakshmi Srinivas Panchananam et al.
-
Summary of Bitpipe: Bidirectional Interleaved Pipeline Parallelism For Accelerating Large Models Training, by Houming Wu et al.
-
Summary of Notes on the Mathematical Structure Of Gpt Llm Architectures, by Spencer Becker-kahn
-
Summary of Visual Text Matters: Improving Text-kvqa with Visual Text Entity Knowledge-aware Large Multimodal Assistant, by Abhirama Subramanyam Penamakuri et al.
-
Summary of Initialization Matters: on the Benign Overfitting Of Two-layer Relu Cnn with Fully Trainable Layers, by Shuning Shang et al.
-
Summary of Structured Diffusion Models with Mixture Of Gaussians As Prior Distribution, by Nanshan Jia et al.
-
Summary of Learning Coupled Subspaces For Multi-condition Spike Data, by Yididiya Y. Nadew et al.
-
Summary of Adversarial Attacks on Large Language Models Using Regularized Relaxation, by Samuel Jacob Chacko et al.
-
Summary of Indication Finding: a Novel Use Case For Representation Learning, by Maren Eckhoff et al.
-
Summary of Perturbation-based Graph Active Learning For Weakly-supervised Belief Representation Learning, by Dachun Sun et al.
-
Summary of No Argument Left Behind: Overlapping Chunks For Faster Processing Of Arbitrarily Long Legal Texts, by Israel Fama et al.
-
Summary of Enriching Gnns with Text Contextual Representations For Detecting Disinformation Campaigns on Social Media, by Bruno Croso Cunha Da Silva et al.
-
Summary of Team: Topological Evolution-aware Framework For Traffic Forecasting–extended Version, by Duc Kieu et al.
-
Summary of Map: Multi-human-value Alignment Palette, by Xinran Wang et al.
-
Summary of Inference Time Llm Alignment in Single and Multidomain Preference Spectrum, by Sadat Shahriar et al.
-
Summary of Binary Classification: Is Boosting Stronger Than Bagging?, by Dimitris Bertsimas and Vasiliki Stoumpou
-
Summary of Predicting Liquidity Coverage Ratio with Gated Recurrent Units: a Deep Learning Model For Risk Management, by Zhen Xu et al.
-
Summary of Equitable Federated Learning with Activation Clustering, by Antesh Upadhyay and Abolfazl Hashemi
-
Summary of No Free Lunch: Fundamental Limits Of Learning Non-hallucinating Generative Models, by Changlong Wu et al.
-
Summary of Peptide-gpt: Generative Design Of Peptides Using Generative Pre-trained Transformers and Bio-informatic Supervision, by Aayush Shah and Chakradhar Guntuboina and Amir Barati Farimani
-
Summary of Can Stories Help Llms Reason? Curating Information Space Through Narrative, by Vahid Sadiri Javadi et al.
-
Summary of Hierarchical Mixture Of Experts: Generalizable Learning For High-level Synthesis, by Weikai Li et al.
-
Summary of Humanizing the Machine: Proxy Attacks to Mislead Llm Detectors, by Tianchun Wang et al.
-
Summary of Dual Space Training For Gans: a Pathway to Efficient and Creative Generative Models, by Beka Modrekiladze
-
Summary of Large Language Models For Financial Aid in Financial Time-series Forecasting, by Md Khairul Islam et al.
-
Summary of Mixture Of Parrots: Experts Improve Memorization More Than Reasoning, by Samy Jelassi et al.
-
Summary of Less Discriminatory Alternative and Interpretable Xgboost Framework For Binary Classification, by Andrew Pangia et al.
-
Summary of Newton Losses: Using Curvature Information For Learning with Differentiable Algorithms, by Felix Petersen et al.
-
Summary of An Investigation on Machine Learning Predictive Accuracy Improvement and Uncertainty Reduction Using Vae-based Data Augmentation, by Farah Alsafadi et al.
-
Summary of Target Strangeness: a Novel Conformal Prediction Difficulty Estimator, by Alexis Bose et al.
-
Summary of Fastsurvival: Hidden Computational Blessings in Training Cox Proportional Hazards Models, by Jiachang Liu et al.
-
Summary of Provable Tempered Overfitting Of Minimal Nets and Typical Nets, by Itamar Harel et al.
-
Summary of Inherently Interpretable Tree Ensemble Learning, by Zebin Yang et al.
-
Summary of Tesseraq: Ultra Low-bit Llm Post-training Quantization with Block Reconstruction, by Yuhang Li et al.
-
Summary of Conditional Diffusions For Amortized Neural Posterior Estimation, by Tianyu Chen et al.
-
Summary of Lanfl: Differentially Private Federated Learning with Large Language Models Using Synthetic Samples, by Huiyu Wu et al.
-
Summary of Bio2token: All-atom Tokenization Of Any Biomolecular Structure with Mamba, by Andrew Liu et al.
-
Summary of Llm Tree Search, by Dylan Wilson
-
Summary of Read-me: Refactorizing Llms As Router-decoupled Mixture Of Experts with System Co-design, by Ruisi Cai et al.
-
Summary of Research on Key Technologies For Cross-cloud Federated Training Of Large Language Models, by Haowei Yang et al.
-
Summary of A Spectral Method For Multi-view Subspace Learning Using the Product Of Projections, by Renat Sergazinov et al.
-
Summary of Maximum a Posteriori Inference For Factor Graphs Via Benders’ Decomposition, by Harsh Vardhan Dubey et al.
-
Summary of Context-aware Trajectory Anomaly Detection, by Haoji Hu et al.
-
Summary of Diff-instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences, by Weijian Luo
-
Summary of Using Parametric Pinns For Predicting Internal and External Turbulent Flows, by Shinjan Ghosh et al.
-
Summary of Missnodag: Differentiable Cyclic Causal Graph Learning From Incomplete Data, by Muralikrishnna G. Sethuraman et al.
-
Summary of A Random Matrix Theory Perspective on the Spectrum Of Learned Features and Asymptotic Generalization Capabilities, by Yatin Dandi et al.
-
Summary of Adjusted Overfitting Regression, by Dylan Wilson
-
Summary of Dynamic Vocabulary Pruning in Early-exit Llms, by Jort Vincenti et al.
-
Summary of Learning Structured Compressed Sensing with Automatic Resource Allocation, by Han Wang et al.
-
Summary of Stable Consistency Tuning: Understanding and Improving Consistency Models, by Fu-yun Wang et al.
-
Summary of On the Crucial Role Of Initialization For Matrix Factorization, by Bingcong Li et al.
-
Summary of Context Is Key: a Benchmark For Forecasting with Essential Textual Information, by Andrew Robert Williams et al.
-
Summary of Unbounded: a Generative Infinite Game Of Character Life Simulation, by Jialu Li et al.
-
Summary of Ferret-ui 2: Mastering Universal User Interface Understanding Across Platforms, by Zhangheng Li et al.
-
Summary of Camel-bench: a Comprehensive Arabic Lmm Benchmark, by Sara Ghaboura et al.
-
Summary of Deep Insights Into Cognitive Decline: a Survey Of Leveraging Non-intrusive Modalities with Deep Learning Techniques, by David Ortiz-perez et al.
-
Summary of Pixelgaussian: Generalizable 3d Gaussian Reconstruction From Arbitrary Views, by Xin Fei et al.
-
Summary of Vehiclesdf: a 3d Generative Model For Constrained Engineering Design Via Surrogate Modeling, by Hayata Morita et al.
-
Summary of Deterministic Fokker-planck Transport — with Applications to Sampling, Variational Inference, Kernel Mean Embeddings & Sequential Monte Carlo, by Ilja Klebanov
-
Summary of Make Llms Better Zero-shot Reasoners: Structure-orientated Autonomous Reasoning, by Pengfei He et al.
-
Summary of Whither Bias Goes, I Will Go: An Integrative, Systematic Review Of Algorithmic Bias Mitigation, by Louis Hickman et al.
-
Summary of Heterogeneous Random Forest, by Ye-eun Kim et al.
-
Summary of Hierarchical Multimodal Llms with Semantic Space Alignment For Enhanced Time Series Classification, by Xiaoyu Tao et al.
-
Summary of Baton: Enhancing Batch-wise Inference Efficiency For Large Language Models Via Dynamic Re-batching, by Peizhuang Cong et al.
-
Summary of Exploiting Interpretable Capabilities with Concept-enhanced Diffusion and Prototype Networks, by Alba Carballo-castro et al.
-
Summary of Does Differential Privacy Impact Bias in Pretrained Nlp Models?, by Md. Khairul Islam et al.
-
Summary of Citywide Electric Vehicle Charging Demand Prediction Approach Considering Urban Region and Dynamic Influences, by Haoxuan Kuang et al.
-
Summary of A Little Help Goes a Long Way: Efficient Llm Training by Leveraging Small Lms, By Ankit Singh Rawat et al.
-
Summary of Denoising Diffusion Probabilistic Models Are Optimally Adaptive to Unknown Low Dimensionality, by Zhihan Huang et al.
-
Summary of Warp-lca: Efficient Convolutional Sparse Coding with Locally Competitive Algorithm, by Geoffrey Kasenbacher et al.
-
Summary of Fast Constrained Sampling in Pre-trained Diffusion Models, by Alexandros Graikos et al.
-
Summary of A Combinatorial Approach to Neural Emergent Communication, by Zheyuan Zhang
-
Summary of From Imitation to Introspection: Probing Self-consciousness in Language Models, by Sirui Chen et al.
-
Summary of High-dimensional Analysis Of Knowledge Distillation: Weak-to-strong Generalization and Scaling Laws, by M. Emrullah Ildiz et al.
-
Summary of Learning to Explore with Lagrangians For Bandits Under Unknown Linear Constraints, by Udvas Das and Debabrota Basu
-
Summary of From Efficiency to Equity: Measuring Fairness in Preference Learning, by Shreeyash Gowaikar et al.
-
Summary of Probabilistic Language-image Pre-training, by Sanghyuk Chun and Wonjae Kim and Song Park and Sangdoo Yun
-
Summary of Fedspd: a Soft-clustering Approach For Personalized Decentralized Federated Learning, by I-cheng Lin et al.