Paper List
We recommend you use the search box as this list is very long.
-
Summary of TorchTitan: One-stop PyTorch Native Solution For Production Ready LLM Pre-training, by Wanchao Liang et al.
-
Summary of Diffgad: a Diffusion-based Unsupervised Graph Anomaly Detector, by Jinghan Li et al.
-
Summary of Do Great Minds Think Alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA, by Maharshi Gor et al.
-
Summary of Harnessing the Power Of Noise: a Survey Of Techniques and Applications, by Reyhaneh Abdolazimi et al.
-
Summary of Sketch to Adapt: Fine-tunable Sketches For Efficient LLM Adaptation, by Tianyi Zhang et al.
-
Summary of Communication-efficient Federated Group Distributionally Robust Optimization, by Zhishuai Guo et al.
-
Summary of Physics-informed Regularization For Domain-agnostic Dynamical System Modeling, by Zijie Huang et al.
-
Summary of Unveiling the Backbone-optimizer Coupling Bias in Visual Representation Learning, by Siyuan Li et al.
-
Summary of Humvi: a Multilingual Dataset For Detecting Violent Incidents Impacting Humanitarian Aid, by Hemank Lamba et al.
-
Summary of Covering Numbers For Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression, by Weigutian Ou et al.
-
Summary of Adver-city: Open-source Multi-modal Dataset For Collaborative Perception Under Adverse Weather Conditions, by Mateus Karvat et al.
-
Summary of Multimodal Representation Learning Using Adaptive Graph Construction, by Weichen Huang
-
Summary of Provable Accuracy Bounds For Hybrid Dynamical Optimization and Sampling, by Matthew X. Burns et al.
-
Summary of Adaptive Random Fourier Features Training Stabilized by Resampling with Applications in Image Regression, by Aku Kammonen et al.
-
Summary of Topology-agnostic Graph U-Nets For Scalar Field Prediction on Unstructured Meshes, by Kevin Ferguson et al.
-
Summary of A Skewness-based Criterion For Addressing Heteroscedastic Noise in Causal Discovery, by Yingyu Lin et al.
-
Summary of Automating Data Science Pipelines with Tensor Completion, by Shaan Pakala et al.
-
Summary of Stochastic Sparse Sampling: a Framework For Variable-length Medical Time Series Classification, by Xavier Mootoo et al.
-
Summary of Predicting Battery Capacity Fade Using Probabilistic Machine Learning Models with and Without Pre-trained Priors, by Michael J. Kenney et al.
-
Summary of Restructuring Vector Quantization with the Rotation Trick, by Christopher Fifty et al.
-
Summary of Fairedu: a Multiple Regression-based Method For Enhancing Fairness in Machine Learning Models For Educational Applications, by Nga Pham et al.
-
Summary of NLP Case Study on Predicting the Before and After Of the Ukraine-Russia and Hamas-Israel Conflicts, by Jordan Miner and John E. Ortega
-
Summary of Stress Detection on Code-mixed Texts in Dravidian Languages Using Machine Learning, by L. Ramos et al.
-
Summary of Evolve: Evaluating and Optimizing LLMs For Exploration, by Allen Nie et al.
-
Summary of Symdiff: Equivariant Diffusion Via Stochastic Symmetrisation, by Leo Zhang et al.
-
Summary of Shade: Deep Density-based Clustering, by Anna Beer et al.
-
Summary of Think While You Generate: Discrete Diffusion with Planned Denoising, by Sulin Liu et al.
-
Summary of Mixture Compressor For Mixture-of-experts LLMs Gains More, by Wei Huang et al.
-
Summary of Solving Functional Optimization with Deep Networks and Variational Principles, by Kawisorn Kamtue et al.
-
Summary of Non-halting Queries: Exploiting Fixed Points in LLMs, by Ghaith Hammouri et al.
-
Summary of Accelerated Preference Optimization For Large Language Model Alignment, by Jiafan He et al.
-
Summary of Conformal Structured Prediction, by Botong Zhang et al.
-
Summary of Amortized SHAP Values Via Sparse Fourier Function Approximation, by Ali Gorji et al.
-
Summary of Compositional Risk Minimization, by Divyat Mahajan et al.
-
Summary of Learning in Complex Action Spaces Without Policy Gradients, by Arash Tavakoli et al.
-
Summary of Bayesian Estimation and Tuning-free Rank Detection For Probability Mass Function Tensors, by Joseph K. Chege et al.
-
Summary of Differentiation Through Black-box Quadratic Programming Solvers, by Connor W. Magoon et al.
-
Summary of Auto-evolve: Enhancing Large Language Model’s Performance Via Self-reasoning Framework, by Krishna Aswani et al.
-
Summary of Locate-then-edit For Multi-hop Factual Recall Under Knowledge Editing, by Zhuoran Zhang et al.
-
Summary of Filtered Randomized Smoothing: a New Defense For Robust Modulation Classification, by Wenhan Zhang et al.
-
Summary of Batched Bayesian Optimization with Correlated Candidate Uncertainties, by Jenna Fromer et al.
-
Summary of Fedgraph: a Research Library and Benchmark For Federated Graph Learning, by Yuhang Yao et al.
-
Summary of Tree-based Leakage Inspection and Control in Concept Bottleneck Models, by Angelos Ragkousis et al.
-
Summary of Uncertainty Estimation Via Ensembles Of Deep Learning Models and Dropout Layers For Seismic Traces, by Giovanni Messuti et al.
-
Summary of Zero-shot Learning Of Causal Models, by Divyat Mahajan et al.
-
Summary of Estimating the Number Of HTTP/3 Responses in QUIC Using Deep Learning, by Barak Gahtan et al.
-
Summary of Quality Diversity Imitation Learning, by Zhenglin Wan et al.
-
Summary of Markov Equivalence and Consistency in Differentiable Structure Learning, by Chang Deng et al.
-
Summary of Qgym: Scalable Simulation and Benchmarking Of Queuing Network Controllers, by Haozhe Chen et al.
-
Summary of Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines, by Edward Milsom et al.
-
Summary of Neural-bayesian Program Learning For Few-shot Dialogue Intent Parsing, by Mengze Hong et al.
-
Summary of Benign Overfitting For Regression with Trained Two-layer ReLU Networks, by Junhyung Park et al.
-
Summary of Round and Round We Go! What Makes Rotary Positional Encodings Useful?, by Federico Barbero et al.
-
Summary of Leanagent: Lifelong Learning For Formal Theorem Proving, by Adarsh Kumarappan et al.
-
Summary of Solving Robust MDPs As a Sequence Of Static RL Problems, by Adil Zouitine et al.
-
Summary of RL, but Don’t Do Anything I Wouldn’t Do, by Michael K. Cohen et al.
-
Summary of Dataenvgym: Data Generation Agents in Teacher Environments with Student Feedback, by Zaid Khan et al.
-
Summary of A Timeline and Analysis For Representation Plasticity in Large Language Models, by Akshat Kannan
-
Summary of Relitlrm: Generative Relightable Radiance For Large Reconstruction Models, by Tianyuan Zhang et al.
-
Summary of Parameter Choice and Neuro-symbolic Approaches For Deep Domain-invariant Learning, by Marius-Constantin Dinu
-
Summary of Teochat: a Large Vision-language Assistant For Temporal Earth Observation Data, by Jeremy Andrew Irvin et al.
-
Summary of Unsupervised Model Diagnosis, by Yinong Oliver Wang et al.
-
Summary of Generalizing to Any Diverse Distribution: Uniformity, Gentle Finetuning and Rebalancing, by Andreas Loukas et al.
-
Summary of Long-context LLMs Meet RAG: Overcoming Challenges For Long Inputs in RAG, by Bowen Jin et al.
-
Summary of Asynchronous Stochastic Gradient Descent with Decoupled Backpropagation and Layer-wise Updates, by Cabrel Teguemne Fokam et al.
-
Summary of Generalized Sparse Additive Model with Unknown Link Function, by Peipei Yuan et al.
-
Summary of Utilizing Lyapunov Exponents in Designing Deep Neural Networks, by Tirthankar Mittra
-
Summary of Is the MMI Criterion Necessary For Interpretability? Degenerating Non-causal Features to Plain Noise For Self-rationalization, by Wei Liu et al.
-
Summary of Unveiling Transformer Perception by Exploring Input Manifolds, by Alessandro Benfenati and Alfio Ferrara and Alessio Marta and Davide Riva and Elisabetta Rocchetti
-
Summary of Qt-dog: Quantization-aware Training For Domain Generalization, by Saqib Javed et al.
-
Summary of Sparse Repellency For Shielded Generation in Text-to-image Diffusion Models, by Michael Kirchhof et al.
-
Summary of Jet Expansions Of Residual Computation, by Yihong Chen et al.
-
Summary of Qera: An Analytical Framework For Quantization Error Reconstruction, by Cheng Zhang et al.
-
Summary of Extracting Finite State Machines From Transformers, by Rik Adriaensen et al.
-
Summary of Hierarchical Matrix Completion For the Prediction Of Properties Of Binary Mixtures, by Dominik Gond et al.
-
Summary of Gaussian-based and Outside-the-box Runtime Monitoring Join Forces, by Vahid Hashemi et al.
-
Summary of Posets and Bounded Probabilities For Discovering Order-inducing Features in Event Knowledge Graphs, by Christoffer Olling Back and Jakob Grue Simonsen
-
Summary of Enforcing Interpretability in Time Series Transformers: a Concept Bottleneck Framework, by Angela Van Sprang et al.
-
Summary of Contrastive Learning to Fine-tune Feature Extraction Models For the Visual Cortex, by Alex Mulrooney and Austin J. Brockmeier
-
Summary of Scalable Mechanistic Neural Networks For Differential Equations and Machine Learning, by Jiale Chen et al.
-
Summary of Diversity-rewarded CFG Distillation, by Geoffrey Cideron et al.
-
Summary of Continuous Contrastive Learning For Long-tailed Semi-supervised Recognition, by Zi-Hao Zhou and Siyuan Fang and Zi-Jing Zhou and Tong Wei and Yuanyu Wan and Min-Ling Zhang
-
Summary of Contextual Bandits with Non-stationary Correlated Rewards For User Association in mmWave Vehicular Networks, by Xiaoyang He et al.
-
Summary of Enhanced Feature Based Granular Ball Twin Support Vector Machine, by A. Quadir et al.
-
Summary of Extended Convexity and Smoothness and Their Applications in Deep Learning, by Binchuan Qi et al.
-
Summary of Uncertainty-aware Fairness-adaptive Classification Trees, by Anna Gottard and Vanessa Verrina and Sabrina Giordano
-
Summary of Cap: Detecting Unauthorized Data Usage in Generative Models Via Prompt Generation, by Daniela Gallo et al.
-
Summary of Melissadl X Breed: Towards Data-efficient On-line Supervised Training Of Multi-parametric Surrogates with Active Learning, by Sofya Dymchenko (datamove) et al.
-
Summary of Stochastic Bandits For Egalitarian Assignment, by Eugene Lim et al.
-
Summary of Time Transfer: on Optimal Learning Rate and Batch Size in the Infinite Data Limit, by Oleg Filatov et al.
-
Summary of Improved Sample Complexity For Private Nonsmooth Nonconvex Optimization, by Guy Kornowski et al.
-
Summary of Ordering-based Causal Discovery For Linear and Nonlinear Relations, by Zhuopeng Xu et al.
-
Summary of Deep Learning-based Fault Identification in Condition Monitoring, by Hariom Dhungana et al.
-
Summary of Dimol: Dimensional Awareness As a New ‘dimension’ in Operator Learning, by Yichen Song et al.
-
Summary of Manifolds, Random Matrices and Spectral Gaps: the Geometric Phases Of Generative Diffusion, by Enrico Ventura et al.
-
Summary of Brain-inspired Continual Pre-trained Learner Via Silent Synaptic Consolidation, by Xuming Ran et al.
-
Summary of Accelerating Error Correction Code Transformers, by Matan Levy et al.
-
Summary of Active Evaluation Acquisition For Efficient LLM Benchmarking, by Yang Li et al.
-
Summary of Single Point-based Distributed Zeroth-order Optimization with a Non-convex Stochastic Objective Function, by Elissa Mhanna and Mohamad Assaad