Paper List
We recommend you use the search box as this list is very long.
-
Summary of Graph Of Records: Boosting Retrieval Augmented Generation For Long-context Summarization with Graphs, by Haozhen Zhang et al.
-
Summary of Liger Kernel: Efficient Triton Kernels For Llm Training, by Pin-lun Hsu et al.
-
Summary of One Language, Many Gaps: Evaluating Dialect Fairness and Robustness Of Large Language Models in Reasoning Tasks, by Fangru Lin et al.
-
Summary of When Does Perceptual Alignment Benefit Vision Representations?, by Shobhita Sundaram et al.
-
Summary of High-fidelity 3d Lung Ct Synthesis in Ards Swine Models Using Score-based 3d Residual Diffusion Models, by Siyeop Yoon et al.
-
Summary of Focus on What Matters: Separated Models For Visual-based Rl Generalization, by Di Zhang et al.
-
Summary of Lotus: Learning-based Online Thermal and Latency Variation Management For Two-stage Detectors on Edge Devices, by Yifan Gong et al.
-
Summary of Duo-llm: a Framework For Studying Adaptive Computation in Large Language Models, by Keivan Alizadeh et al.
-
Summary of Continuous Approximations For Improving Quantization Aware Training Of Llms, by He Li et al.
-
Summary of Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths, by Yew Ken Chia et al.
-
Summary of Superficial Safety Alignment Hypothesis, by Jianwei Li and Jung-eun Kim
-
Summary of Application Of Notebooklm, a Large Language Model with Retrieval-augmented Generation, For Lung Cancer Staging, by Ryota Tozuka and Hisashi Johno and Akitomo Amakawa and Junichi Sato and Mizuki Muto and Shoichiro Seki and Atsushi Komaba and Hiroshi Onishi
-
Summary of Fill in the Gaps: Model Calibration and Generalization with Synthetic Data, by Yang Ba et al.
-
Summary of Large Continual Instruction Assistant, by Jingyang Qiao and Zhizhong Zhang and Xin Tan and Yanyun Qu and Shouhong Ding and Yuan Xie
-
Summary of Portllm: Personalizing Evolving Large Language Models with Training-free and Portable Model Patches, by Rana Muhammad Shahroz Khan et al.
-
Summary of Freqmark: Frequency-based Watermark For Sentence-level Detection Of Llm-generated Text, by Zhenyu Xu and Kun Zhang and Victor S. Sheng
-
Summary of Herald: a Natural Language Annotated Lean 4 Dataset, by Guoxiong Gao et al.
-
Summary of Enhancing Vision-language Model Pre-training with Image-text Pair Pruning Based on Word Frequency, by Mingliang Liang et al.
-
Summary of Replicable Uniformity Testing, by Sihan Liu et al.
-
Summary of Actnas : Generating Efficient Yolo Models Using Activation Nas, by Sudhakar Sah et al.
-
Summary of Fine-tuning Can Help Detect Pretraining Data From Large Language Models, by Hengxiang Zhang et al.
-
Summary of Come: Test-time Adaption by Conservatively Minimizing Entropy, By Qingyang Zhang et al.
-
Summary of Seedlm: Compressing Llm Weights Into Seeds Of Pseudo-random Generators, by Rasoul Shafipour et al.
-
Summary of Towards Llm-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection, by Giorgos Iacovides et al.
-
Summary of Towards Calibrated Losses For Adversarial Robust Reject Option Classification, by Vrund Shah et al.
-
Summary of Asymptotic Analysis Of Sample-averaged Q-learning, by Saunak Kumar Panda et al.
-
Summary of Adversarially Robust Out-of-distribution Detection Using Lyapunov-stabilized Embeddings, by Hossein Mirzaei et al.
-
Summary of Sensorbench: Benchmarking Llms in Coding-based Sensor Processing, by Pengrui Quan et al.
-
Summary of Enhancing Jepas with Spatial Conditioning: Robust and Efficient Representation Learning, by Etai Littwin et al.
-
Summary of Aflow: Automating Agentic Workflow Generation, by Jiayi Zhang et al.
-
Summary of When Attention Sink Emerges in Language Models: An Empirical View, by Xiangming Gu et al.
-
Summary of On Information-theoretic Measures Of Predictive Uncertainty, by Kajetan Schweighofer et al.
-
Summary of Semantic Image Inversion and Editing Using Rectified Stochastic Differential Equations, by Litu Rout et al.
-
Summary of Context-parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance, by Sachin Goyal et al.
-
Summary of Tl-pca: Transfer Learning Of Principal Component Analysis, by Sharon Hendy et al.
-
Summary of Mix Data or Merge Models? Optimizing For Diverse Multi-task Learning, by Aakanksha et al.
-
Summary of Trajdiffuse: a Conditional Diffusion Model For Environment-aware Trajectory Prediction, by Qingze (tony) Liu et al.
-
Summary of Hard-constrained Neural Networks with Universal Approximation Guarantees, by Youngjae Min et al.
-
Summary of Deep Linear Probe Generators For Weight Space Learning, by Jonathan Kahana et al.
-
Summary of Hart: Efficient Visual Generation with Hybrid Autoregressive Transformer, by Haotian Tang et al.
-
Summary of Lvd-2m: a Long-take Video Dataset with Temporally Dense Captions, by Tianwei Xiong et al.
-
Summary of Burning Red: Unlocking Subtask-driven Reinforcement Learning and Risk-awareness in Average-reward Markov Decision Processes, by Juan Sebastian Rojas et al.
-
Summary of Regularized Robustly Reliable Learners and Instance Targeted Attacks, by Avrim Blum et al.
-
Summary of Stackfeed: Structured Textual Actor-critic Knowledge Base Editing with Feedback, by Naman Gupta et al.
-
Summary of Topofr: a Closer Look at Topology Alignment on Face Recognition, by Jun Dan et al.
-
Summary of Trestle: a Model Of Concept Formation in Structured Domains, by Christopher J. Maclellan et al.
-
Summary of Neural Networks That Overcome Classic Challenges Through Practice, by Kazuki Irie et al.
-
Summary of Multi-modal Vision Pre-training For Medical Image Analysis, by Shaohao Rui et al.
-
Summary of Lambda-skip Connections: the Architectural Component That Prevents Rank Collapse, by Federico Arangath Joseph et al.
-
Summary of High-dimensional Differential Parameter Inference in Exponential Family Using Time Score Matching, by Daniel J. Williams et al.
-
Summary of Robust Gradient Descent For Phase Retrieval, by Alex Buna et al.
-
Summary of Echo State Networks For Spatio-temporal Area-level Data, by Zhenhua Wang et al.
-
Summary of Transforming Game Play: a Comparative Study Of Dcqn and Dtqn Architectures in Reinforcement Learning, by William A. Stigall
-
Summary of A Simple Baseline For Predicting Events with Auto-regressive Tabular Transformers, by Alex Stein and Samuel Sharpe and Doron Bergman and Senthil Kumar and C. Bayan Bruss and John Dickerson and Tom Goldstein and Micah Goldblum
-
Summary of Cross-modal Few-shot Learning: a Generative Transfer Learning Framework, by Zhengwei Yang et al.
-
Summary of Enhancing Robustness in Deep Reinforcement Learning: a Lyapunov Exponent Approach, by Rory Young et al.
-
Summary of Combinatorial Multi-armed Bandits: Arm Selection Via Group Testing, by Arpan Mukherjee et al.
-
Summary of Double Jeopardy and Climate Impact in the Use Of Large Language Models: Socio-economic Disparities and Reduced Utility For Non-english Speakers, by Aivin V. Solatorio et al.
-
Summary of Sampa: Sharpness-aware Minimization Parallelized, by Wanyun Xie et al.
-
Summary of Dynamical Loss Functions Shape Landscape Topography and Improve Learning in Artificial Neural Networks, by Eduardo Lavin and Miguel Ruiz-garcia
-
Summary of Early Diagnosis Of Acute Lymphoblastic Leukemia Using Yolov8 and Yolov11 Deep Learning Models, by Alaa Awad et al.
-
Summary of Principled Bayesian Optimisation in Collaboration with Human Experts, by Wenjie Xu et al.
-
Summary of Moirai-moe: Empowering Time Series Foundation Models with Sparse Mixture Of Experts, by Xu Liu et al.
-
Summary of Information Propagation Dynamics in Deep Graph Networks, by Alessio Gravina
-
Summary of Model-based Privacy-preserving Knowledge Transfer For Large Language Models, by Zhaomin Wu et al.
-
Summary of The Implicit Bias Of Structured State Space Models Can Be Poisoned with Clean Labels, by Yonatan Slutzky et al.
-
Summary of A Practical Approach to Causal Inference Over Time, by Martina Cinquini et al.
-
Summary of Comparison Of Deep Learning and Conventional Methods For Disease Onset Prediction, by Luis H. John et al.
-
Summary of A Kernelizable Primal-dual Formulation Of the Multilinear Singular Value Decomposition, by Frederiek Wesel et al.
-
Summary of Artificial Intelligence-based Triaging Of Cutaneous Melanocytic Lesions, by Ruben T. Lucassen et al.
-
Summary of Unigem: a Unified Approach to Generation and Property Prediction For Molecules, by Shikun Feng et al.
-
Summary of Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation, by Kemal Davaslioglu et al.
-
Summary of Get Rid Of Isolation: a Continuous Multi-task Spatio-temporal Learning Framework, by Zhongchao Yi et al.
-
Summary of Inverse Problems and Data Assimilation: a Machine Learning Approach, by Eviatar Bach et al.
-
Summary of Transparent Networks For Multivariate Time Series, by Minkyu Kim et al.
-
Summary of Non-convergence to Global Minimizers in Data Driven Supervised Deep Learning: Adam and Stochastic Gradient Descent Optimization Provably Fail to Converge to Global Minimizers in the Training Of Deep Neural Networks with Relu Activation, by Thang Do and Sonja Hannibal and Arnulf Jentzen
-
Summary of Data-driven Approaches For Modelling Target Behaviour, by Isabel Schlangen et al.
-
Summary of Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era Of Large Language Models, by Shubham Kumar Nigam et al.
-
Summary of Graph Classification Gaussian Processes Via Hodgelet Spectral Features, by Mathieu Alain et al.
-
Summary of Rosar: An Adversarial Re-training Framework For Robust Side-scan Sonar Object Detection, by Martin Aubard et al.
-
Summary of Slanc: Static Layernorm Calibration, by Mahsa Salmani et al.
-
Summary of Feature Averaging: An Implicit Bias Of Gradient Descent Leading to Non-robustness in Neural Networks, by Binghui Li et al.
-
Summary of Graphclip: Enhancing Transferability in Graph Foundation Models For Text-attributed Graphs, by Yun Zhu et al.
-
Summary of Comat: Chain Of Mathematically Annotated Thought Improves Mathematical Reasoning, by Joshua Ong Jun Leang et al.
-
Summary of Replay-and-forget-free Graph Class-incremental Learning: a Task Profiling and Prompting Approach, by Chaoxi Niu et al.
-
Summary of Optimal Time Complexity Algorithms For Computing General Random Walk Graph Kernels on Sparse Graphs, by Krzysztof Choromanski et al.
-
Summary of Spegcl: Self-supervised Graph Spectrum Contrastive Learning Without Positive Samples, by Yuntao Shou et al.
-
Summary of Bookworm: a Dataset For Character Description and Analysis, by Argyrios Papoudakis et al.
-
Summary of Sharpness-aware Minimization Efficiently Selects Flatter Minima Late in Training, by Zhanpeng Zhou et al.
-
Summary of Learning Sub-second Routing Optimization in Computer Networks Requires Packet-level Dynamics, by Andreas Boltres et al.
-
Summary of Stein Variational Evolution Strategies, by Cornelius V. Braun et al.
-
Summary of Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal, by Juliusz Ziomek et al.
-
Summary of Gift-eval: a Benchmark For General Time Series Forecasting Model Evaluation, by Taha Aksu et al.
-
Summary of Improved Depth Estimation Of Bayesian Neural Networks, by Bart Van Erp and Bert De Vries
-
Summary of Tighter Risk Bounds For Mixtures Of Experts, by Wissam Akretche et al.
-
Summary of Deterministic Apple Tasting, by Zachary Chase and Idan Mehalel
-
Summary of Coupled Autoregressive Active Inference Agents For Control Of Multi-joint Dynamical Systems, by Tim N. Nisslbeck et al.