Paper List
We recommend you use the search box as this list is very long.
-
Summary of Why Llms Are Bad at Synthetic Table Generation (and What to Do About It), by Shengzhe Xu et al.
-
Summary of Connecting the Dots: Llms Can Infer and Verbalize Latent Structure From Disparate Training Data, by Johannes Treutlein et al.
-
Summary of Consistency Models Made Easy, by Zhengyang Geng et al.
-
Summary of Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier Ai Models, by Sunny Duan et al.
-
Summary of Meat: Median-ensemble Adversarial Training For Improving Robustness and Generalization, by Zhaozhe Hu et al.
-
Summary of Learning to Discover Knowledge: a Weakly-supervised Partial Domain Adaptation Approach, by Mengcheng Lan et al.
-
Summary of Veriflow: Modeling Distributions For Neural Network Verification, by Faried Abu Zaid et al.
-
Summary of Fairx: a Comprehensive Benchmarking Tool For Model Analysis Using Fairness, Utility, and Explainability, by Md Fahim Sikder et al.
-
Summary of Revisiting Modularity Maximization For Graph Clustering: a Contrastive Learning Perspective, by Yunfei Liu et al.
-
Summary of Identifiable Exchangeable Mechanisms For Causal Structure and Representation Learning, by Patrik Reizinger et al.
-
Summary of Mind the Privacy Unit! User-level Differential Privacy For Language Model Fine-tuning, by Lynn Chua et al.
-
Summary of Computing Within Limits: An Empirical Study Of Energy Consumption in Ml Training and Inference, by Ioannis Mavromatis and Kostas Katsaros and Aftab Khan
-
Summary of Revealing the Learning Process in Reinforcement Learning Agents Through Attention-oriented Metrics, by Charlotte Beylier et al.
-
Summary of Adaptive Adversarial Cross-entropy Loss For Sharpness-aware Minimization, by Tanapat Ratchatorn et al.
-
Summary of Hotpp Benchmark: Are We Good at the Long Horizon Events Forecasting?, by Ivan Karpukhin et al.
-
Summary of Can You Trust Your Explanations? a Robustness Test For Feature Attribution Methods, by Ilaria Vascotto et al.
-
Summary of Ce-ssl: Computation-efficient Semi-supervised Learning For Ecg-based Cardiovascular Diseases Detection, by Rushuang Zhou et al.
-
Summary of Active Diffusion Subsampling, by Oisin Nolan et al.
-
Summary of How Far Are Today’s Time-series Models From Real-world Weather Forecasting Applications?, by Tao Han et al.
-
Summary of Jailbreaking As a Reward Misspecification Problem, by Zhihui Xie et al.
-
Summary of Fair Streaming Feature Selection, by Zhangling Duan et al.
-
Summary of Fvel: Interactive Formal Verification Environment with Large Language Models Via Theorem Proving, by Xiaohan Lin et al.
-
Summary of Predicting Probabilities Of Error to Combine Quantization and Early Exiting: Quee, by Florence Regol et al.
-
Summary of Syndarin: Synthesising Datasets For Automated Reasoning in Low-resource Languages, by Gayane Ghazaryan et al.
-
Summary of Urban-focused Multi-task Offline Reinforcement Learning with Contrastive Data Sharing, by Xinbo Zhao et al.
-
Summary of Flocora: Federated Learning Compression with Low-rank Adaptation, by Lucas Grativol Ribeiro et al.
-
Summary of Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits, by Ziyi Huang et al.
-
Summary of Seg-lstm: Performance Of Xlstm For Semantic Segmentation Of Remotely Sensed Images, by Qinfeng Zhu et al.
-
Summary of Teaching Models to Survive: Proper Scoring Rule and Stochastic Optimization with Competing Risks, by Julie Alberge (soda) et al.
-
Summary of Semi Supervised Heterogeneous Domain Adaptation Via Disentanglement and Pseudo-labelling, by Cassio F. Dantas (evergreen et al.
-
Summary of Graph Neural Networks For Job Shop Scheduling Problems: a Survey, by Igor G. Smit et al.
-
Summary of Memory-efficient Gradient Unrolling For Large-scale Bi-level Optimization, by Qianli Shen et al.
-
Summary of Measuring Sample Importance in Data Pruning For Language Models Based on Information Entropy, by Minsang Kim et al.
-
Summary of Finding Safety Neurons in Large Language Models, by Jianhui Chen et al.
-
Summary of Multi-modal Transfer Learning Between Biological Foundation Models, by Juan Jose Garau-luis et al.
-
Summary of Latent Functional Maps: a Spectral Framework For Representation Alignment, by Marco Fumero et al.
-
Summary of Iterative Sizing Field Prediction For Adaptive Mesh Generation From Expert Demonstrations, by Niklas Freymuth et al.
-
Summary of Layermatch: Do Pseudo-labels Benefit All Layers?, by Chaoqi Liang et al.
-
Summary of Defending Against Sophisticated Poisoning Attacks with Rl-based Aggregation in Federated Learning, by Yujing Wang et al.
-
Summary of Evaluation Of Deep Learning Semantic Segmentation For Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery, by Ilham Adi Panuntun et al.
-
Summary of Aeon: a Python Toolkit For Learning From Time Series, by Matthew Middlehurst et al.
-
Summary of Enhancing Robustness Of Data-driven Shm Models: Adversarial Training with Circle Loss, by Xiangli Yang et al.
-
Summary of Complex Fractal Trainability Boundary Can Arise From Trivial Non-convexity, by Yizhou Liu
-
Summary of Recent Advances in Traffic Accident Analysis and Prediction: a Comprehensive Review Of Machine Learning Techniques, by Noushin Behboudi et al.
-
Summary of The Elusive Pursuit Of Reproducing Pate-gan: Benchmarking, Auditing, Debugging, by Georgi Ganev et al.
-
Summary of Image Anomaly Detection and Prediction Scheme Based on Ssa Optimized Resnet50-bigru Model, by Qianhui Wan and Zecheng Zhang and Liheng Jiang and Zhaoqi Wang and Yan Zhou
-
Summary of Random Pairing Mle For Estimation Of Item Parameters in Rasch Model, by Yuepeng Yang et al.
-
Summary of Bayesian Inverse Reinforcement Learning For Non-markovian Rewards, by Noah Topper et al.
-
Summary of Exploring Changes in Nation Perception with Nationality-assigned Personas in Llms, by Mahammed Kamruzzaman and Gene Louis Kim
-
Summary of Prediction Of Unobserved Bifurcation by Unsupervised Extraction Of Slowly Time-varying System Parameter Dynamics From Time Series Using Reservoir Computing, By Keita Tokuda and Yuichi Katori
-
Summary of Deep Optimal Experimental Design For Parameter Estimation Problems, by Md Shahriar Rahim Siddiqui et al.
-
Summary of Information Guided Regularization For Fine-tuning Language Models, by Mandar Sharma et al.
-
Summary of Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning, by Asaf Ben Arie et al.
-
Summary of Feature Fusion Based on Mutual-cross-attention Mechanism For Eeg Emotion Recognition, by Yimin Zhao et al.
-
Summary of Cohortnet: Empowering Cohort Discovery For Interpretable Healthcare Analytics, by Qingpeng Cai et al.
-
Summary of Hight: Hierarchical Graph Tokenization For Graph-language Alignment, by Yongqiang Chen et al.
-
Summary of Investigating the Pre-training Dynamics Of In-context Learning: Task Recognition Vs. Task Learning, by Xiaolei Wang et al.
-
Summary of Demystifying Language Model Forgetting with Low-rank Example Associations, by Xisen Jin et al.
-
Summary of Ensembles Of Probabilistic Regression Trees, by Alexandre Seiller et al.
-
Summary of Towards Infinite-long Prefix in Transformer, by Yingyu Liang et al.
-
Summary of A Practical Diffusion Path For Sampling, by Omar Chehab et al.
-
Summary of Ltsm-bundle: a Toolbox and Benchmark on Large Language Models For Time Series Forecasting, by Yu-neng Chuang et al.
-
Summary of Hitchhiker’s Guide on Energy-based Models: a Comprehensive Review on the Relation with Other Generative Models, Sampling and Statistical Physics, by Davide Carbone (1 and 2) ((1) Dipartimento Di Scienze Matematiche et al.
-
Summary of Improving Gflownets with Monte Carlo Tree Search, by Nikita Morozov et al.
-
Summary of Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation, by Jirui Qi et al.
-
Summary of Beacon: Balancing Convenience and Nutrition in Meals with Long-term Group Recommendations and Reasoning on Multimodal Recipes, by Vansh Nagpal et al.
-
Summary of Challenges in Binary Classification, by Pengbo Yang et al.
-
Summary of On the Consistency Of Fairness Measurement Methods For Regression Tasks, by Abdalwahab Almajed et al.
-
Summary of On the Utility Of Domain-adjacent Fine-tuned Model Ensembles For Few-shot Problems, by Md Ibrahim Ibne Alam et al.
-
Summary of Integrating Fuzzy Logic with Causal Inference: Enhancing the Pearl and Neyman-rubin Methodologies, by Amir Saki and Usef Faghihi
-
Summary of Tree-sliced Wasserstein Distance on a System Of Lines, by Viet-hoang Tran et al.
-
Summary of You Can’t Handle the (dirty) Truth: Data-centric Insights Improve Pseudo-labeling, by Nabeel Seedat et al.
-
Summary of Stablesemantics: a Synthetic Language-vision Dataset Of Semantic Representations in Naturalistic Images, by Rushikesh Zawar et al.
-
Summary of Learn and Unlearn in Multilingual Llms, by Taiming Lu et al.
-
Summary of Genai-bench: Evaluating and Improving Compositional Text-to-visual Generation, by Baiqi Li et al.
-
Summary of Concept Drift Visualization Of Svm with Shifting Window, by Honorius Galmeanu et al.
-
Summary of Unveiling the Hidden Structure Of Self-attention Via Kernel Principal Component Analysis, by Rachel S.y. Teo et al.
-
Summary of Elliptical Attention, by Stefan K. Nielsen et al.
-
Summary of Game Of Llms: Discovering Structural Constructs in Activities Using Large Language Models, by Shruthi K. Hiremath and Thomas Ploetz
-
Summary of A Primal-dual Framework For Transformers and Neural Networks, by Tan M. Nguyen et al.
-
Summary of Iot-based Preventive Mental Health Using Knowledge Graphs and Standards For Better Well-being, by Amelie Gyrard et al.
-
Summary of Wikicontradict: a Benchmark For Evaluating Llms on Real-world Knowledge Conflicts From Wikipedia, by Yufang Hou et al.
-
Summary of Can Low-rank Knowledge Distillation in Llms Be Useful For Microelectronic Reasoning?, by Nirjhor Rouf et al.
-
Summary of Text Serialization and Their Relationship with the Conventional Paradigms Of Tabular Machine Learning, by Kyoka Ono et al.
-
Summary of Optimizing Quantile-based Trading Strategies in Electricity Arbitrage, by Ciaran O’connor et al.
-
Summary of Evaluating Representation Learning on the Protein Structure Universe, by Arian R. Jamasb and Alex Morehead and Chaitanya K. Joshi and Zuobai Zhang and Kieran Didi and Simon V. Mathis and Charles Harris and Jian Tang and Jianlin Cheng and Pietro Lio and Tom L. Blundell
-
Summary of Sdq: Sparse Decomposed Quantization For Llm Inference, by Geonhwa Jeong et al.
-
Summary of Global Human-guided Counterfactual Explanations For Molecular Properties Via Reinforcement Learning, by Danqing Wang et al.
-
Summary of Robust Time Series Forecasting with Non-heavy-tailed Gaussian Loss-weighted Sampler, by Jiang You et al.
-
Summary of Allocation Requires Prediction Only If Inequality Is Low, by Ali Shirali et al.
-
Summary of Generative Ai For Enhancing Active Learning in Education: a Comparative Study Of Gpt-3.5 and Gpt-4 in Crafting Customized Test Questions, by Hamdireza Rouzegar et al.
-
Summary of Beyond Optimism: Exploration with Partially Observable Rewards, by Simone Parisi et al.
-
Summary of Explainable Ai Security: Exploring Robustness Of Graph Neural Networks to Adversarial Attacks, by Tao Wu et al.
-
Summary of Optimal Deep Learning Of Holomorphic Operators Between Banach Spaces, by Ben Adcock et al.
-
Summary of Large Language Models Are Skeptics: False Negative Problem Of Input-conflicting Hallucination, by Jongyoon Song et al.
-
Summary of Soft-qmix: Integrating Maximum Entropy For Monotonic Value Function Factorization, by Wentse Chen et al.
-
Summary of Citybench: Evaluating the Capabilities Of Large Language Models For Urban Tasks, by Jie Feng et al.