Paper List
We recommend you use the search box as this list is very long.
-
Summary of Kolmogorov-arnold Network For Satellite Image Classification in Remote Sensing, by Minjong Cheon
-
Summary of Supergaussian: Repurposing Video Models For 3d Super Resolution, by Yuan Shen et al.
-
Summary of Longskywork: a Training Recipe For Efficiently Extending Context Length in Large Language Models, by Liang Zhao et al.
-
Summary of Representing Animatable Avatar Via Factorized Neural Fields, by Chunjin Song et al.
-
Summary of Compositional 4d Dynamic Scenes Understanding with Physics Priors For Video Question Answering, by Xingrui Wang et al.
-
Summary of The Embodied World Model Based on Llm with Visual Information and Prediction-oriented Prompts, by Wakana Haijima et al.
-
Summary of Diffusion Features to Bridge Domain Gap For Semantic Segmentation, by Yuxiang Ji et al.
-
Summary of Automatic Instruction Evolving For Large Language Models, by Weihao Zeng et al.
-
Summary of Prunerf: Segment-centric Dataset Pruning Via 3d Spatial Consistency, by Yeonsung Jung et al.
-
Summary of Focus: Forging Originality Through Contrastive Use in Self-plagiarism For Language Models, by Kaixin Lan et al.
-
Summary of Formality Style Transfer in Persian, by Parastoo Falakaflaki et al.
-
Summary of Mediq: Question-asking Llms and a Benchmark For Reliable Interactive Clinical Reasoning, by Shuyue Stella Li et al.
-
Summary of Clustered Retrieved Augmented Generation (crag), by Simon Akesson and Frances A. Santos
-
Summary of Retrieval-augmented Conversational Recommendation with Prompt-based Semi-structured Natural Language State Tracking, by Sara Kemper et al.
-
Summary of Adaptive Activation Steering: a Tuning-free Llm Truthfulness Improvement Method For Diverse Hallucinations Categories, by Tianlong Wang et al.
-
Summary of Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering, by Hongyu Yang et al.
-
Summary of Visper: Multilingual Audio-visual Speech Recognition, by Sanath Narayan et al.
-
Summary of An Empirical Analysis on Large Language Models in Debate Evaluation, by Xinyi Liu et al.
-
Summary of Long-span Question-answering: Automatic Question Generation and Qa-system Ranking Via Side-by-side Evaluation, by Bernd Bohnet et al.
-
Summary of A Novel Ranking Scheme For the Performance Analysis Of Stochastic Optimization Algorithms Using the Principles Of Severity, by Sowmya Chandrasekaran and Thomas Bartz-beielstein
-
Summary of Sned: Superposition Network Architecture Search For Efficient Video Diffusion Model, by Zhengang Li et al.
-
Summary of The Explanation Necessity For Healthcare Ai, by Michail Mamalakis et al.
-
Summary of Towards Rationality in Language and Multimodal Agents: a Survey, by Bowen Jiang et al.
-
Summary of Artemis: Towards Referential Understanding in Complex Videos, by Jihao Qiu and Yuan Zhang and Xi Tang and Lingxi Xie and Tianren Ma and Pengyu Yan and David Doermann and Qixiang Ye and Yunjie Tian
-
Summary of Genpalm: Contactless Palmprint Generation with Diffusion Models, by Steven A. Grosz and Anil K. Jain
-
Summary of Multi-dimensional Optimization For Text Summarization Via Reinforcement Learning, by Sangwon Ryu et al.
-
Summary of An Effective Weight Initialization Method For Deep Learning: Application to Satellite Image Classification, by Wadii Boulila et al.
-
Summary of Honestllm: Toward An Honest and Helpful Large Language Model, by Chujie Gao et al.
-
Summary of Neural Combinatorial Optimization Algorithms For Solving Vehicle Routing Problems: a Comprehensive Survey with Perspectives, by Xuan Wu et al.
-
Summary of Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning, by Jonathan Cook et al.
-
Summary of Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank, by Feiyu Zhu et al.
-
Summary of Malt: Multi-scale Action Learning Transformer For Online Action Detection, by Zhipeng Yang et al.
-
Summary of The Ai Alignment Paradox, by Robert West and Roland Aydin
-
Summary of Automatic Channel Pruning For Multi-head Attention, by Eunho Lee and Youngbae Hwang
-
Summary of Investigating Calibration and Corruption Robustness Of Post-hoc Pruned Perception Cnns: An Image Classification Benchmark Study, by Pallavi Mitra et al.
-
Summary of Preemptive Answer “attacks” on Chain-of-thought Reasoning, by Rongwu Xu et al.
-
Summary of Or-bench: An Over-refusal Benchmark For Large Language Models, by Justin Cui et al.
-
Summary of Monte Carlo Tree Search Satellite Scheduling Under Cloud Cover Uncertainty, by Justin Norman and Francois Rivest
-
Summary of A Robot Walks Into a Bar: Can Language Models Serve As Creativity Support Tools For Comedy? An Evaluation Of Llms’ Humour Alignment with Comedians, by Piotr Wojciech Mirowski et al.
-
Summary of Navigating Tabular Data Synthesis Research: Understanding User Needs and Tool Capabilities, by Maria F. Davila R. and Sven Groen and Fabian Panse and Wolfram Wingerath
-
Summary of Generative Adversarial Networks in Ultrasound Imaging: Extending Field Of View Beyond Conventional Limits, by Matej Gazda et al.
-
Summary of Enhancing Noise Robustness Of Retrieval-augmented Language Models with Adaptive Adversarial Training, by Feiteng Fang et al.
-
Summary of Lacie: Listener-aware Finetuning For Confidence Calibration in Large Language Models, by Elias Stengel-eskin et al.
-
Summary of Standards For Belief Representations in Llms, by Daniel A. Herrmann and Benjamin A. Levinstein
-
Summary of Direct Alignment Of Language Models Via Quality-aware Self-refinement, by Runsheng Yu et al.
-
Summary of Code Pretraining Improves Entity Tracking Abilities Of Language Models, by Najoung Kim et al.
-
Summary of Pta: Enhancing Multimodal Sentiment Analysis Through Pipelined Prediction and Translation-based Alignment, by Shezheng Song et al.
-
Summary of Ehr-seqsql : a Sequential Text-to-sql Dataset For Interactively Exploring Electronic Health Records, by Jaehee Ryu et al.
-
Summary of Scalm: Towards Semantic Caching For Automated Chat Services with Large Language Models, by Jiaxing Li et al.
-
Summary of Paths Of a Million People: Extracting Life Trajectories From Wikipedia, by Ying Zhang et al.
-
Summary of Gamedx: Generative Ai-based Medical Entity Data Extractor Using Large Language Models, by Mohammed-khalil Ghali et al.
-
Summary of Multi-label Class Incremental Emotion Decoding with Augmented Emotional Semantics Learning, by Kaicheng Fu et al.
-
Summary of Unibias: Unveiling and Mitigating Llm Bias Through Internal Attention and Ffn Manipulation, by Hanzhang Zhou et al.
-
Summary of Leveraging Large Language Models For Entity Matching, by Qianyu Huang and Tongfang Zhao
-
Summary of Robust Planning with Llm-modulo Framework: Case Study in Travel Planning, by Atharva Gundawar et al.
-
Summary of Toxvidlm: a Multimodal Framework For Toxicity Detection in Code-mixed Videos, by Krishanu Maity et al.
-
Summary of Enhancing Jailbreak Attack Against Large Language Models Through Silent Tokens, by Jiahao Yu et al.
-
Summary of Learning Gaze-aware Compositional Gan, by Nerea Aranjuelo et al.
-
Summary of Unraveling and Mitigating Retriever Inconsistencies in Retrieval-augmented Large Language Models, by Mingda Li et al.
-
Summary of Automatic Counting and Classification Of Mosquito Eggs in Field Traps, by Javier Naranjo-alcazar et al.
-
Summary of Unveiling the Lexical Sensitivity Of Llms: Combinatorial Optimization For Prompt Enhancement, by Pengwei Zhan et al.
-
Summary of Self-degraded Contrastive Domain Adaptation For Industrial Fault Diagnosis with Bi-imbalanced Data, by Gecheng Chen et al.
-
Summary of Fingen: a Dataset For Argument Generation in Finance, by Chung-chi Chen et al.
-
Summary of Climate Variable Downscaling with Conditional Normalizing Flows, by Christina Winkler et al.
-
Summary of Contextgs: Compact 3d Gaussian Splatting with Anchor Level Context Model, by Yufei Wang et al.
-
Summary of Gi-nas: Boosting Gradient Inversion Attacks Through Adaptive Neural Architecture Search, by Wenbo Yu et al.
-
Summary of Large Language Model Sentinel: Llm Agent For Adversarial Purification, by Guang Lin and Qibin Zhao
-
Summary of Insightsee: Advancing Multi-agent Vision-language Models For Enhanced Visual Understanding, by Huaxiang Zhang et al.
-
Summary of Clembench-2024: a Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework For Llms As Multi-action Agents, by Anne Beyer et al.
-
Summary of Don’t Buy It! Reassessing the Ad Understanding Abilities Of Contrastive Multimodal Models, by A. Bavaresco et al.
-
Summary of Mofa-video: Controllable Image Animation Via Generative Motion Field Adaptions in Frozen Image-to-video Diffusion Model, by Muyao Niu et al.
-
Summary of Esg-ftse: a Corpus Of News Articles with Esg Relevance Labels and Use Cases, by Mariya Pavlova et al.
-
Summary of Cv-vae: a Compatible Video Vae For Latent Generative Video Models, by Sijie Zhao et al.
-
Summary of Anah: Analytical Annotation Of Hallucinations in Large Language Models, by Ziwei Ji et al.
-
Summary of Hidden in Plain Sight: Exploring Chat History Tampering in Interactive Language Models, by Cheng’an Wei et al.
-
Summary of Parsel: Parameterized Shape Editing with Language, by Aditya Ganeshan et al.
-
Summary of Omnihands: Towards Robust 4d Hand Mesh Recovery Via a Versatile Transformer, by Dixuan Lin et al.
-
Summary of Occsora: 4d Occupancy Generation Models As World Simulators For Autonomous Driving, by Lening Wang et al.
-
Summary of Learning 3d Robotics Perception Using Inductive Priors, by Muhammad Zubair Irshad
-
Summary of Gradient Inversion Of Federated Diffusion Models, by Jiyue Huang et al.
-
Summary of Seamlessexpressivelm: Speech Language Model For Expressive Speech-to-speech Translation with Chain-of-thought, by Hongyu Gong et al.
-
Summary of Worse Than Random? An Embarrassingly Simple Probing Evaluation Of Large Multimodal Models in Medical Vqa, by Qianqi Yan et al.
-
Summary of Probabilities Of Causation For Continuous and Vector Variables, by Yuta Kawakami et al.
-
Summary of Automated Generation and Tagging Of Knowledge Components From Multiple-choice Questions, by Steven Moore et al.
-
Summary of Diffusion on Syntax Trees For Program Synthesis, by Shreyas Kapur et al.
-
Summary of Towards Ontology-enhanced Representation Learning For Large Language Models, by Francesco Ronzano and Jay Nanavati
-
Summary of Unveiling the Impact Of Coding Data Instruction Fine-tuning on Large Language Models Reasoning, by Xinlu Zhang et al.
-
Summary of An Automatic Question Usability Evaluation Toolkit, by Steven Moore et al.
-
Summary of Open Ko-llm Leaderboard: Evaluating Large Language Models in Korean with Ko-h5 Benchmark, by Chanjun Park et al.
-
Summary of Disrupting Diffusion: Token-level Attention Erasure Attack Against Diffusion-based Customization, by Yisu Liu et al.
-
Summary of Know: a Real-world Ontology For Knowledge Capture with Large Language Models, by Arto Bendiken
-
Summary of Open-set Domain Adaptation For Semantic Segmentation, by Seun-an Choe et al.
-
Summary of Learning to Discuss Strategically: a Case Study on One Night Ultimate Werewolf, by Xuanfa Jin et al.
-
Summary of Holmes: to Detect Adversarial Examples with Multiple Detectors, by Jing Wen
-
Summary of Pla4d: Pixel-level Alignments For Text-to-4d Gaussian Splatting, by Qiaowei Miao et al.
-
Summary of Multi-aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, by Yi Liu and Xiangyu Liu and Xiangrong Zhu and Wei Hu
-
Summary of Strategies to Counter Artificial Intelligence in Law Enforcement: Cross-country Comparison Of Citizens in Greece, Italy and Spain, by Petra Saskia Bayerl et al.
-
Summary of Dp-iqa: Utilizing Diffusion Prior For Blind Image Quality Assessment in the Wild, by Honghao Fu et al.