Paper List
We recommend you use the search box as this list is very long.
-
Summary of An Investigation Of Neuron Activation As a Unified Lens to Explain Chain-of-thought Eliciting Arithmetic Reasoning Of Llms, by Daking Rai et al.
-
Summary of Research on Dangerous Flight Weather Prediction Based on Machine Learning, by Haoxing Liu et al.
-
Summary of Dassf: Dynamic-attention Scale-sequence Fusion For Aerial Object Detection, by Haodong Li et al.
-
Summary of Generative Ai Voting: Fair Collective Choice Is Resilient to Llm Biases and Inconsistencies, by Srijoni Majumdar et al.
-
Summary of Logic-based Explainability: Past, Present & Future, by Joao Marques-silva
-
Summary of Chatpcg: Large Language Model-driven Reward Design For Procedural Content Generation, by In-chang Baek et al.
-
Summary of Predicting User Perception Of Move Brilliance in Chess, by Kamron Zaidi and Michael Guerzhoy
-
Summary of Prompt Design Matters For Computational Social Science Tasks but in Unpredictable Ways, by Shubham Atreja et al.
-
Summary of Look Further Ahead: Testing the Limits Of Gpt-4 in Path Planning, by Mohamed Aghzal et al.
-
Summary of Tracking the Perspectives Of Interacting Language Models, by Hayden Helm and Brandon Duderstadt and Youngser Park and Carey E. Priebe
-
Summary of Medcalc-bench: Evaluating Large Language Models For Medical Calculations, by Nikhil Khandekar et al.
-
Summary of Spa-vl: a Comprehensive Safety Preference Alignment Dataset For Vision Language Model, by Yongting Zhang et al.
-
Summary of Grade Score: Quantifying Llm Performance in Option Selection, by Dmitri Iourovitski
-
Summary of Welldunn: on the Robustness and Explainability Of Language Models and Large Language Models in Identifying Wellness Dimensions, by Seyedali Mohammadi et al.
-
Summary of Medea: Multi-view Efficient Depth Adjustment, by Mikhail Artemyev et al.
-
Summary of When Reasoning Meets Information Aggregation: a Case Study with Sports Narratives, by Yebowen Hu et al.
-
Summary of Conformance Checking Of Fuzzy Logs Against Declarative Temporal Specifications, by Ivan Donadello et al.
-
Summary of Who’s Asking? User Personas and the Mechanics Of Latent Misalignment, by Asma Ghandeharioun and Ann Yuan and Marius Guerard and Emily Reif and Michael A. Lepori and Lucas Dixon
-
Summary of Distillnerf: Perceiving 3d Scenes From Single-glance Images by Distilling Neural Fields and Foundation Model Features, by Letian Wang et al.
-
Summary of Ids For Ai Systems, by Alan Chan et al.
-
Summary of Should Ai Optimize Your Code? a Comparative Study Of Current Large Language Models Versus Classical Optimizing Compilers, by Miguel Romero Rosas et al.
-
Summary of Llms Are Prone to Fallacies in Causal Inference, by Nitish Joshi et al.
-
Summary of How Far Can In-context Alignment Go? Exploring the State Of In-context Alignment, by Heyan Huang et al.
-
Summary of How Can We Effectively Expand the Vocabulary Of Llms with 0.01gb Of Target Language Text?, by Atsuki Yamaguchi et al.
-
Summary of Improving Quality Control Of Whole Slide Images by Explicit Artifact Augmentation, by Artur Jurgas et al.
-
Summary of Input Conditioned Graph Generation For Language Agents, by Lukas Vierling et al.
-
Summary of Quaternion Generative Adversarial Neural Networks and Applications to Color Image Inpainting, by Duan Wang and Dandan Zhu and Meixiang Zhao and Zhigang Jia
-
Summary of Intrinsic Evaluation Of Unlearning Using Parametric Knowledge Traces, by Yihuai Hong et al.
-
Summary of Unveiling the Power Of Source: Source-based Minimum Bayes Risk Decoding For Neural Machine Translation, by Boxuan Lyu et al.
-
Summary of The Base-rate Effect on Llm Benchmark Performance: Disambiguating Test-taking Strategies From Benchmark Performance, by Kyle Moore et al.
-
Summary of Yolo-feder Fusionnet: a Novel Deep Learning Architecture For Drone Detection, by Tamara R. Lenhard et al.
-
Summary of Masai: Modular Architecture For Software-engineering Ai Agents, by Daman Arora et al.
-
Summary of See It From My Perspective: How Language Affects Cultural Bias in Image Understanding, by Amith Ananthram et al.
-
Summary of Knowledge-to-jailbreak: One Knowledge Point Worth One Attack, by Shangqing Tu et al.
-
Summary of R-eval: a Unified Toolkit For Evaluating Domain Knowledge Of Retrieval Augmented Large Language Models, by Shangqing Tu et al.
-
Summary of Interactive Evolution: a Neural-symbolic Self-training Framework For Large Language Models, by Fangzhi Xu et al.
-
Summary of Star: Sociotechnical Approach to Red Teaming Language Models, by Laura Weidinger et al.
-
Summary of Task Me Anything, by Jieyu Zhang et al.
-
Summary of Deep Learning Methodology For the Identification Of Wood Species Using High-resolution Macroscopic Images, by David Herrera-poyatos et al.
-
Summary of Mdcr: a Dataset For Multi-document Conditional Reasoning, by Peter Baile Chen et al.
-
Summary of Repliqa: a Question-answering Dataset For Benchmarking Llms on Unseen Reference Content, by Joao Monteiro et al.
-
Summary of Language Modeling with Editable External Knowledge, by Belinda Z. Li et al.
-
Summary of Nldf: Neural Light Dynamic Fields For Efficient 3d Talking Head Generation, by Niu Guanchen
-
Summary of Understanding the Collapse Of Llms in Model Editing, by Wanli Yang et al.
-
Summary of Development Of An Adaptive Multi-domain Artificial Intelligence System Built Using Machine Learning and Expert Systems Technologies, by Jeremy Straub
-
Summary of From Pixels to Progress: Generating Road Network From Satellite Imagery For Socioeconomic Insights in Impoverished Areas, by Yanxin Xi et al.
-
Summary of Videovista: a Versatile Benchmark For Video Understanding and Reasoning, by Yunxin Li et al.
-
Summary of Guicourse: From General Vision Language Models to Versatile Gui Agents, by Wentong Chen et al.
-
Summary of Temporal Lidar Depth Completion, by Pietari Kaskela et al.
-
Summary of Program Synthesis Benchmark For Visual Programming in Xlogoonline Environment, by Chao Wen et al.
-
Summary of Preserving Knowledge in Large Language Model with Model-agnostic Self-decompression, by Zilun Zhang et al.
-
Summary of Full-ece: a Metric For Token-level Calibration on Large Language Models, by Han Liu et al.
-
Summary of Refiner: Restructure Retrieval Content Efficiently to Advance Question-answering Capabilities, by Zhonghao Li et al.
-
Summary of Boosting Scientific Concepts Understanding: Can Analogy From Teacher Models Empower Student Models?, by Siyu Yuan et al.
-
Summary of Codegemma: Open Code Models Based on Gemma, by Codegemma Team: Heri Zhao et al.
-
Summary of Hare: Human Priors, a Key to Small Language Model Efficiency, by Lingyun Zhang et al.
-
Summary of Fusion Makes Perfection: An Efficient Multi-grained Matching Approach For Zero-shot Relation Extraction, by Shilong Li et al.
-
Summary of Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-strong Generalization, by Wenkai Yang et al.
-
Summary of Anytrans: Translate Anytext in the Image with Large Scale Models, by Zhipeng Qian et al.
-
Summary of Adaptive Reinforcement Learning Planning: Harnessing Large Language Models For Complex Information Extraction, by Zepeng Ding et al.
-
Summary of Promises, Outlooks and Challenges Of Diffusion Language Modeling, by Justin Deschenaux et al.
-
Summary of Trace the Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation, by Jinyuan Fang et al.
-
Summary of Connecting the Dots: Evaluating Abstract Reasoning Capabilities Of Llms Using the New York Times Connections Word Game, by Prisha Samadarshi et al.
-
Summary of A Peek Into Token Bias: Large Language Models Are Not Yet Genuine Reasoners, by Bowen Jiang et al.
-
Summary of Wildvision: Evaluating Vision-language Models in the Wild with Human Preferences, by Yujie Lu et al.
-
Summary of Instructcmp: Length Control in Sentence Compression Through Instruction-based Large Language Models, by Juseon-do et al.
-
Summary of Boosting Medical Image Classification with Segmentation Foundation Model, by Pengfei Gu and Zihan Zhao and Hongxiao Wang and Yaopeng Peng and Yizhe Zhang and Nishchal Sapkota and Chaoli Wang and Danny Z. Chen
-
Summary of Grading Massive Open Online Courses Using Large Language Models, by Shahriar Golchin et al.
-
Summary of Exploiting Diffusion Prior For Out-of-distribution Detection, by Armando Zhu et al.
-
Summary of From Intentions to Techniques: a Comprehensive Taxonomy and Challenges in Text Watermarking For Large Language Models, by Harsh Nishant Lalai et al.
-
Summary of Are Large Language Models a Good Replacement Of Taxonomies?, by Yushi Sun et al.
-
Summary of Diffusion Models in Low-level Vision: a Survey, by Chunming He et al.
-
Summary of Scorecards For Synthetic Medical Data Evaluation and Reporting, by Ghada Zamzmi et al.
-
Summary of Context Graph, by Chengjin Xu et al.
-
Summary of Emotion-llama: Multimodal Emotion Recognition and Reasoning with Instruction Tuning, by Zebang Cheng et al.
-
Summary of Aligning Large Language Models From Self-reference Ai Feedback with One General Principle, by Rong Bao et al.
-
Summary of Fine-tuning or Fine-failing? Debunking Performance Myths in Large Language Models, by Scott Barnett et al.
-
Summary of Weatherqa: Can Multimodal Language Models Reason About Severe Weather?, by Chengqian Ma et al.
-
Summary of Minicongts: a Near Ultimate Minimalist Contrastive Grid Tagging Scheme For Aspect Sentiment Triplet Extraction, by Qiao Sun et al.
-
Summary of Famicom: Further Demystifying Prompts For Language Models with Task-agnostic Performance Estimation, by Bangzheng Li et al.
-
Summary of Silverspeak: Evading Ai-generated Text Detectors Using Homoglyphs, by Aldan Creo et al.
-
Summary of Adversarial Style Augmentation Via Large Language Model For Robust Fake News Detection, by Sungwon Park et al.
-
Summary of Post-hoc Utterance Refining Method by Entity Mining For Faithful Knowledge Grounded Conversations, by Yoonna Jang et al.
-
Summary of Kgpa: Robustness Evaluation For Large Language Models Via Cross-domain Knowledge Graphs, by Aihua Pei et al.
-
Summary of Ptt5-v2: a Closer Look at Continued Pretraining Of T5 Models For the Portuguese Language, by Marcos Piau et al.
-
Summary of Llmfactor: Extracting Profitable Factors Through Prompts For Explainable Stock Movement Prediction, by Meiyun Wang et al.
-
Summary of Algorithm Selection For Optimal Multi-agent Path Finding Via Graph Embedding, by Carmel Shabalin et al.
-
Summary of Gui-world: a Dataset For Gui-oriented Multimodal Llm-based Agents, by Dongping Chen et al.
-
Summary of Torchopera: a Compound Ai System For Llm Safety, by Shanshan Han et al.
-
Summary of Large Language Models For Automatic Milestone Detection in Group Discussions, by Zhuoxu Duan et al.
-
Summary of Ig2: Integrated Gradient on Iterative Gradient Path For Feature Attribution, by Yue Zhuo et al.
-
Summary of Step-level Value Preference Optimization For Mathematical Reasoning, by Guoxin Chen et al.
-
Summary of Alps: An Auto-labeling and Pre-training Scheme For Remote Sensing Segmentation with Segment Anything Model, by Song Zhang et al.
-
Summary of Demonstration Notebook: Finding the Most Suited In-context Learning Example From Interactions, by Yiming Tang and Bin Dong
-
Summary of Generating Tables From the Parametric Knowledge Of Language Models, by Yevgeni Berkovitch et al.
-
Summary of Explora: Parameter-efficient Extended Pre-training to Adapt Vision Transformers Under Domain Shifts, by Samar Khanna et al.
-
Summary of Open-vocabulary X-ray Prohibited Item Detection Via Fine-tuning Clip, by Shuyang Lin et al.
-
Summary of Towards Supporting Legal Argumentation with Nlp: Is More Data Really All You Need?, by T.y.s.s Santosh et al.