Paper List

We recommend you use the search box as this list is very long.

Summary of Belief in the Machine: Investigating Epistemological Blind Spots Of Language Models, by Mirac Suzgun et al.
Summary of Multi-modal Ai For Comprehensive Breast Cancer Prognostication, by Jan Witowski et al.
Summary of Autobench-v: Can Large Vision-language Models Benchmark Themselves?, by Han Bao et al.
Summary of Few-shot Open Relation Extraction with Gaussian Prototype and Adaptive Margin, by Tianlin Guo et al.
Summary of R-llava: Improving Med-vqa Understanding Through Visual Region Of Interest, by Xupeng Chen et al.
Summary of Get Large Language Models Ready to Speak: a Late-fusion Approach For Speech Generation, by Maohao Shen et al.
Summary of Historical Test-time Prompt Tuning For Vision Foundation Models, by Jingyi Zhang et al.
Summary of Maintaining Informative Coherence: Migrating Hallucinations in Large Language Models Via Absorbing Markov Chains, by Jiemin Wu et al.
Summary of Idempotent Unsupervised Representation Learning For Skeleton-based Action Recognition, by Lilang Lin et al.
Summary of Ropetp: Global Human Motion Recovery Via Integrating Robust Pose Estimation with Diffusion Trajectory Prior, by Mingjiang Liang et al.
Summary of Rethinking Data Synthesis: a Teacher Model Training Recipe with Interpretation, by Yifang Chen et al.
Summary of Open-vocabulary Object Detection Via Language Hierarchy, by Jiaxing Huang et al.
Summary of Addressing the Pitfalls Of Image-based Structural Health Monitoring: a Focus on False Positives, False Negatives, and Base Rate Bias, by Vagelis Plevris
Summary of Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns, by Ronghui Li et al.
Summary of Autokaggle: a Multi-agent Framework For Autonomous Data Science Competitions, by Ziming Li et al.
Summary of Nt-vot211: a Large-scale Benchmark For Night-time Visual Object Tracking, by Yu Liu et al.
Summary of Medgo: a Chinese Medical Large Language Model, by Haitao Zhang and Bo An
Summary of A Derivational Chainbank For Modern Standard Arabic, by Reham Marzouk et al.
Summary of What Factors Affect Multi-modal In-context Learning? An In-depth Exploration, by Libo Qin et al.
Summary of Asynchronous Perception Machine For Efficient Test-time-training, by Rajat Modi et al.
Summary of Subjective-qa: Measuring Subjectivity in Earnings Call Transcripts’ Qa Through Six-dimensional Feature Analysis, by Huzaifa Pardawala et al.
Summary of Gender Bias in Llm-generated Interview Responses, by Haein Kong et al.
Summary of Relation-based Counterfactual Data Augmentation and Contrastive Learning For Robustifying Natural Language Inference Models, by Heerin Yang et al.
Summary of Real-time Weapon Detection Using Yolov8 For Enhanced Safety, by Ayush Thakur et al.
Summary of Paved or Unpaved? a Deep Learning Derived Road Surface Global Dataset From Mapillary Street-view Imagery, by Sukanya Randhawa et al.
Summary of Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models, by Danqing Wang et al.
Summary of Scube: Instant Large-scale Scene Reconstruction Using Voxsplats, by Xuanchi Ren et al.
Summary of Oreole-fm: Successes and Challenges Toward Billion-parameter Foundation Models For High-resolution Satellite Imagery, by Philipe Dias and Aristeidis Tsaris and Jordan Bowman and Abhishek Potnis and Jacob Arndt and H. Lexie Yang and Dalton Lunga
Summary of Think Carefully and Check Again! Meta-generation Unlocking Llms For Low-resource Cross-lingual Summarization, by Zhecheng Li et al.
Summary of Beyond Fine-tuning: Effective Strategies For Mitigating Hallucinations in Large Language Models For Data Analytics, by Mikhail Rumiantsau et al.
Summary of Rare: Retrieval Augmented Retrieval with In-context Examples, by Atula Tejaswi et al.
Summary of Give: Guiding Visual Encoder to Perceive Overlooked Information, by Junjie Li et al.
Summary of Llm-consensus: Multi-agent Debate For Visual Misinformation Detection, by Kumud Lakara et al.
Summary of Diff-cxr: Report-to-cxr Generation Through a Disease-knowledge Enhanced Diffusion Model, by Peng Huang et al.
Summary of A Stack-propagation Framework For Low-resource Personalized Dialogue Generation, by Haoyu Song et al.
Summary of Rethinking the Uncertainty: a Critical Review and Analysis in the Era Of Large Language Models, by Mohammad Beigi et al.
Summary of A Survey Of Large Language Models For Arabic Language and Its Dialects, by Malak Mashaabi et al.
Summary of Adaptive Video Understanding Agent: Enhancing Efficiency with Dynamic Frame Sampling and Feedback-driven Reasoning, by Sullam Jeoung et al.
Summary of Swe-search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement, by Antonis Antoniades et al.
Summary of Learning From Response Not Preference: a Stackelberg Approach For Llm Detoxification Using Non-parallel Data, by Xinhong Xie et al.
Summary of Mardini: Masked Autoregressive Diffusion For Video Generation at Scale, by Haozhe Liu et al.
Summary of Deep Learning Based Dense Retrieval: a Comparative Study, by Ming Zhong et al.
Summary of Effective Instruction Parsing Plugin For Complex Logical Query Answering on Knowledge Graphs, by Xingrui Zhuo et al.
Summary of Peter Parker or Spiderman? Disambiguating Multiple Class Labels, by Nuthan Mummani et al.
Summary of Shared Control with Black Box Agents Using Oracle Queries, by Inbal Avraham et al.
Summary of Openwebvoyager: Building Multimodal Web Agents Via Iterative Real-world Exploration, Feedback and Optimization, by Hongliang He et al.
Summary of Planning-aware Diffusion Networks For Enhanced Motion Forecasting in Autonomous Driving, by Liu Yunhao et al.
Summary of Knowledge Graph Enhanced Language Agents For Recommendation, by Taicheng Guo et al.
Summary of Vars: Vision-based Assessment Of Risk in Security Systems, by Pranav Gupta et al.
Summary of Agent-cq: Automatic Generation and Evaluation Of Clarifying Questions For Conversational Search with Llms, by Clemencia Siro et al.
Summary of Timesuite: Improving Mllms For Long Video Understanding Via Grounded Tuning, by Xiangyu Zeng et al.
Summary of 2d-dpo: Scaling Direct Preference Optimization with 2-dimensional Supervision, by Shilong Li et al.
Summary of Counting Ability Of Large Language Models and Impact Of Tokenization, by Xiang Zhang et al.
Summary of Integrating Reasoning Systems For Trustworthy Ai, Proceedings Of the 4th Workshop on Logic and Practice Of Programming (lpop), by Anil Nerode and Yanhong A. Liu
Summary of The Potential and Value Of Ai Chatbot in Personalized Cognitive Training, by Zilong Wang et al.
Summary of A Sam Based Tool For Semi-automatic Food Annotation, by Lubnaa Abdur Rahman et al.
Summary of Movie Trailer Genre Classification Using Multimodal Pretrained Features, by Serkan Sulun et al.
Summary of Pinning Cerebral Blood Flow: Analysis Of Perfusion Mri in Infants Using Physics-informed Neural Networks, by Christoforos Galazis et al.
Summary of Reliable, Routable, and Reproducible: Collection Of Pedestrian Pathways at Statewide Scale, by Yuxiang Zhang et al.
Summary of Locatebench: Evaluating the Locating Ability Of Vision Language Models, by Ting-rui Chiang et al.
Summary of Screenwriter: Automatic Screenplay Generation and Movie Summarisation, by Louis Mahon et al.
Summary of Step Guided Reasoning: Improving Mathematical Reasoning Using Guidance Generation and Step Reasoning, by Lang Cao et al.
Summary of Greeneye: Development Of Real-time Traffic Signal Recognition System For Visual Impairments, by Danu Kim
Summary of Pdl: a Declarative Prompt Programming Language, by Mandana Vaziri et al.
Summary of Rsa-control: a Pragmatics-grounded Lightweight Controllable Text Generation Framework, by Yifan Wang et al.
Summary of Lived Experience Not Found: Llms Struggle to Align with Experts on Addressing Adverse Drug Reactions From Psychiatric Medication Use, by Mohit Chandra et al.
Summary of Can Self Supervision Rejuvenate Similarity-based Link Prediction?, by Chenhan Zhang et al.
Summary of Tailored-llama: Optimizing Few-shot Learning in Pruned Llama Models with Task-specific Prompts, by Danyal Aftab et al.
Summary of Integrating Large Language Models with Internet Of Things Applications, by Mingyu Zong et al.
Summary of Developing a Tutoring Dialog Dataset to Optimize Llms For Educational Use, by Menna Fateen et al.
Summary of Not All Heads Matter: a Head-level Kv Cache Compression Method with Integrated Retrieval and Reasoning, by Yu Fu et al.
Summary of Designing Llm-agents with Personalities: a Psychometric Approach, by Muhua Huang et al.
Summary of Autonomous Building Cyber-physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model, by Reachsak Ly et al.
Summary of Interleaving Text and Number Embeddings to Solve Mathemathics Problems, by Marvin Alberts et al.
Summary of Engineering Trustworthy Ai: a Developer Guide For Empirical Risk Minimization, by Diana Pfau and Alexander Jung
Summary of Larctan-skan: Simple and Efficient Single-parameterized Kolmogorov-arnold Networks Using Learnable Trigonometric Function, by Zhijie Chen and Xinglin Zhang
Summary of Learning Neural Strategy-proof Matching Mechanism From Examples, by Ryota Maruo et al.
Summary of Investigating the Role Of Prompting and External Tools in Hallucination Rates Of Large Language Models, by Liam Barkley and Brink Van Der Merwe
Summary of Expose Before You Defend: Unifying and Enhancing Backdoor Defenses Via Exposed Models, by Yige Li et al.
Summary of Offline-to-online Multi-agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration, by Hai Zhong et al.
Summary of Intelligent Understanding Of Large Language Models in Traditional Chinese Medicine Based on Prompt Engineering Framework, by Yirui Chen et al.
Summary of Edge: Enhanced Grounded Gui Understanding with Enriched Multi-granularity Synthetic Data, by Xuetian Chen et al.
Summary of On Occlusions in Video Action Detection: Benchmark Datasets and Training Recipes, by Rajat Modi et al.
Summary of Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances, by Shilin Lu et al.
Summary of Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling For Autonomous Vehicles, by Yucheng Shi et al.
Summary of From English-centric to Effective Bilingual: Llms with Custom Tokenizers For Underrepresented Languages, by Artur Kiulian et al.
Summary of Towards Visual Text Design Transfer Across Languages, by Yejin Choi et al.
Summary of Decore: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations, By Aryo Pradipta Gema et al.
Summary of The Cat and Mouse Game: the Ongoing Arms Race Between Diffusion Models and Detection Methods, by Linda Laurier et al.
Summary of Guiding Empowerment Model: Liberating Neurodiversity in Online Higher Education, by Hannah Beaux et al.
Summary of Demystifying Large Language Models For Medicine: a Primer, by Qiao Jin et al.
Summary of Improving Small-scale Large Language Models Function Calling For Reasoning Tasks, by Graziano A. Manduzio et al.
Summary of Prism: a Methodology For Auditing Biases in Large Language Models, by Leif Azzopardi and Yashar Moshfeghi
Summary of From Blind Solvers to Logical Thinkers: Benchmarking Llms’ Logical Integrity on Faulty Mathematical Problems, by a M Muntasir Rahman et al.
Summary of Segllm: Multi-round Reasoning Segmentation, by Xudong Wang et al.
Summary of Schema-guided Culture-aware Complex Event Simulation with Multi-agent Role-play, by Sha Li et al.
Summary of Oscar: Operating System Control Via State-aware Reasoning and Re-planning, by Xiaoqiang Wang and Bang Liu
Summary of O1 Replication Journey: a Strategic Progress Report — Part 1, by Yiwei Qin et al.
Summary of 3d-adapter: Geometry-consistent Multi-view Diffusion For High-quality 3d Generation, by Hansheng Chen et al.
Summary of Infogent: An Agent-based Framework For Web Information Aggregation, by Revanth Gangi Reddy et al.