Paper List
We recommend you use the search box as this list is very long.
-
Summary of Belief in the Machine: Investigating Epistemological Blind Spots Of Language Models, by Mirac Suzgun et al.
-
Summary of Multi-modal Ai For Comprehensive Breast Cancer Prognostication, by Jan Witowski et al.
-
Summary of Autobench-v: Can Large Vision-language Models Benchmark Themselves?, by Han Bao et al.
-
Summary of Few-shot Open Relation Extraction with Gaussian Prototype and Adaptive Margin, by Tianlin Guo et al.
-
Summary of R-llava: Improving Med-vqa Understanding Through Visual Region Of Interest, by Xupeng Chen et al.
-
Summary of Get Large Language Models Ready to Speak: a Late-fusion Approach For Speech Generation, by Maohao Shen et al.
-
Summary of Historical Test-time Prompt Tuning For Vision Foundation Models, by Jingyi Zhang et al.
-
Summary of Maintaining Informative Coherence: Migrating Hallucinations in Large Language Models Via Absorbing Markov Chains, by Jiemin Wu et al.
-
Summary of Idempotent Unsupervised Representation Learning For Skeleton-based Action Recognition, by Lilang Lin et al.
-
Summary of Ropetp: Global Human Motion Recovery Via Integrating Robust Pose Estimation with Diffusion Trajectory Prior, by Mingjiang Liang et al.
-
Summary of Rethinking Data Synthesis: a Teacher Model Training Recipe with Interpretation, by Yifang Chen et al.
-
Summary of Open-vocabulary Object Detection Via Language Hierarchy, by Jiaxing Huang et al.
-
Summary of Addressing the Pitfalls Of Image-based Structural Health Monitoring: a Focus on False Positives, False Negatives, and Base Rate Bias, by Vagelis Plevris
-
Summary of Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns, by Ronghui Li et al.
-
Summary of Autokaggle: a Multi-agent Framework For Autonomous Data Science Competitions, by Ziming Li et al.
-
Summary of Nt-vot211: a Large-scale Benchmark For Night-time Visual Object Tracking, by Yu Liu et al.
-
Summary of Medgo: a Chinese Medical Large Language Model, by Haitao Zhang and Bo An
-
Summary of A Derivational Chainbank For Modern Standard Arabic, by Reham Marzouk et al.
-
Summary of What Factors Affect Multi-modal In-context Learning? An In-depth Exploration, by Libo Qin et al.
-
Summary of Asynchronous Perception Machine For Efficient Test-time-training, by Rajat Modi et al.
-
Summary of Subjective-qa: Measuring Subjectivity in Earnings Call Transcripts’ Qa Through Six-dimensional Feature Analysis, by Huzaifa Pardawala et al.
-
Summary of Gender Bias in Llm-generated Interview Responses, by Haein Kong et al.
-
Summary of Relation-based Counterfactual Data Augmentation and Contrastive Learning For Robustifying Natural Language Inference Models, by Heerin Yang et al.
-
Summary of Real-time Weapon Detection Using Yolov8 For Enhanced Safety, by Ayush Thakur et al.
-
Summary of Paved or Unpaved? a Deep Learning Derived Road Surface Global Dataset From Mapillary Street-view Imagery, by Sukanya Randhawa et al.
-
Summary of Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models, by Danqing Wang et al.
-
Summary of Scube: Instant Large-scale Scene Reconstruction Using Voxsplats, by Xuanchi Ren et al.
-
Summary of Oreole-fm: Successes and Challenges Toward Billion-parameter Foundation Models For High-resolution Satellite Imagery, by Philipe Dias and Aristeidis Tsaris and Jordan Bowman and Abhishek Potnis and Jacob Arndt and H. Lexie Yang and Dalton Lunga
-
Summary of Think Carefully and Check Again! Meta-generation Unlocking Llms For Low-resource Cross-lingual Summarization, by Zhecheng Li et al.
-
Summary of Beyond Fine-tuning: Effective Strategies For Mitigating Hallucinations in Large Language Models For Data Analytics, by Mikhail Rumiantsau et al.
-
Summary of Give: Guiding Visual Encoder to Perceive Overlooked Information, by Junjie Li et al.
-
Summary of Llm-consensus: Multi-agent Debate For Visual Misinformation Detection, by Kumud Lakara et al.
-
Summary of Diff-cxr: Report-to-cxr Generation Through a Disease-knowledge Enhanced Diffusion Model, by Peng Huang et al.
-
Summary of A Stack-propagation Framework For Low-resource Personalized Dialogue Generation, by Haoyu Song et al.
-
Summary of Rethinking the Uncertainty: a Critical Review and Analysis in the Era Of Large Language Models, by Mohammad Beigi et al.
-
Summary of A Survey Of Large Language Models For Arabic Language and Its Dialects, by Malak Mashaabi et al.
-
Summary of Adaptive Video Understanding Agent: Enhancing Efficiency with Dynamic Frame Sampling and Feedback-driven Reasoning, by Sullam Jeoung et al.
-
Summary of Swe-search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement, by Antonis Antoniades et al.
-
Summary of Learning From Response Not Preference: a Stackelberg Approach For Llm Detoxification Using Non-parallel Data, by Xinhong Xie et al.
-
Summary of Mardini: Masked Autoregressive Diffusion For Video Generation at Scale, by Haozhe Liu et al.
-
Summary of Deep Learning Based Dense Retrieval: a Comparative Study, by Ming Zhong et al.
-
Summary of Peter Parker or Spiderman? Disambiguating Multiple Class Labels, by Nuthan Mummani et al.
-
Summary of Shared Control with Black Box Agents Using Oracle Queries, by Inbal Avraham et al.
-
Summary of Openwebvoyager: Building Multimodal Web Agents Via Iterative Real-world Exploration, Feedback and Optimization, by Hongliang He et al.
-
Summary of Planning-aware Diffusion Networks For Enhanced Motion Forecasting in Autonomous Driving, by Liu Yunhao et al.
-
Summary of Knowledge Graph Enhanced Language Agents For Recommendation, by Taicheng Guo et al.
-
Summary of Vars: Vision-based Assessment Of Risk in Security Systems, by Pranav Gupta et al.
-
Summary of Agent-cq: Automatic Generation and Evaluation Of Clarifying Questions For Conversational Search with Llms, by Clemencia Siro et al.
-
Summary of Timesuite: Improving Mllms For Long Video Understanding Via Grounded Tuning, by Xiangyu Zeng et al.
-
Summary of 2d-dpo: Scaling Direct Preference Optimization with 2-dimensional Supervision, by Shilong Li et al.
-
Summary of Counting Ability Of Large Language Models and Impact Of Tokenization, by Xiang Zhang et al.
-
Summary of Integrating Reasoning Systems For Trustworthy Ai, Proceedings Of the 4th Workshop on Logic and Practice Of Programming (lpop), by Anil Nerode and Yanhong A. Liu
-
Summary of The Potential and Value Of Ai Chatbot in Personalized Cognitive Training, by Zilong Wang et al.
-
Summary of A Sam Based Tool For Semi-automatic Food Annotation, by Lubnaa Abdur Rahman et al.
-
Summary of Movie Trailer Genre Classification Using Multimodal Pretrained Features, by Serkan Sulun et al.
-
Summary of Pinning Cerebral Blood Flow: Analysis Of Perfusion Mri in Infants Using Physics-informed Neural Networks, by Christoforos Galazis et al.
-
Summary of Reliable, Routable, and Reproducible: Collection Of Pedestrian Pathways at Statewide Scale, by Yuxiang Zhang et al.
-
Summary of Locatebench: Evaluating the Locating Ability Of Vision Language Models, by Ting-rui Chiang et al.
-
Summary of Screenwriter: Automatic Screenplay Generation and Movie Summarisation, by Louis Mahon et al.
-
Summary of Step Guided Reasoning: Improving Mathematical Reasoning Using Guidance Generation and Step Reasoning, by Lang Cao et al.
-
Summary of Greeneye: Development Of Real-time Traffic Signal Recognition System For Visual Impairments, by Danu Kim
-
Summary of Pdl: a Declarative Prompt Programming Language, by Mandana Vaziri et al.
-
Summary of Rsa-control: a Pragmatics-grounded Lightweight Controllable Text Generation Framework, by Yifan Wang et al.
-
Summary of Lived Experience Not Found: Llms Struggle to Align with Experts on Addressing Adverse Drug Reactions From Psychiatric Medication Use, by Mohit Chandra et al.
-
Summary of Can Self Supervision Rejuvenate Similarity-based Link Prediction?, by Chenhan Zhang et al.
-
Summary of Tailored-llama: Optimizing Few-shot Learning in Pruned Llama Models with Task-specific Prompts, by Danyal Aftab et al.
-
Summary of Integrating Large Language Models with Internet Of Things Applications, by Mingyu Zong et al.
-
Summary of Developing a Tutoring Dialog Dataset to Optimize Llms For Educational Use, by Menna Fateen et al.
-
Summary of Not All Heads Matter: a Head-level Kv Cache Compression Method with Integrated Retrieval and Reasoning, by Yu Fu et al.
-
Summary of Designing Llm-agents with Personalities: a Psychometric Approach, by Muhua Huang et al.
-
Summary of Autonomous Building Cyber-physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model, by Reachsak Ly et al.
-
Summary of Engineering Trustworthy Ai: a Developer Guide For Empirical Risk Minimization, by Diana Pfau and Alexander Jung
-
Summary of Larctan-skan: Simple and Efficient Single-parameterized Kolmogorov-arnold Networks Using Learnable Trigonometric Function, by Zhijie Chen and Xinglin Zhang
-
Summary of Learning Neural Strategy-proof Matching Mechanism From Examples, by Ryota Maruo et al.
-
Summary of Investigating the Role Of Prompting and External Tools in Hallucination Rates Of Large Language Models, by Liam Barkley and Brink Van Der Merwe
-
Summary of Expose Before You Defend: Unifying and Enhancing Backdoor Defenses Via Exposed Models, by Yige Li et al.
-
Summary of Offline-to-online Multi-agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration, by Hai Zhong et al.
-
Summary of Intelligent Understanding Of Large Language Models in Traditional Chinese Medicine Based on Prompt Engineering Framework, by Yirui Chen et al.
-
Summary of Edge: Enhanced Grounded Gui Understanding with Enriched Multi-granularity Synthetic Data, by Xuetian Chen et al.
-
Summary of On Occlusions in Video Action Detection: Benchmark Datasets and Training Recipes, by Rajat Modi et al.
-
Summary of Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling For Autonomous Vehicles, by Yucheng Shi et al.
-
Summary of From English-centric to Effective Bilingual: Llms with Custom Tokenizers For Underrepresented Languages, by Artur Kiulian et al.
-
Summary of Towards Visual Text Design Transfer Across Languages, by Yejin Choi et al.
-
Summary of Decore: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations, By Aryo Pradipta Gema et al.
-
Summary of The Cat and Mouse Game: the Ongoing Arms Race Between Diffusion Models and Detection Methods, by Linda Laurier et al.
-
Summary of Guiding Empowerment Model: Liberating Neurodiversity in Online Higher Education, by Hannah Beaux et al.
-
Summary of Demystifying Large Language Models For Medicine: a Primer, by Qiao Jin et al.
-
Summary of Improving Small-scale Large Language Models Function Calling For Reasoning Tasks, by Graziano A. Manduzio et al.
-
Summary of Prism: a Methodology For Auditing Biases in Large Language Models, by Leif Azzopardi and Yashar Moshfeghi
-
Summary of From Blind Solvers to Logical Thinkers: Benchmarking Llms’ Logical Integrity on Faulty Mathematical Problems, by a M Muntasir Rahman et al.
-
Summary of Segllm: Multi-round Reasoning Segmentation, by Xudong Wang et al.
-
Summary of Schema-guided Culture-aware Complex Event Simulation with Multi-agent Role-play, by Sha Li et al.
-
Summary of Oscar: Operating System Control Via State-aware Reasoning and Re-planning, by Xiaoqiang Wang and Bang Liu
-
Summary of O1 Replication Journey: a Strategic Progress Report — Part 1, by Yiwei Qin et al.
-
Summary of 3d-adapter: Geometry-consistent Multi-view Diffusion For High-quality 3d Generation, by Hansheng Chen et al.
-
Summary of Infogent: An Agent-based Framework For Web Information Aggregation, by Revanth Gangi Reddy et al.