Paper List

We recommend you use the search box as this list is very long.

Summary of Enhanced Sign Language Translation Between American Sign Language (asl) and Indian Sign Language (isl) Using Llms, by Malay Kumar et al.
Summary of Catch: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in Lvlms, by Zhehan Kan et al.
Summary of Enhancing Multi-class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced Llms, by Ahmed Akib Jawad Karim et al.
Summary of Chapter 7 Review Of Data-driven Generative Ai Models For Knowledge Extraction From Scientific Literature in Healthcare, by Leon Kopitar et al.
Summary of Enhancing Llm Reasoning with Reward-guided Tree Search, by Jinhao Jiang et al.
Summary of Fedcollm: a Parameter-efficient Federated Co-tuning Framework For Large and Small Language Models, by Tao Fan et al.
Summary of Mc-llava: Multi-concept Personalized Vision-language Model, by Ruichuan An et al.
Summary of Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment, by Allison Huang et al.
Summary of Woodyolo: a Novel Object Detector For Wood Species Detection in Microscopic Images, by Lars Nieradzik et al.
Summary of The Power Of Many: Multi-agent Multimodal Models For Cultural Image Captioning, by Longju Bai et al.
Summary of Cnmbert: a Model For Converting Hanyu Pinyin Abbreviations to Chinese Characters, by Zishuo Feng et al.
Summary of Bi-mamba: Towards Accurate 1-bit State Space Models, by Shengkun Tang and Liqun Ma and Haonan Li and Mingjie Sun and Zhiqiang Shen
Summary of Reslearn: Transformer-based Residual Learning For Metaverse Network Traffic Prediction, by Yoga Suhas Kuruba Manjunath et al.
Summary of Survey on Semantic Interpretation Of Tabular Data: Challenges and Directions, by Marco Cremaschi et al.
Summary of On-board Vision-language Models For Personalized Autonomous Vehicle Motion Control: System Design and Real-world Validation, by Can Cui et al.
Summary of Atomthink: a Slow Thinking Framework For Multimodal Mathematical Reasoning, by Kun Xiang et al.
Summary of Spatialdreamer: Self-supervised Stereo Video Synthesis From Monocular Input, by Zhen Lv et al.
Summary of Medical Video Generation For Disease Progression Simulation, by Xu Cao et al.
Summary of Bytescience: Bridging Unstructured Scientific Literature and Structured Data with Auto Fine-tuned Large Language Model in Token Granularity, by Tong Xie et al.
Summary of Tsprank: Bridging Pairwise and Listwise Methods with a Bilinear Travelling Salesman Model, by Weixian Waylon Li et al.
Summary of The Role Of Accuracy and Validation Effectiveness in Conversational Business Analytics, by Adem Alparslan
Summary of Hncse: Advancing Sentence Embeddings Via Hybrid Contrastive Learning with Hard Negatives, by Wenxiao Liu et al.
Summary of Ccis-diff: a Generative Model with Stable Diffusion Prior For Controlled Colonoscopy Image Synthesis, by Yifan Xie et al.
Summary of Biancang: a Traditional Chinese Medicine Large Language Model, by Sibo Wei et al.
Summary of Wafer Map Defect Classification Using Autoencoder-based Data Augmentation and Convolutional Neural Network, by Yin-yin Bao et al.
Summary of Sra-mcts: Self-driven Reasoning Augmentation with Monte Carlo Tree Search For Code Generation, by Bin Xu and Yiguan Lin and Yinghao Li and Yang Gao
Summary of Label Sharing Incremental Learning Framework For Independent Multi-label Segmentation Tasks, by Deepa Anand et al.
Summary of Enhanced Anime Image Generation Using Use-cmhsa-gan, by J. Lu
Summary of Reinforcing Competitive Multi-agents For Playing So Long Sucker, by Medant Sharan et al.
Summary of Memo-bench: a Multiple Benchmark For Text-to-image and Multimodal Large Language Models on Human Emotion Analysis, by Yingjie Zhou et al.
Summary of Zefav: Boosting Large Language Models For Zero-shot Fact Verification, by Son T. Luu et al.
Summary of Zero-shot Automatic Annotation and Instance Segmentation Using Llm-generated Datasets: Eliminating Field Imaging and Manual Annotation For Deep Learning Model Development, by Ranjan Sapkota et al.
Summary of Cross-patient Pseudo Bags Generation and Curriculum Contrastive Learning For Imbalanced Multiclassification Of Whole Slide Image, by Yonghuang Wu et al.
Summary of Lp Data Pipeline: Lightweight, Purpose-driven Data Pipeline For Large Language Models, by Yungi Kim et al.
Summary of Transcending Language Boundaries: Harnessing Llms For Low-resource Language Translation, by Peng Shu et al.
Summary of Tp-unet: Temporal Prompt Guided Unet For Medical Image Segmentation, by Ranmin Wang et al.
Summary of Syllabus: Portable Curricula For Reinforcement Learning Agents, by Ryan Sullivan et al.
Summary of Mitigating Knowledge Conflicts in Language Model-driven Question Answering, by Han Cao et al.
Summary of A Comprehensive Survey Of Oracle Character Recognition: Challenges, Benchmarks, and Beyond, by Jing Li et al.
Summary of Robust Markov Decision Processes: a Place Where Ai and Formal Methods Meet, by Marnix Suilen et al.
Summary of Search, Verify and Feedback: Towards Next Generation Post-training Paradigm Of Foundation Models Via Verifier Engineering, by Xinyan Guan et al.
Summary of Psa-vlm: Enhancing Vision-language Model Safety Through Progressive Concept-bottleneck-driven Alignment, by Zhendong Liu et al.
Summary of Addressing Hallucinations in Language Models with Knowledge Graph Embeddings As An Additional Modality, by Viktoriia Chekalina et al.
Summary of Towards Automatic Evaluation Of Task-oriented Dialogue Flows, by Mehrnoosh Mirtaheri et al.
Summary of A Survey Of Event Causality Identification: Principles, Taxonomy, Challenges, and Assessment, by Qing Cheng et al.
Summary of Mitigating Parameter Degeneracy Using Joint Conditional Diffusion Model For Wecc Composite Load Model in Power Systems, by Feiqin Zhu et al.
Summary of Mitigating Hallucination in Multimodal Large Language Model Via Hallucination-targeted Direct Preference Optimization, by Yuhan Fu et al.
Summary of Usp-gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting, by Kang Chen and Jiyuan Zhang and Zecheng Hao and Yajing Zheng and Tiejun Huang and Zhaofei Yu
Summary of On the Shortcut Learning in Multilingual Neural Machine Translation, by Wenxuan Wang et al.
Summary of Vision Eagle Attention: a New Lens For Advancing Image Classification, by Mahmudul Hasan
Summary of A Dataset Of Questions on Decision-theoretic Reasoning in Newcomb-like Problems, by Caspar Oesterheld and Emery Cooper and Miles Kodama and Linh Chi Nguyen and Ethan Perez
Summary of Is Thermography a Viable Solution For Detecting Pressure Injuries in Dark Skin Patients?, by Miriam Asare-baiden et al.
Summary of Leveraging Large Language Models For Efficient Representation Learning For Entity Resolution, by Xiaowei Xu et al.
Summary of Ltcxnet: Advancing Chest X-ray Analysis with Solutions For Long-tailed Multi-label Classification and Fairness Challenges, by Chin-wei Huang et al.
Summary of Vibe: a Text-to-video Benchmark For Evaluating Hallucination in Large Multimodal Models, by Vipula Rawte et al.
Summary of Sam Decoding: Speculative Decoding Via Suffix Automaton, by Yuxuan Hu et al.
Summary of Empowering Meta-analysis: Leveraging Large Language Models For Scientific Synthesis, by Jawad Ibn Ahad et al.
Summary of Metricgold: Leveraging Text-to-image Latent Diffusion Models For Metric Depth Estimation, by Ansh Shah et al.
Summary of Learn From Downstream and Be Yourself in Multimodal Large Language Model Fine-tuning, by Wenke Huang et al.
Summary of Hyperspectral Imaging-based Grain Quality Assessment with Limited Labelled Data, by Priyabrata Karmakar et al.
Summary of Vidcomposition: Can Mllms Analyze Compositions in Compiled Videos?, by Yunlong Tang et al.
Summary of Unveiling the Hidden: Online Vectorized Hd Map Construction with Clip-level Token Interaction and Propagation, by Nayeon Kim et al.
Summary of Time Step Generating: a Universal Synthesized Deepfake Image Detector, by Ziyue Zeng et al.
Summary of Real-time Ai-driven People Tracking and Counting Using Overhead Cameras, by Ishrath Ahamed et al.
Summary of Rethinking Normalization Strategies and Convolutional Kernels For Multimodal Image Fusion, by Dan He et al.
Summary of Legal Evalutions and Challenges Of Large Language Models, by Jiaqi Wang et al.
Summary of Multi-task Adversarial Variational Autoencoder For Estimating Biological Brain Age with Multimodal Neuroimaging, by Muhammad Usman et al.
Summary of Mitigating Sycophancy in Decoder-only Transformer Architectures: Synthetic Data Intervention, by Libo Wang
Summary of Evaluating the Role Of `constitutions’ For Learning From Ai Feedback, by Saskia Redgate et al.
Summary of Increasing the Accessibility Of Causal Domain Knowledge Via Causal Information Extraction Methods: a Case Study in the Semiconductor Manufacturing Industry, by Houssam Razouk et al.
Summary of Semantics and Spatiality Of Emergent Communication, by Rotem Ben Zion et al.
Summary of Agentic Llms in the Supply Chain: Towards Autonomous Multi-agent Consensus-seeking, by Valeria Jannelli et al.
Summary of Let People Fail! Exploring the Influence Of Explainable Virtual and Robotic Agents in Learning-by-doing Tasks, by Marco Matarese et al.
Summary of Evoke: Elevating Chest X-ray Report Generation Via Multi-view Contrastive Learning and Patient-specific Knowledge, by Qiguang Miao and Kang Liu and Zhuoqi Ma and Yunan Li and Xiaolu Kang and Ruixuan Liu and Tianyi Liu and Kun Xie and Zhicheng Jiao
Summary of A Logic For Reasoning with Inconsistent Knowledge — a Reformulation Using Nowadays Terminology (2024), by Nico Roos
Summary of Coloredit: Training-free Image-guided Color Editing with Diffusion Model, by Xingxi Yin et al.
Summary of Artificial Intelligence in Pediatric Echocardiography: Exploring Challenges, Opportunities, and Clinical Applications with Explainable Ai and Federated Learning, by Mohammed Yaseen Jabarulla et al.
Summary of A Realistic Collimated X-ray Image Simulation Pipeline, by Benjamin El-zein et al.
Summary of The Dawn Of Gui Agent: a Preliminary Case Study with Claude 3.5 Computer Use, by Siyuan Hu et al.
Summary of Forming Auxiliary High-confident Instance-level Loss to Promote Learning From Label Proportions, by Tianhao Ma et al.
Summary of Mechanisms Of Generative Image-to-image Translation Networks, by Guangzong Chen et al.
Summary of Towards High-fidelity 3d Portrait Generation with Rich Details by Cross-view Prior-aware Diffusion, By Haoran Wei et al.
Summary of Repurposing Stable Diffusion Attention For Training-free Unsupervised Interactive Segmentation, by Markus Karmann et al.
Summary of Automating Reformulation Of Essence Specifications Via Graph Rewriting, by Ian Miguel et al.
Summary of Accelerating Knowledge Graph and Ontology Engineering with Large Language Models, by Cogan Shimizu et al.
Summary of Local-global Attention: An Adaptive Mechanism For Multi-scale Feature Integration, by Yifan Shao
Summary of Towards Neural Foundation Models For Vision: Aligning Eeg, Meg, and Fmri Representations For Decoding, Encoding, and Modality Conversion, by Matteo Ferrante et al.
Summary of Llm Hallucination Reasoning with Zero-shot Knowledge Test, by Seongmin Lee et al.
Summary of Ptr: Precision-driven Tool Recommendation For Large Language Models, by Hang Gao et al.
Summary of A Benchmark For Long-form Medical Question Answering, by Pedram Hosseini et al.
Summary of A Self-supervised Model For Multi-modal Stroke Risk Prediction, by Camille Delgrange et al.
Summary of Amxfp4: Taming Activation Outliers with Asymmetric Microscaling Floating-point For 4-bit Llm Inference, by Janghwan Lee et al.
Summary of A Hybrid Artificial Intelligence System For Automated Eeg Background Analysis and Report Generation, by Chin-sung Tung et al.
Summary of Motion-grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level, by Andong Deng et al.
Summary of Jradievo: a Japanese Radiology Report Generation Model Enhanced by Evolutionary Optimization Of Model Merging, By Kaito Baba et al.
Summary of Ggavatar: Reconstructing Garment-separated 3d Gaussian Splatting Avatars From Monocular Video, by Jingxuan Chen
Summary of Large Language Models As User-agents For Evaluating Task-oriented-dialogue Systems, by Taaha Kazi et al.
Summary of Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in Lvlms, By Xiaofeng Zhang et al.
Summary of Unlocking Transfer Learning For Open-world Few-shot Recognition, by Byeonggeun Kim et al.
Summary of Orca: Enhancing Role-playing Abilities Of Large Language Models by Integrating Personality Traits, By Yuxuan Huang