Paper List
We recommend you use the search box as this list is very long.
-
Summary of Spatialdreamer: Self-supervised Stereo Video Synthesis From Monocular Input, by Zhen Lv et al.
-
Summary of Medical Video Generation For Disease Progression Simulation, by Xu Cao et al.
-
Summary of Bytescience: Bridging Unstructured Scientific Literature and Structured Data with Auto Fine-tuned Large Language Model in Token Granularity, by Tong Xie et al.
-
Summary of Tsprank: Bridging Pairwise and Listwise Methods with a Bilinear Travelling Salesman Model, by Weixian Waylon Li et al.
-
Summary of The Role Of Accuracy and Validation Effectiveness in Conversational Business Analytics, by Adem Alparslan
-
Summary of Hncse: Advancing Sentence Embeddings Via Hybrid Contrastive Learning with Hard Negatives, by Wenxiao Liu et al.
-
Summary of Ccis-diff: a Generative Model with Stable Diffusion Prior For Controlled Colonoscopy Image Synthesis, by Yifan Xie et al.
-
Summary of Balancing Accuracy and Efficiency in Multi-turn Intent Classification For Llm-powered Dialog Systems in Production, by Junhua Liu and Yong Keat Tan and Bin Fu and Kwan Hui Lim
-
Summary of Evaluating Tokenizer Performance Of Large Language Models Across Official Indian Languages, by S. Tamang and D. J. Bora
-
Summary of Efficient Training in Multi-agent Reinforcement Learning: a Communication-free Framework For the Box-pushing Problem, by David Ge et al.
-
Summary of Sseditor: Controllable Mask-to-scene Generation with Diffusion Model, by Haowen Zheng and Yanyan Liang
-
Summary of Clip Unreasonable Potential in Single-shot Face Recognition, by Nhan T. Luu
-
Summary of Do Llms Understand Ambiguity in Text? a Case Study in Open-world Question Answering, by Aryan Keluskar et al.
-
Summary of Evaluating the Prompt Steerability Of Large Language Models, by Erik Miehling et al.
-
Summary of Preference-conditioned Gradient Variations For Multi-objective Quality-diversity, by Hannah Janmohamed et al.
-
Summary of Cross-patient Pseudo Bags Generation and Curriculum Contrastive Learning For Imbalanced Multiclassification Of Whole Slide Image, by Yonghuang Wu et al.
-
Summary of Zero-shot Automatic Annotation and Instance Segmentation Using Llm-generated Datasets: Eliminating Field Imaging and Manual Annotation For Deep Learning Model Development, by Ranjan Sapkota et al.
-
Summary of Lp Data Pipeline: Lightweight, Purpose-driven Data Pipeline For Large Language Models, by Yungi Kim et al.
-
Summary of Transcending Language Boundaries: Harnessing Llms For Low-resource Language Translation, by Peng Shu et al.
-
Summary of Tp-unet: Temporal Prompt Guided Unet For Medical Image Segmentation, by Ranmin Wang et al.
-
Summary of Mitigating Knowledge Conflicts in Language Model-driven Question Answering, by Han Cao et al.
-
Summary of Syllabus: Portable Curricula For Reinforcement Learning Agents, by Ryan Sullivan et al.
-
Summary of A Comprehensive Survey Of Oracle Character Recognition: Challenges, Benchmarks, and Beyond, by Jing Li et al.
-
Summary of Robust Markov Decision Processes: a Place Where Ai and Formal Methods Meet, by Marnix Suilen et al.
-
Summary of Search, Verify and Feedback: Towards Next Generation Post-training Paradigm Of Foundation Models Via Verifier Engineering, by Xinyan Guan et al.
-
Summary of Addressing Hallucinations in Language Models with Knowledge Graph Embeddings As An Additional Modality, by Viktoriia Chekalina et al.
-
Summary of Psa-vlm: Enhancing Vision-language Model Safety Through Progressive Concept-bottleneck-driven Alignment, by Zhendong Liu et al.
-
Summary of Chapter 7 Review Of Data-driven Generative Ai Models For Knowledge Extraction From Scientific Literature in Healthcare, by Leon Kopitar et al.
-
Summary of Enhancing Llm Reasoning with Reward-guided Tree Search, by Jinhao Jiang et al.
-
Summary of Mc-llava: Multi-concept Personalized Vision-language Model, by Ruichuan An et al.
-
Summary of Fedcollm: a Parameter-efficient Federated Co-tuning Framework For Large and Small Language Models, by Tao Fan et al.
-
Summary of Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment, by Allison Huang et al.
-
Summary of Woodyolo: a Novel Object Detector For Wood Species Detection in Microscopic Images, by Lars Nieradzik et al.
-
Summary of The Power Of Many: Multi-agent Multimodal Models For Cultural Image Captioning, by Longju Bai et al.
-
Summary of Cnmbert: a Model For Converting Hanyu Pinyin Abbreviations to Chinese Characters, by Zishuo Feng et al.
-
Summary of Is Thermography a Viable Solution For Detecting Pressure Injuries in Dark Skin Patients?, by Miriam Asare-baiden et al.
-
Summary of Leveraging Large Language Models For Efficient Representation Learning For Entity Resolution, by Xiaowei Xu et al.
-
Summary of Ltcxnet: Advancing Chest X-ray Analysis with Solutions For Long-tailed Multi-label Classification and Fairness Challenges, by Chin-wei Huang et al.
-
Summary of Sam Decoding: Speculative Decoding Via Suffix Automaton, by Yuxuan Hu et al.
-
Summary of Vibe: a Text-to-video Benchmark For Evaluating Hallucination in Large Multimodal Models, by Vipula Rawte et al.
-
Summary of Empowering Meta-analysis: Leveraging Large Language Models For Scientific Synthesis, by Jawad Ibn Ahad et al.
-
Summary of Metricgold: Leveraging Text-to-image Latent Diffusion Models For Metric Depth Estimation, by Ansh Shah et al.
-
Summary of Hyperspectral Imaging-based Grain Quality Assessment with Limited Labelled Data, by Priyabrata Karmakar et al.
-
Summary of Learn From Downstream and Be Yourself in Multimodal Large Language Model Fine-tuning, by Wenke Huang et al.
-
Summary of Vidcomposition: Can Mllms Analyze Compositions in Compiled Videos?, by Yunlong Tang et al.
-
Summary of Unveiling the Hidden: Online Vectorized Hd Map Construction with Clip-level Token Interaction and Propagation, by Nayeon Kim et al.
-
Summary of Time Step Generating: a Universal Synthesized Deepfake Image Detector, by Ziyue Zeng et al.
-
Summary of Biancang: a Traditional Chinese Medicine Large Language Model, by Sibo Wei et al.
-
Summary of Sra-mcts: Self-driven Reasoning Augmentation with Monte Carlo Tree Search For Code Generation, by Bin Xu and Yiguan Lin and Yinghao Li and Yang Gao
-
Summary of Wafer Map Defect Classification Using Autoencoder-based Data Augmentation and Convolutional Neural Network, by Yin-yin Bao et al.
-
Summary of Reinforcing Competitive Multi-agents For Playing So Long Sucker, by Medant Sharan et al.
-
Summary of Label Sharing Incremental Learning Framework For Independent Multi-label Segmentation Tasks, by Deepa Anand et al.
-
Summary of Enhanced Anime Image Generation Using Use-cmhsa-gan, by J. Lu
-
Summary of Memo-bench: a Multiple Benchmark For Text-to-image and Multimodal Large Language Models on Human Emotion Analysis, by Yingjie Zhou et al.
-
Summary of Zefav: Boosting Large Language Models For Zero-shot Fact Verification, by Son T. Luu et al.
-
Summary of Agentic Llms in the Supply Chain: Towards Autonomous Multi-agent Consensus-seeking, by Valeria Jannelli et al.
-
Summary of Let People Fail! Exploring the Influence Of Explainable Virtual and Robotic Agents in Learning-by-doing Tasks, by Marco Matarese et al.
-
Summary of A Logic For Reasoning with Inconsistent Knowledge — a Reformulation Using Nowadays Terminology (2024), by Nico Roos
-
Summary of Evoke: Elevating Chest X-ray Report Generation Via Multi-view Contrastive Learning and Patient-specific Knowledge, by Qiguang Miao and Kang Liu and Zhuoqi Ma and Yunan Li and Xiaolu Kang and Ruixuan Liu and Tianyi Liu and Kun Xie and Zhicheng Jiao
-
Summary of Coloredit: Training-free Image-guided Color Editing with Diffusion Model, by Xingxi Yin et al.
-
Summary of Artificial Intelligence in Pediatric Echocardiography: Exploring Challenges, Opportunities, and Clinical Applications with Explainable Ai and Federated Learning, by Mohammed Yaseen Jabarulla et al.
-
Summary of A Realistic Collimated X-ray Image Simulation Pipeline, by Benjamin El-zein et al.
-
Summary of The Dawn Of Gui Agent: a Preliminary Case Study with Claude 3.5 Computer Use, by Siyuan Hu et al.
-
Summary of Forming Auxiliary High-confident Instance-level Loss to Promote Learning From Label Proportions, by Tianhao Ma et al.
-
Summary of Mechanisms Of Generative Image-to-image Translation Networks, by Guangzong Chen et al.
-
Summary of Towards High-fidelity 3d Portrait Generation with Rich Details by Cross-view Prior-aware Diffusion, By Haoran Wei et al.
-
Summary of A Survey Of Event Causality Identification: Principles, Taxonomy, Challenges, and Assessment, by Qing Cheng et al.
-
Summary of Repurposing Stable Diffusion Attention For Training-free Unsupervised Interactive Segmentation, by Markus Karmann et al.
-
Summary of Towards Automatic Evaluation Of Task-oriented Dialogue Flows, by Mehrnoosh Mirtaheri et al.
-
Summary of Mitigating Parameter Degeneracy Using Joint Conditional Diffusion Model For Wecc Composite Load Model in Power Systems, by Feiqin Zhu et al.
-
Summary of Usp-gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting, by Kang Chen and Jiyuan Zhang and Zecheng Hao and Yajing Zheng and Tiejun Huang and Zhaofei Yu
-
Summary of Mitigating Hallucination in Multimodal Large Language Model Via Hallucination-targeted Direct Preference Optimization, by Yuhan Fu et al.
-
Summary of On the Shortcut Learning in Multilingual Neural Machine Translation, by Wenxuan Wang et al.
-
Summary of Vision Eagle Attention: a New Lens For Advancing Image Classification, by Mahmudul Hasan
-
Summary of A Dataset Of Questions on Decision-theoretic Reasoning in Newcomb-like Problems, by Caspar Oesterheld and Emery Cooper and Miles Kodama and Linh Chi Nguyen and Ethan Perez
-
Summary of A Hybrid Artificial Intelligence System For Automated Eeg Background Analysis and Report Generation, by Chin-sung Tung et al.
-
Summary of Amxfp4: Taming Activation Outliers with Asymmetric Microscaling Floating-point For 4-bit Llm Inference, by Janghwan Lee et al.
-
Summary of Motion-grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level, by Andong Deng et al.
-
Summary of Jradievo: a Japanese Radiology Report Generation Model Enhanced by Evolutionary Optimization Of Model Merging, By Kaito Baba et al.
-
Summary of Ggavatar: Reconstructing Garment-separated 3d Gaussian Splatting Avatars From Monocular Video, by Jingxuan Chen
-
Summary of Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in Lvlms, By Xiaofeng Zhang et al.
-
Summary of Large Language Models As User-agents For Evaluating Task-oriented-dialogue Systems, by Taaha Kazi et al.
-
Summary of Orca: Enhancing Role-playing Abilities Of Large Language Models by Integrating Personality Traits, By Yuxuan Huang
-
Summary of Graph-based Complexity For Causal Effect by Empirical Plug-in, By Rina Dechter and Annie Raichev and Alexander Ihler and Jin Tian
-
Summary of Vmid: a Multimodal Fusion Llm Framework For Detecting and Identifying Misinformation Of Short Videos, by Weihao Zhong et al.
-
Summary of Rethinking Normalization Strategies and Convolutional Kernels For Multimodal Image Fusion, by Dan He et al.
-
Summary of Real-time Ai-driven People Tracking and Counting Using Overhead Cameras, by Ishrath Ahamed et al.
-
Summary of Multi-task Adversarial Variational Autoencoder For Estimating Biological Brain Age with Multimodal Neuroimaging, by Muhammad Usman et al.
-
Summary of Legal Evalutions and Challenges Of Large Language Models, by Jiaqi Wang et al.
-
Summary of Mitigating Sycophancy in Decoder-only Transformer Architectures: Synthetic Data Intervention, by Libo Wang
-
Summary of Evaluating the Role Of `constitutions’ For Learning From Ai Feedback, by Saskia Redgate et al.
-
Summary of Increasing the Accessibility Of Causal Domain Knowledge Via Causal Information Extraction Methods: a Case Study in the Semiconductor Manufacturing Industry, by Houssam Razouk et al.
-
Summary of Semantics and Spatiality Of Emergent Communication, by Rotem Ben Zion et al.
-
Summary of Towards Unified Neural Decoding Of Perceived, Spoken and Imagined Speech From Eeg Signals, by Jung-sun Lee et al.
-
Summary of Enhancing Financial Domain Adaptation Of Language Models Via Model Augmentation, by Kota Tanabe et al.
-
Summary of Cross Space and Time: a Spatio-temporal Unitized Model For Traffic Flow Forecasting, by Weilin Ruan et al.
-
Summary of Cross-modal Consistency in Multimodal Large Language Models, by Xiang Zhang et al.
-
Summary of Multi-scale Generative Modeling For Fast Sampling, by Xiongye Xiao et al.