Paper List
We recommend you use the search box as this list is very long.
-
Summary of A Survey on Quality Metrics For Text-to-image Generation, by Sebastian Hartwig et al.
-
Summary of Regennet: Towards Human Action-reaction Synthesis, by Liang Xu et al.
-
Summary of Queryagent: a Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-correction, by Xiang Huang et al.
-
Summary of Visionclip: An Med-aigc Based Ethical Language-image Foundation Model For Generalizable Retina Image Analysis, by Hao Wei et al.
-
Summary of Sim2real Within 5 Minutes: Efficient Domain Transfer with Stylized Gaussian Splatting For Endoscopic Images, by Junyang Wu et al.
-
Summary of Optimizing Language Augmentation For Multilingual Large Language Models: a Case Study on Korean, by Changsu Choi et al.
-
Summary of Inducing Individual Students’ Learning Strategies Through Homomorphic Pomdps, by Huifan Gao et al.
-
Summary of Boosting Flow-based Generative Super-resolution Models Via Learned Prior, by Li-yuan Tsao et al.
-
Summary of Dialectbench: a Nlp Benchmark For Dialects, Varieties, and Closely-related Languages, by Fahim Faisal et al.
-
Summary of Towards Neuro-symbolic Video Understanding, by Minkyu Choi et al.
-
Summary of Reward Guided Latent Consistency Distillation, by Jiachen Li et al.
-
Summary of Audio-visual Segmentation Via Unlabeled Frame Exploitation, by Jinxiang Liu et al.
-
Summary of From Pixels to Predictions: Spectrogram and Vision Transformer For Better Time Series Forecasting, by Zhen Zeng et al.
-
Summary of Tokensome: Towards a Genetic Vision-language Gpt For Explainable and Cognitive Karyotyping, by Haoxi Zhang et al.
-
Summary of Lost in Translation? Translation Errors and Challenges For Fair Assessment Of Text-to-image Models on Multilingual Concepts, by Michael Saxon et al.
-
Summary of Phd: a Chatgpt-prompted Visual Hallucination Evaluation Dataset, by Jiazhen Liu et al.
-
Summary of Scaling Data Diversity For Fine-tuning Language Models in Human Alignment, by Feifan Song et al.
-
Summary of Evaluation Ethics Of Llms in Legal Domain, by Ruizhe Zhang et al.
-
Summary of Research on Personal Credit Risk Assessment Methods Based on Causal Inference, by Jiaxin Wang et al.
-
Summary of Correcting Misinformation on Social Media with a Large Language Model, by Xinyi Zhou et al.
-
Summary of Mindeye2: Shared-subject Models Enable Fmri-to-image with 1 Hour Of Data, by Paul S. Scotti et al.
-
Summary of Causality From Bottom to Top: a Survey, by Abraham Itzhak Weinberg et al.
-
Summary of Stateflow: Enhancing Llm Task-solving Through State-driven Workflows, by Yiran Wu et al.
-
Summary of Lifted Causal Inference in Relational Domains, by Malte Luttermann et al.
-
Summary of Hawkeye: Training Video-text Llms For Grounding Text in Videos, by Yueqian Wang et al.
-
Summary of Read Between the Lines — Functionality Extraction From Readmes, by Prince Kumar et al.
-
Summary of A Question on the Explainability Of Large Language Models and the Word-level Univariate First-order Plausibility Assumption, by Jeremie Bogaert et al.
-
Summary of A Multi-constraint and Multi-objective Allocation Model For Emergency Rescue in Iot Environment, by Xinrun Xu and Zhanbiao Lian and Yurong Wu and Manying Lv and Zhiming Ding and Jian Yan and Shang Jiang
-
Summary of Kif: a Wikidata-based Framework For Integrating Heterogeneous Knowledge Sources, by Guilherme Lima et al.
-
Summary of Gradient Based Feature Attribution in Explainable Ai: a Technical Review, by Yongjie Wang et al.
-
Summary of Neuflow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices, by Zhiyong Zhang et al.
-
Summary of Videoagent: Long-form Video Understanding with Large Language Model As Agent, by Xiaohan Wang et al.
-
Summary of Belief Change Based on Knowledge Measures, by Umberto Straccia et al.
-
Summary of Visreas: Complex Visual Reasoning with Unanswerable Questions, by Syeda Nahida Akter et al.
-
Summary of Neural Erosion: Emulating Controlled Neurodegeneration and Aging in Ai Systems, by Antonios Alexos et al.
-
Summary of Explorer: Exploration-guided Reasoning For Textual Reinforcement Learning, by Kinjal Basu et al.
-
Summary of Robust Influence-based Training Methods For Noisy Brain Mri, by Minh-hao Van et al.
-
Summary of Development and Application Of a Monte Carlo Tree Search Algorithm For Simulating Da Vinci Code Game Strategies, by Ye Zhang et al.
-
Summary of Game and Reference: Policy Combination Synthesis For Epidemic Prevention and Control, by Zhiyi Tan et al.
-
Summary of Depression Detection on Social Media with Large Language Models, by Xiaochong Lan et al.
-
Summary of Segment Any Object Model (saom): Real-to-simulation Fine-tuning Strategy For Multi-class Multi-instance Segmentation, by Mariia Khan et al.
-
Summary of Exploring Chinese Humor Generation: a Study on Two-part Allegorical Sayings, by Rongwu Xu
-
Summary of Re-search For the Truth: Multi-round Retrieval-augmented Large Language Models Are Strong Fake News Detectors, by Guanghua Li et al.
-
Summary of Meta-cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models, by Zhuoqun Li et al.
-
Summary of Xlp: Explainable Link Prediction For Master Data Management, by Balaji Ganesan et al.
-
Summary of Self-consistency Boosts Calibration For Math Reasoning, by Ante Wang et al.
-
Summary of Surrogate Assisted Monte Carlo Tree Search in Combinatorial Optimization, by Saeid Amiri et al.
-
Summary of Take Care Of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction, by Ziyang Xu et al.
-
Summary of Radclip: Enhancing Radiologic Image Analysis Through Contrastive Language-image Pre-training, by Zhixiu Lu et al.
-
Summary of Fbpt: a Fully Binary Point Transformer, by Zhixing Hou et al.
-
Summary of Efficientvmamba: Atrous Selective Scan For Light Weight Visual Mamba, by Xiaohuan Pei et al.
-
Summary of Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation, by Peiran Wu et al.
-
Summary of Boundary Matters: a Bi-level Active Finetuning Framework, by Han Lu et al.
-
Summary of Don’t Half-listen: Capturing Key-part Information in Continual Instruction Tuning, by Yongquan He and Xuancheng Huang and Minghao Tang and Lingxun Meng and Xiang Li and Wei Lin and Wenyuan Zhang and Yifu Gao
-
Summary of Learning Physical Dynamics For Object-centric Visual Prediction, by Huilin Xu et al.
-
Summary of Enhancing Human-centered Dynamic Scene Understanding Via Multiple Llms Collaborated Reasoning, by Hang Zhang et al.
-
Summary of Intent-conditioned and Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning with Rlaif, by Amey Hengle et al.
-
Summary of Single- and Multi-agent Private Active Sensing: a Deep Neuroevolution Approach, by George Stamatelis et al.
-
Summary of Raft: Adapting Language Model to Domain Specific Rag, by Tianjun Zhang et al.
-
Summary of Efficient Detection Of Exchangeable Factors in Factor Graphs, by Malte Luttermann et al.
-
Summary of Autonode: a Neuro-graphic Self-learnable Engine For Cognitive Gui Automation, by Arkajit Datta et al.
-
Summary of Efficient Event-based Object Detection: a Hybrid Neural Network with Spatial and Temporal Attention, by Soikat Hasan Ahmed et al.
-
Summary of 3d-vla: a 3d Vision-language-action Generative World Model, by Haoyu Zhen and Xiaowen Qiu and Peihao Chen and Jincheng Yang and Xin Yan and Yilun Du and Yining Hong and Chuang Gan
-
Summary of Unmasking the Shadows Of Ai: Investigating Deceptive Capabilities in Large Language Models, by Linge Guo
-
Summary of Shapley Values-powered Framework For Fair Reward Split in Content Produced by Genai, By Alex Glinsky et al.
-
Summary of A Novel Nuanced Conversation Evaluation Framework For Large Language Models in Mental Health, by Alexander Marrapese et al.
-
Summary of Schema-aware Multi-task Learning For Complex Text-to-sql, by Yangjun Wu and Han Wang
-
Summary of A Knowledge-injected Curriculum Pretraining Framework For Question Answering, by Xin Lin et al.
-
Summary of A Hybrid Intelligence Method For Argument Mining, by Michiel Van Der Meer et al.
-
Summary of Linguistic Structure Induction From Language Models, by Omar Momen
-
Summary of Comprehensive Implementation Of Textcnn For Enhanced Collaboration Between Natural Language Processing and System Recommendation, by Xiaonan Xu et al.
-
Summary of Mevaker: Conclusion Extraction and Allocation Resources For the Hebrew Language, by Vitaly Shalumov et al.
-
Summary of Fine-tuning Vs Prompting, Can Language Models Understand Human Values?, by Pingwei Sun
-
Summary of A Semantic Mention Graph Augmented Model For Document-level Event Argument Extraction, by Jian Zhang et al.
-
Summary of Enhancing Readmission Prediction with Deep Learning: Extracting Biomedical Concepts From Clinical Texts, by Rasoul Samani et al.
-
Summary of Rad-phi2: Instruction Tuning Phi-2 For Radiology, by Mercy Ranjit et al.
-
Summary of Pet-sql: a Prompt-enhanced Two-round Refinement Of Text-to-sql with Cross-consistency, by Zhishuai Li et al.
-
Summary of Overleafcopilot: Empowering Academic Writing in Overleaf with Large Language Models, by Haomin Wen et al.
-
Summary of Simulating Weighted Automata Over Sequences and Trees with Transformers, by Michael Rizvi et al.
-
Summary of Evaluating Large Language Models As Generative User Simulators For Conversational Recommendation, by Se-eun Yoon et al.
-
Summary of Evaluating the Application Of Large Language Models to Generate Feedback in Programming Education, by Sven Jacobs and Steffen Jaschke
-
Summary of Sd-net: Symmetric-aware Keypoint Prediction and Domain Adaptation For 6d Pose Estimation in Bin-picking Scenarios, by Ding-tao Huang et al.
-
Summary of Griffon V2: Advancing Multimodal Perception with High-resolution Scaling and Visual-language Co-referring, by Yufei Zhan et al.
-
Summary of Localmamba: Visual State Space Model with Windowed Selective Scan, by Tao Huang et al.
-
Summary of Sketchinr: a First Look Into Sketches As Implicit Neural Representations, by Hmrishav Bandyopadhyay et al.
-
Summary of B-avibench: Towards Evaluating the Robustness Of Large Vision-language Model on Black-box Adversarial Visual-instructions, by Hao Zhang et al.
-
Summary of D3t: Distinctive Dual-domain Teacher Zigzagging Across Rgb-thermal Gap For Domain-adaptive Object Detection, by Dinh Phat Do et al.
-
Summary of Heuristic Reasoning in Ai: Instrumental Use and Mimetic Absorption, by Anirban Mukherjee et al.
-
Summary of A Multi-population Integrated Approach For Capacitated Location Routing, by Pengfei He et al.
-
Summary of Xcoop: Explainable Prompt Learning For Computer-aided Diagnosis Via Concept-guided Context Optimization, by Yequan Bie et al.
-
Summary of Mitigating Attribute Amplification in Counterfactual Image Generation, by Tian Xia et al.
-
Summary of 3d-scenedreamer: Text-driven 3d-consistent Scene Generation, by Frank Zhang et al.
-
Summary of Clinical Reasoning Over Tabular Data and Text with Bayesian Networks, by Paloma Rabaey et al.
-
Summary of Rectifying Demonstration Shortcut in In-context Learning, by Joonwon Jang et al.
-
Summary of What Sketch Explainability Really Means For Downstream Tasks, by Hmrishav Bandyopadhyay et al.
-
Summary of Trust Ai Regulation? Discerning Users Are Vital to Build Trust and Effective Ai Regulation, by Zainab Alalawi et al.
-
Summary of Visiongpt-3d: a Generalized Multimodal Agent For Enhanced 3d Vision Understanding, by Chris Kelly et al.