Paper List
We recommend using the search box, as this list is very long.
-
Summary of Funnynet-w: Multimodal Learning Of Funny Moments in Videos in the Wild, by Zhi-song Liu et al.
-
Summary of Attack-resilient Image Watermarking Using Stable Diffusion, by Lijun Zhang et al.
-
Summary of Starcraftimage: a Dataset For Prototyping Spatial Reasoning Methods For Multi-agent Environments, by Sean Kulinski et al.
-
Summary of Has Your Pretrained Model Improved? a Multi-head Posterior Based Approach, by Prince Aboagye et al.
-
Summary of Evaluating Large Language Models on the Gmat: Implications For the Future Of Business Education, by Vahid Ashrafimoghari et al.
-
Summary of Advanced Unstructured Data Processing For Esg Reports: a Methodology For Structured Transformation and Enhanced Analysis, by Jiahui Peng et al.
-
Summary of Refusion: Improving Natural Language Understanding with Computation-efficient Retrieval Representation Fusion, by Shangyu Wu et al.
-
Summary of Blending Is All You Need: Cheaper, Better Alternative to Trillion-parameters Llm, by Xiaoding Lu et al.
-
Summary of Canamrf: An Attention-based Model For Multimodal Depression Detection, by Yuntao Wei et al.
-
Summary of Umie: Unified Multimodal Information Extraction with Instruction Tuning, by Lin Sun et al.
-
Summary of Xxai: Towards Explicitly Explainable Artificial Intelligence, by V. L. Kalmykov et al.
-
Summary of Cot-driven Framework For Short Text Classification: Enhancing and Transferring Capabilities From Large to Smaller Model, by Hui Wu et al.
-
Summary of Manifold-based Shapley For Sar Recognization Network Explanation, by Xuran Hu et al.
-
Summary of Mpn: Leveraging Multilingual Patch Neuron For Cross-lingual Model Editing, by Nianwen Si et al.
-
Summary of A Survey on Verification and Validation, Testing and Evaluations Of Neurosymbolic Artificial Intelligence, by Justus Renkhoff et al.
-
Summary of Mirrordiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond, by Yupei Lin et al.
-
Summary of Posdiffnet: Positional Neural Diffusion For Point Cloud Registration in a Large Field Of View with Perturbations, by Rui She et al.
-
Summary of Real Time Human Detection by Unmanned Aerial Vehicles, by Walid Guettala and Ali Sayah and Laid Kahloul and Ahmed Tibermacine
-
Summary of Exploiting Data Hierarchy As a New Modality For Contrastive Learning, by Arjun Bhalla et al.
-
Summary of Escalation Risks From Language Models in Military and Diplomatic Decision-making, by Juan-pablo Rivera et al.
-
Summary of On Leveraging Large Language Models For Enhancing Entity Resolution: a Cost-efficient Approach, by Huahang Li et al.
-
Summary of Exploring Large Language Model Based Intelligent Agents: Definitions, Methods, and Prospects, by Yuheng Cheng et al.
-
Summary of Learning Image Demoireing From Unpaired Real Data, by Yunshan Zhong et al.
-
Summary of Complementary Information Mutual Learning For Multimodality Medical Image Segmentation, by Chuyun Shen and Wenhao Li and Haoqing Chen and Xiaoling Wang and Fengping Zhu and Yuxin Li and Xiangfeng Wang and Bo Jin
-
Summary of Enhancing Targeted Transferability Via Feature Space Fine-tuning, by Hui Zeng et al.
-
Summary of Parameter-efficient Sparsity Crafting From Dense to Mixture-of-experts For Instruction Tuning on General Tasks, by Haoyuan Wu et al.
-
Summary of Hyperparameter-free Approach For Faster Minimum Bayes Risk Decoding, by Yuu Jinnai and Kaito Ariu
-
Summary of Mami: Multi-attentional Mutual-information For Long Sequence Neuron Captioning, by Alfirsa Damasyifa Fauzulhaq et al.
-
Summary of Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging, by Jinlong He et al.
-
Summary of From Llm to Conversational Agent: a Memory Enhanced Architecture with Fine-tuning Of Large Language Models, by Na Liu et al.
-
Summary of Crisisvit: a Robust Vision Transformer For Crisis Image Classification, by Zijun Long and Richard Mccreadie and Muhammad Imran
-
Summary of Natural Language Programming in Medicine: Administering Evidence Based Clinical Workflows with Autonomous Agents Powered by Generative Large Language Models, by Akhil Vaid et al.
-
Summary of A Customizable Generator For Comic-style Visual Narrative, by Yi-chun Chen et al.
-
Summary of Deep Anomaly Detection in Text, by Andrei Manolache
-
Summary of Efficacy Of Utilizing Large Language Models to Detect Public Threat Posted Online, by Taeksoo Kwon (Algorix Convergence Research Office) et al.
-
Summary of Trace and Edit Relation Associations in Gpt, by Jiahang Li et al.
-
Summary of Learning From a Generative Ai Predecessor — the Many Motivations For Interacting with Conversational Agents, by Donald Brinkman and Jonathan Grudin
-
Summary of Findabench: Benchmarking Financial Data Analysis Ability Of Large Language Models, by Shu Liu et al.
-
Summary of Large Language Models in Mental Health Care: a Scoping Review, by Yining Hua et al.
-
Summary of Fine-tuning and Utilization Methods Of Domain-specific Llms, by Cheonsu Jeong
-
Summary of Self-contrast: Better Reflection Through Inconsistent Solving Perspectives, by Wenqi Zhang et al.
-
Summary of Revisiting Zero-shot Abstractive Summarization in the Era Of Large Language Models From the Perspective Of Position Bias, by Anshuman Chhabra et al.
-
Summary of Sycoca: Symmetrizing Contrastive Captioners with Attentive Masking For Multimodal Alignment, by Ziping Ma et al.
-
Summary of Dcr-consistency: Divide-conquer-reasoning For Consistency Evaluation and Improvement Of Large Language Models, by Wendi Cui et al.
-
Summary of Prompt Decoupling For Text-to-image Person Re-identification, by Weihao Li et al.
-
Summary of Shayona@smm4h23: Covid-19 Self Diagnosis Classification Using Bert and Lightgbm Models, by Rushi Chavda et al.
-
Summary of Mining Fine-grained Image-text Alignment For Zero-shot Captioning Via Text-only Training, by Longtian Qiu et al.
-
Summary of Joint Multi-facts Reasoning Network For Complex Temporal Question Answering Over Knowledge Graph, by Rikui Huang et al.
-
Summary of Survey Of 3d Human Body Pose and Shape Estimation Methods For Contemporary Dance Applications, by Darshan Venkatrayappa et al.
-
Summary of Tinyllama: An Open-source Small Language Model, by Peiyuan Zhang et al.
-
Summary of On the Prospects Of Incorporating Large Language Models (llms) in Automated Planning and Scheduling (aps), by Vishal Pallagani et al.
-
Summary of Quantitative Technology Forecasting: a Review Of Trend Extrapolation Methods, by Peng-hung Tsai et al.
-
Summary of Characterizing Satellite Geometry Via Accelerated 3d Gaussian Splatting, by Van Minh Nguyen and Emma Sandidge and Trupti Mahendrakar and Ryan T. White
-
Summary of Object-oriented Backdoor Attack Against Image Captioning, by Meiling Li et al.
-
Summary of Progress and Prospects in 3d Generative Ai: a Technical Overview Including 3d Human, by Song Bai et al.
-
Summary of Verifying Relational Explanations: a Probabilistic Approach, by Abisha Thapa Magar et al.
-
Summary of Training and Serving System Of Foundation Models: a Comprehensive Survey, by Jiahang Zhou et al.
-
Summary of Progressive Knowledge Distillation Of Stable Diffusion Xl Using Layer Level Loss, by Yatharth Gupta et al.
-
Summary of Xuat-copilot: Multi-agent Collaborative System For Automated User Acceptance Testing with Large Language Model, by Zhitao Wang et al.
-
Summary of Identiface: a Vgg Based Multimodal Facial Biometric System, by Mahmoud Rabea et al.
-
Summary of Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models, by Matthew Dahl et al.
-
Summary of Quantifying the Uniqueness Of Donald Trump in Presidential Discourse, by Karen Zhou et al.
-
Summary of Outlier Ranking in Large-scale Public Health Streams, by Ananya Joshi et al.
-
Summary of Goat-bench: Safety Insights to Large Multimodal Models Through Meme-based Social Abuse, by Hongzhan Lin et al.
-
Summary of Question-answering Based Summarization Of Electronic Health Records Using Retrieval Augmented Generation, by Walid Saba et al.
-
Summary of Medsumm: a Multimodal Approach to Summarizing Code-mixed Hindi-english Clinical Queries, by Akash Ghosh et al.
-
Summary of Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication, by Philip Chung et al.
-
Summary of Can Ai Be As Creative As Humans?, by Haonan Wang et al.
-
Summary of A Cybersecurity Risk Analysis Framework For Systems with Artificial Intelligence Components, by Jose Manuel Camacho et al.
-
Summary of A Novel Paradigm For Neural Computation: X-net with Learnable Neurons and Adaptable Structure, by Yanjie Li et al.
-
Summary of Aigcbench: Comprehensive Evaluation Of Image-to-video Content Generated by Ai, by Fanda Fan et al.
-
Summary of A Generative Ai Assistant to Accelerate Cloud Migration, by Amal Vaidya et al.
-
Summary of Large Language Models Relearn Removed Concepts, by Michelle Lo et al.
-
Summary of Neural Control: Concurrent System Identification and Control Learning with Neural Ode, by Cheng Chi
-
Summary of Step Length Measurement in the Wild Using Fmcw Radar, by Parthipan Siva et al.
-
Summary of Instruct-imagen: Image Generation with Multi-modal Instruction, by Hexiang Hu et al.
-
Summary of A Mechanistic Understanding Of Alignment Algorithms: a Case Study on Dpo and Toxicity, by Andrew Lee et al.
-
Summary of Fmgs: Foundation Model Embedded 3d Gaussian Splatting For Holistic 3d Scene Understanding, by Xingxing Zuo et al.
-
Summary of Temporal Validity Change Prediction, by Georg Wenzel and Adam Jatowt
-
Summary of Astraios: Parameter-efficient Instruction Tuning Code Large Language Models, by Terry Yue Zhuo et al.
-
Summary of Taking the Next Step with Generative Artificial Intelligence: the Transformative Role Of Multimodal Large Language Models in Science Education, by Arne Bewersdorff et al.
-
Summary of Refining Pre-trained Motion Models, by Xinglong Sun et al.
-
Summary of Towards Bridging the Gap Between High-level Reasoning and Execution on Robots, by Till Hofmann
-
Summary of Masked Modeling For Self-supervised Representation Learning on Vision and Beyond, by Siyuan Li et al.
-
Summary of Accurate Leukocyte Detection Based on Deformable-detr and Multi-level Feature Fusion For Aiding Diagnosis Of Blood Diseases, by Yifei Chen et al.
-
Summary of Real-time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning, by Syed Muhammad Aamir et al.
-
Summary of Safety and Performance, Why Not Both? Bi-objective Optimized Model Compression Against Heterogeneous Attacks Toward Ai Software Deployment, by Jie Zhu et al.
-
Summary of Fast Sampling Through the Reuse Of Attention Maps in Diffusion Models, by Rosco Hunter et al.
-
Summary of Towards Cognitive Ai Systems: a Survey and Prospective on Neuro-symbolic Ai, by Zishen Wan et al.
-
Summary of Llama Beyond English: An Empirical Study on Language Capability Transfer, by Jun Zhao et al.
-
Summary of Discovering Significant Topics From Legal Decisions with Selective Inference, by Jerrold Soh
-
Summary of Vietnamese Poem Generation & the Prospect Of Cross-language Poem-to-poem Translation, by Triet Minh Huynh and Quan Le Bao
-
Summary of Quokka: An Open-source Large Language Model Chatbot For Material Science, by Xianjun Yang et al.
-
Summary of Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery, by Asim Khan et al.