Paper List
We recommend you use the search box as this list is very long.
-
Summary of Greek2mathtex: a Greek Speech-to-text Framework For Latex Equations Generation, by Evangelia Gkritzali et al.
-
Summary of Ai Adoption to Combat Financial Crime: Study on Natural Language Processing in Adverse Media Screening Of Financial Services in English and Bangla Multilingual Interpretation, by Soumita Roy
-
Summary of Relieving Universal Label Noise For Unsupervised Visible-infrared Person Re-identification by Inferring From Neighbors, By Xiao Teng et al.
-
Summary of A Notso Simple Way to Beat Simple Bench, by Soham Sane and Angus Mclean
-
Summary of Can Video Generation Replace Cinematographers? Research on the Cinematic Language Of Generated Video, by Xiaozhe Li et al.
-
Summary of Bioragent: a Retrieval-augmented Generation System For Showcasing Generative Query Expansion and Domain-specific Search For Scientific Q&a, by Samy Ateia et al.
-
Summary of The Ramanujan Library — Automated Discovery on the Hypergraph Of Integer Relations, by Itay Beit-halachmi et al.
-
Summary of How Different Ai Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games, by Yutong Xie et al.
-
Summary of Automated Generation Of Massive Reasonable Empirical Theorems by Forward Reasoning Based on Strong Relevant Logics — a Solution to the Problem Of Llm Pre-training Data Exhaustion, By Jingde Cheng
-
Summary of Improving Cooperation in Language Games with Bayesian Inference and the Cognitive Hierarchy, by Joseph Bills et al.
-
Summary of Three Things to Know About Deep Metric Learning, by Yash Patel et al.
-
Summary of Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy, By Aditya Ganeshan et al.
-
Summary of Rareagents: Advancing Rare Disease Care Through Llm-empowered Multi-disciplinary Team, by Xuanzhong Chen et al.
-
Summary of Boosting Long-context Management Via Query-guided Activation Refilling, by Hongjin Qian et al.
-
Summary of Re-attentional Controllable Video Diffusion Editing, by Yuanzhi Wang et al.
-
Summary of Llms Can Simulate Standardized Patients Via Agent Coevolution, by Zhuoyun Du et al.
-
Summary of Transferable Adversarial Face Attack with Text Controlled Attribute, by Wenyun Li et al.
-
Summary of Drivegazen: Event-based Driving Status Recognition Using Conventional Camera, by Xiaoyin Yang
-
Summary of Ami-net: Adaptive Mask Inpainting Network For Industrial Anomaly Detection and Localization, by Wei Luo et al.
-
Summary of Physaug: a Physical-guided and Frequency-based Data Augmentation For Single-domain Generalized Object Detection, by Xiaoran Xu et al.
-
Summary of A Theory Of Formalisms For Representing Knowledge, by Heng Zhang and Guifei Jiang and Donghui Quan
-
Summary of A Variable Occurrence-centric Framework For Inconsistency Handling (extended Version), by Yakoub Salhi
-
Summary of Punchbench: Benchmarking Mllms in Multimodal Punchline Comprehension, by Kun Ouyang et al.
-
Summary of Retrollm: Empowering Large Language Models to Retrieve Fine-grained Evidence Within Generation, by Xiaoxi Li et al.
-
Summary of Picle: Pseudo-annotations For In-context Learning in Low-resource Named Entity Detection, by Sepideh Mamooler et al.
-
Summary of Explainable Procedural Mistake Detection, by Shane Storks et al.
-
Summary of Seagraph: Unveiling the Whole Story Of Paper Review Comments, by Jianxiang Yu et al.
-
Summary of Stepwise Reasoning Error Disruption Attack Of Llms, by Jingyu Peng et al.
-
Summary of Openreviewer: a Specialized Large Language Model For Generating Critical Scientific Paper Reviews, by Maximilian Idahl et al.
-
Summary of Fairness Shields: Safeguarding Against Biased Decision Makers, by Filip Cano et al.
-
Summary of Agentic Ai-driven Technical Troubleshooting For Enterprise Systems: a Novel Weighted Retrieval-augmented Generation Paradigm, by Rajat Khanda
-
Summary of Cp-guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception, by Senkang Hu et al.
-
Summary of Fsfm: a Generalizable Face Security Foundation Model Via Self-supervised Facial Representation Learning, by Gaojian Wang et al.
-
Summary of Artificial Intelligence in Traffic Systems, by Ritwik Raj Saxena
-
Summary of Theoretical Analysis Of Quality Diversity Algorithms For a Classical Path Planning Problem, by Duc-cuong Dang and Aneta Neumann and Frank Neumann and Andre Opris and Dirk Sudholt
-
Summary of Towards Better Multi-task Learning: a Framework For Optimizing Dataset Combinations in Large Language Models, by Zaifu Zhan et al.
-
Summary of Efficient Policy Adaptation with Contrastive Prompt Ensemble For Embodied Agents, by Wonje Choi et al.
-
Summary of Embodied Cot Distillation From Llm to Off-the-shelf Agents, by Wonje Choi et al.
-
Summary of Intention Knowledge Graph Construction For User Intention Relation Modeling, by Jiaxin Bai et al.
-
Summary of Glimpse: Enabling White-box Methods to Use Proprietary Models For Zero-shot Llm-generated Text Detection, by Guangsheng Bao et al.
-
Summary of Dart: An Aigt Detector Using Amr Of Rephrased Text, by Hyeonchu Park et al.
-
Summary of Editsplat: Multi-view Fusion and Attention-guided Optimization For View-consistent 3d Scene Editing with 3d Gaussian Splatting, by Dong in Lee et al.
-
Summary of Meralion-speechencoder: Towards a Speech Foundation Model For Singapore and Beyond, by Muhammad Huzaifah et al.
-
Summary of Ts-satfire: a Multi-task Satellite Image Time-series Dataset For Wildfire Detection and Prediction, by Yu Zhao and Sebastian Gerard and Yifang Ban
-
Summary of Combating Semantic Contamination in Learning with Label Noise, by Wenxiao Fan et al.
-
Summary of Introduction to Ai Planning, by Marco Aiello and Ilche Georgievski
-
Summary of Se-gcl: An Event-based Simple and Effective Graph Contrastive Learning For Text Representation, by Tao Meng et al.
-
Summary of A Comprehensive Geoai Review: Progress, Challenges and Outlooks, by Anasse Boutayeb and Iyad Lahsen-cherif and Ahmed El Khadimi
-
Summary of Llm-daas: Llm-driven Drone-as-a-service Operations From Text User Requests, by Lillian Wassim et al.
-
Summary of Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach, by Daiki Shirafuji et al.
-
Summary of Multilingual and Explainable Text Detoxification with Parallel Corpora, by Daryna Dementieva et al.
-
Summary of Vocabulary Expansion Of Chat Models with Unlabeled Target Language Data, by Atsuki Yamaguchi et al.
-
Summary of Scenellm: Implicit Language Reasoning in Llm For Dynamic Scene Graph Generation, by Hang Zhang et al.
-
Summary of Nitro: Llm Inference on Intel Laptop Npus, by Anthony Fei et al.
-
Summary of Rac3: Retrieval-augmented Corner Case Comprehension For Autonomous Driving with Vision-language Models, by Yujin Wang et al.
-
Summary of Law: Legal Agentic Workflows For Custody and Fund Services Contracts, by William Watson et al.
-
Summary of Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems Using Large Multimodal Models, by Sebastian Gutierrez et al.
-
Summary of Ad-llm: Benchmarking Large Language Models For Anomaly Detection, by Tiankai Yang et al.
-
Summary of Efficient Quantization-aware Training on Segment Anything Model in Medical Images and Its Deployment, by Haisheng Lu et al.
-
Summary of Leveraging Large Language Models For Active Merchant Non-player Characters, by Byungjun Kim et al.
-
Summary of Distribution-consistency-guided Multi-modal Hashing, by Jin-yu Liu et al.
-
Summary of Task-oriented Dialog Systems For the Senegalese Wolof Language, by Derguene Mbaye and Moussa Diallo
-
Summary of Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversations, by Sayantan Pal et al.
-
Summary of Vividface: a Diffusion-based Hybrid Framework For High-fidelity Video Face Swapping, by Hao Shao et al.
-
Summary of Cater: Leveraging Llm to Pioneer a Multidimensional, Reference-independent Paradigm in Translation Quality Evaluation, by Kurando Iida et al.
-
Summary of Segment-level Diffusion: a Framework For Controllable Long-form Generation with Diffusion Language Models, by Xiaochen Zhu et al.
-
Summary of Detecting Daily Living Gait Amid Huntington’s Disease Chorea Using a Foundation Deep Learning Model, by Dafna Schwartz et al.
-
Summary of Can Ai Extract Antecedent Factors Of Human Trust in Ai? An Application Of Information Extraction For Scientific Literature in Behavioural and Computer Sciences, by Melanie Mcgrath et al.
-
Summary of Codenames As a Benchmark For Large Language Models, by Matthew Stephenson et al.
-
Summary of Adapting Segment Anything Model (sam) to Experimental Datasets Via Fine-tuning on Gan-based Simulation: a Case Study in Additive Manufacturing, by Anika Tabassum et al.
-
Summary of Attention with Dependency Parsing Augmentation For Fine-grained Attribution, by Qiang Ding et al.
-
Summary of Multi-modal and Multi-scale Spatial Environment Understanding For Immersive Visual Text-to-speech, by Rui Liu and Shuwei He and Yifan Hu and Haizhou Li
-
Summary of Efficient Adaptation Of Multilingual Models For Japanese Asr, by Mark Bajo et al.
-
Summary of Hitgram: a Platform For Experimenting with N-gram Language Models, by Shibaranjani Dasgupta et al.
-
Summary of Just a Few Glances: Open-set Visual Perception with Image Prompt Paradigm, by Jinrong Zhang et al.
-
Summary of Rebalanced Vision-language Retrieval Considering Structure-aware Distillation, by Yang Yang et al.
-
Summary of Sample-efficient Unsupervised Policy Cloning From Ensemble Self-supervised Labeled Videos, by Xin Liu and Yaran Chen
-
Summary of Optimizing Few-step Sampler For Diffusion Probabilistic Model, by Jen-yuan Huang
-
Summary of Medical Manifestation-aware De-identification, by Yuan Tian et al.
-
Summary of Enhance Vision-language Alignment with Noise, by Sida Huang et al.
-
Summary of Rethinking Chain-of-thought From the Perspective Of Self-training, by Zongqian Wu et al.
-
Summary of Superhuman Performance Of a Large Language Model on the Reasoning Tasks Of a Physician, by Peter G. Brodeur et al.
-
Summary of Heterogeneous Graph Transformer For Multiple Tiny Object Tracking in Rgb-t Videos, by Qingyu Xu et al.
-
Summary of Llms-in-the-loop Part 2: Expert Small Ai Models For Anonymization and De-identification Of Phi Across Multiple Languages, by Murat Gunay et al.
-
Summary of Tokens, the Oft-overlooked Appetizer: Large Language Models, the Distributional Hypothesis, and Meaning, by Julia Witte Zimmerman et al.
-
Summary of Recursive Aggregates As Intensional Functions in Answer Set Programming: Semantics and Strong Equivalence, by Jorge Fandinno and Zachary Hansen
-
Summary of Medg-krp: Medical Graph Knowledge Representation Probing, by Gabriel R. Rosenbaum et al.
-
Summary of Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection, by Ahmed Haj Ahmed et al.
-
Summary of Rapidnet: Multi-level Dilated Convolution Based Mobile Backbone, by Mustafa Munir et al.
-
Summary of From Simple to Professional: a Combinatorial Controllable Image Captioning Agent, by Xinran Wang et al.
-
Summary of Dual Traits in Probabilistic Reasoning Of Large Language Models, by Shenxiong Li et al.
-
Summary of Multi-level Matching Network For Multimodal Entity Linking, by Zhiwei Hu et al.
-
Summary of Sweettok: Semantic-aware Spatial-temporal Tokenizer For Compact Video Discretization, by Zhentao Tan et al.
-
Summary of Disentanglement and Compositionality Of Letter Identity and Letter Position in Variational Auto-encoder Vision Models, by Bruno Bianchi et al.
-
Summary of Unlocking Visual Secrets: Inverting Features with Diffusion Priors For Image Reconstruction, by Sai Qian Zhang et al.
-
Summary of Geo-llava: a Large Multi-modal Model For Solving Geometry Math Problems with Meta In-context Learning, by Shihao Xu et al.