Paper List
We recommend you use the search box as this list is very long.
-
Summary of Prefix Text As a Yarn: Eliciting Non-english Alignment in Foundation Language Model, by Runzhe Zhan et al.
-
Summary of Consistentid: Portrait Generation with Multimodal Fine-grained Identity Preserving, by Jiehui Huang et al.
-
Summary of Make Your Llm Fully Utilize the Context, by Shengnan An et al.
-
Summary of Mammo-clip: Leveraging Contrastive Language-image Pre-training (clip) For Enhanced Breast Cancer Diagnosis with Multi-view Mammography, by Xuxin Chen et al.
-
Summary of Maggie: Masked Guided Gradual Human Instance Matting, by Chuong Huynh et al.
-
Summary of Fairdedup: Detecting and Mitigating Vision-language Fairness Disparities in Semantic Dataset Deduplication, by Eric Slyman and Stefan Lee and Scott Cohen and Kushal Kafle
-
Summary of Classifying Human-generated and Ai-generated Election Claims in Social Media, by Alphaeus Dmonte et al.
-
Summary of A Survey on Generative Ai and Llm For Video Generation, Understanding, and Streaming, by Pengyuan Zhou et al.
-
Summary of From Local to Global: a Graph Rag Approach to Query-focused Summarization, by Darren Edge et al.
-
Summary of Domain-specific Improvement on Psychotherapy Chatbot Using Assistant, by Cheng Kang and Daniel Novak and Katerina Urbanova and Yuqing Cheng and Yong Hu
-
Summary of Url: Universal Referential Knowledge Linking Via Task-instructed Representation Compression, by Zhuoqun Li et al.
-
Summary of Knowledge Graph Completion Using Structural and Textual Embeddings, by Sakher Khalil Alqaaidi et al.
-
Summary of Translation Of Multifaceted Data Without Re-training Of Machine Translation Systems, by Hyeonseok Moon et al.
-
Summary of Llm-based Section Identifiers Excel on Open Source but Stumble in Real World Applications, by Saranya Krishnamoorthy et al.
-
Summary of Research on Splicing Image Detection Algorithms Based on Natural Image Statistical Characteristics, by Ao Xiang et al.
-
Summary of Semantic Segmentation Refiner For Ultrasound Applications with Zero-shot Foundation Models, by Hedda Cohen Indelman et al.
-
Summary of Imwa: Iterative Model Weight Averaging Benefits Class-imbalanced Learning Tasks, by Zitong Huang et al.
-
Summary of Rezero: Boosting Mcts-based Algorithms by Backward-view and Entire-buffer Reanalyze, By Chunyu Xuan et al.
-
Summary of Training-free Unsupervised Prompt For Vision-language Models, by Sifan Long et al.
-
Summary of Visla Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations, by Sri Harsha Dumpala et al.
-
Summary of List Items One by One: a New Data Source and Learning Paradigm For Multimodal Llms, By An Yan et al.
-
Summary of Optimal and Bounded Suboptimal Any-angle Multi-agent Pathfinding, by Konstantin Yakovlev et al.
-
Summary of Label-free Topic-focused Summarization Using Query Augmentation, by Wenchuan Mu and Kwan Hui Lim
-
Summary of Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models, by Mihir Parmar et al.
-
Summary of Prism: Patient Records Interpretation For Semantic Clinical Trial Matching Using Large Language Models, by Shashi Kant Gupta et al.
-
Summary of Multi-agent Reinforcement Learning For Energy Networks: Computational Challenges, Progress and Open Problems, by Sarah Keren and Chaimaa Essayeh and Stefano V. Albrecht and Thomas Morstyn
-
Summary of Hybrid Llm/rule-based Approaches to Business Insights Generation From Structured Data, by Aliaksei Vertsel et al.
-
Summary of Priornet: a Novel Lightweight Network with Multidimensional Interactive Attention For Efficient Image Dehazing, by Yutong Chen et al.
-
Summary of The Promise and Challenges Of Using Llms to Accelerate the Screening Process Of Systematic Reviews, by Aleksi Huotala et al.
-
Summary of Hdbn: a Novel Hybrid Dual-branch Network For Robust Skeleton-based Action Recognition, by Jinfu Liu and Baiqiao Yin and Jiaying Lin and Jiajun Wen and Yue Li and Mengyuan Liu
-
Summary of Beyond Chain-of-thought: a Survey Of Chain-of-x Paradigms For Llms, by Yu Xia et al.
-
Summary of Ada-df: An Adaptive Label Distribution Fusion Network For Facial Expression Recognition, by Shu Liu et al.
-
Summary of Deepfeaturex Net: Deep Features Extractors Based Network For Discriminating Synthetic From Real Images, by Orazio Pontorno (1) et al.
-
Summary of Sparo: Selective Attention For Robust and Compositional Transformer Encodings For Vision, by Ankit Vani et al.
-
Summary of What Makes Multimodal In-context Learning Work?, by Folco Bertini Baldassini et al.
-
Summary of Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models, By Jacob Pfau et al.
-
Summary of Toward Physics-aware Deep Learning Architectures For Lidar Intensity Simulation, by Vivek Anand et al.
-
Summary of Real-time Compressed Sensing For Joint Hyperspectral Image Transmission and Restoration For Cubesat, by Chih-chung Hsu et al.
-
Summary of Raformer: Redundancy-aware Transformer For Video Wire Inpainting, by Zhong Ji et al.
-
Summary of Steal Now and Attack Later: Evaluating Robustness Of Object Detection Against Black-box Adversarial Attacks, by Erh-chung Chen et al.
-
Summary of Assessing the Potential Of Mid-sized Language Models For Clinical Qa, by Elliot Bolton et al.
-
Summary of Kgvalidator: a Framework For Automatic Validation Of Knowledge Graph Construction, by Jack Boylan et al.
-
Summary of Unexplored Faces Of Robustness and Out-of-distribution: Covariate Shifts in Environment and Sensor Domains, by Eunsu Baek et al.
-
Summary of Mixlora: Enhancing Large Language Models Fine-tuning with Lora-based Mixture Of Experts, by Dengchun Li and Yingzi Ma and Naizheng Wang and Zhengmao Ye and Zhiyuan Cheng and Yinghao Tang and Yan Zhang and Lei Duan and Jie Zuo and Cal Yang and Mingjie Tang
-
Summary of Text2grasp: Grasp Synthesis by Text Prompts Of Object Grasping Parts, By Xiaoyun Chang and Yi Sun
-
Summary of Pixels and Predictions: Potential Of Gpt-4v in Meteorological Imagery Analysis and Forecast Communication, by John R. Lawson et al.
-
Summary of Reducing Human-robot Goal State Divergence with Environment Design, by Kelsey Sikes et al.
-
Summary of Socratic Planner: Inquiry-based Zero-shot Planning For Embodied Instruction Following, by Suyeon Shin et al.
-
Summary of Measuring Diversity Of Game Scenarios, by Yuchen Li et al.
-
Summary of Automatic Layout Planning For Visually-rich Documents with Instruction-following Models, by Wanrong Zhu et al.
-
Summary of Ct-glip: 3d Grounded Language-image Pretraining with Ct Scans and Radiology Reports For Full-body Scenarios, by Jingyang Lin et al.
-
Summary of Culturebank: An Online Community-driven Knowledge Base Towards Culturally Aware Language Technologies, by Weiyan Shi et al.
-
Summary of Wandr: Intention-guided Human Motion Generation, by Markos Diomataris et al.
-
Summary of Sum Of Group Error Differences: a Critical Examination Of Bias Evaluation in Biometric Verification and a Dual-metric Measure, by Alaa Elobaid et al.
-
Summary of Wiki-llava: Hierarchical Retrieval-augmented Generation For Multimodal Llms, by Davide Caffagni et al.
-
Summary of Glod: Composing Global Contexts and Local Details in Image Generation, by Moyuru Yamada
-
Summary of Iryonlp at Mediqa-corr 2024: Tackling the Medical Error Detection & Correction Task on the Shoulders Of Medical Agents, by Jean-philippe Corbeil
-
Summary of Evaluating the Efficacy Of Large Language Models in Identifying Phishing Attempts, by Het Patel et al.
-
Summary of Id-aligner: Enhancing Identity-preserving Text-to-image Generation with Reward Feedback Learning, by Weifeng Chen et al.
-
Summary of Multi-scale Intervention Planning Based on Generative Design, by Ioannis Kavouras et al.
-
Summary of Killkan: the Automatic Speech Recognition Dataset For Kichwa with Morphosyntactic Information, by Chihiro Taguchi et al.
-
Summary of Tom-lm: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models, by Weizhi Tang et al.
-
Summary of Visual Delta Generator with Large Multi-modal Models For Semi-supervised Composed Image Retrieval, by Young Kyun Jang et al.
-
Summary of Snapkv: Llm Knows What You Are Looking For Before Generation, by Yuhong Li et al.
-
Summary of The Adversarial Ai-art: Understanding, Generation, Detection, and Benchmarking, by Yuying Li et al.
-
Summary of Cross-task Multi-branch Vision Transformer For Facial Expression and Mask Wearing Classification, by Armando Zhu et al.
-
Summary of Bayesian Example Selection Improves In-context Learning For Speech, Text, and Visual Modalities, by Siyin Wang et al.
-
Summary of Generate-on-graph: Treat Llm As Both Agent and Kg in Incomplete Knowledge Graph Question Answering, by Yao Xu et al.
-
Summary of Grounded Knowledge-enhanced Medical Vision-language Pre-training For Chest X-ray, by Qiao Deng et al.
-
Summary of A Survey Of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications, by Wenbo Shang et al.
-
Summary of Cnn2gnn: How to Bridge Cnn with Gnn, by Ziheng Jiao and Hongyuan Zhang and Xuelong Li
-
Summary of Leveraging Speech For Gesture Detection in Multimodal Communication, by Esam Ghaleb et al.
-
Summary of Beyond the Speculative Game: a Survey Of Speculative Execution in Large Language Models, by Chen Zhang et al.
-
Summary of Copronn: Concept-based Prototypical Nearest Neighbors For Explaining Vision Models, by Teodor Chiaburu et al.
-
Summary of Achieving >97% on Gsm8k: Deeply Understanding the Problems Makes Llms Better Solvers For Math Word Problems, by Qihuang Zhong et al.
-
Summary of Coarf: Controllable 3d Artistic Style Transfer For Radiance Fields, by Deheng Zhang et al.
-
Summary of A Review Of Deep Learning-based Information Fusion Techniques For Multimodal Medical Image Classification, by Yihao Li et al.
-
Summary of Cutdiffusion: a Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method, by Mingbao Lin et al.
-
Summary of Using Deep Reinforcement Learning to Promote Sustainable Human Behaviour on a Common Pool Resource Problem, by Raphael Koster et al.
-
Summary of Performance Characterization Of Expert Router For Scalable Llm Inference, by Josef Pichlmeier et al.
-
Summary of Do Not Think About Pink Elephant!, by Kyomin Hwang et al.
-
Summary of Fasttrack: Fast and Accurate Fact Tracing For Llms, by Si Chen et al.
-
Summary of Unlawful Proxy Discrimination: a Framework For Challenging Inherently Discriminatory Algorithms, by Hilde Weerts et al.
-
Summary of Bcfpl: Binary Classification Convnet Based Fast Parking Space Recognition with Low Resolution Image, by Shuo Zhang et al.
-
Summary of Mechanistic Interpretability For Ai Safety — a Review, by Leonard Bereska and Efstratios Gavves
-
Summary of Urbancross: Enhancing Satellite Image-text Retrieval with Cross-domain Adaptation, by Siru Zhong et al.
-
Summary of A Survey on Efficient Inference For Large Language Models, by Zixuan Zhou et al.
-
Summary of Explaining Arguments’ Strength: Unveiling the Role Of Attacks and Supports (technical Report), by Xiang Yin et al.
-
Summary of Automatic Discovery Of Visual Circuits, by Achyuta Rajaram et al.
-
Summary of Pre-calc: Learning to Use the Calculator Improves Numeracy in Language Models, by Vishruth Veerendranath et al.
-
Summary of Graphic Design with Large Multimodal Model, by Yutao Cheng et al.
-
Summary of Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph, by Xiaochen Kev Gao et al.
-
Summary of A Survey on Self-evolution Of Large Language Models, by Zhengwei Tao et al.
-
Summary of Evaluation Of Machine Translation Based on Semantic Dependencies and Keywords, by Kewei Yuan and Qiurong Zhao and Yang Xu and Xiao Zhang and Huansheng Ning
-
Summary of A Multimodal Automated Interpretability Agent, by Tamar Rott Shaham et al.
-
Summary of Graphmatcher: a Graph Representation Learning Approach For Ontology Matching, by Sefika Efeoglu
-
Summary of Epi-sql: Enhancing Text-to-sql Translation with Error-prevention Instructions, by Xiping Liu et al.
-
Summary of Reinforcement Of Explainability Of Chatgpt Prompts by Embedding Breast Cancer Self-screening Rules Into Ai Responses, By Yousef Khan and Ahmed Abdeen Hamed