Paper List
We recommend you use the search box as this list is very long.
-
Summary of Entity Alignment with Noisy Annotations From Large Language Models, by Shengyuan Chen et al.
-
Summary of Unified Editing Of Panorama, 3d Scenes, and Videos Through Disentangled Self-attention Injection, by Gihyun Kwon et al.
-
Summary of Tokenunify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction, by Yinda Chen et al.
-
Summary of Think Before You Act: a Two-stage Framework For Mitigating Gender Bias Towards Vision-language Tasks, by Yunqi Zhang et al.
-
Summary of Multiple Heads Are Better Than One: Mixture Of Modality Knowledge Experts For Entity Representation Learning, by Yichi Zhang et al.
-
Summary of A Large Language Model-based Multi-agent Manufacturing System For Intelligent Shopfloor, by Zhen Zhao et al.
-
Summary of Vocot: Unleashing Visually Grounded Multi-step Reasoning in Large Multi-modal Models, by Zejun Li et al.
-
Summary of Exploring the Llm Journey From Cognition to Expression with Linear Representations, by Yuzi Yan et al.
-
Summary of Position: Foundation Agents As the Paradigm Shift For Decision Making, by Xiaoqian Liu et al.
-
Summary of Vision-and-language Navigation Generative Pretrained Transformer, by Wen Hanlin
-
Summary of Compositional Few-shot Class-incremental Learning, by Yixiong Zou et al.
-
Summary of Retro-prob: Retrosynthetic Planning Based on a Probabilistic Model, by Chengyang Tian and Yangpeng Zhang and Yang Liu
-
Summary of How Well Do Deep Learning Models Capture Human Concepts? the Case Of the Typicality Effect, by Siddhartha K. Vemuri et al.
-
Summary of Greencod: a Green Camouflaged Object Detection Method, by Hong-Shuo Chen et al.
-
Summary of Rocket Landing Control with Grid Fins and Path-following Using Mpc, by Junhao Yu et al.
-
Summary of Voodoo Xp: Expressive One-shot Head Reenactment For Vr Telepresence, by Phong Tran et al.
-
Summary of Automanual: Constructing Instruction Manuals by Llm Agents Via Interactive Environmental Learning, by Minghao Chen et al.
-
Summary of Assessing Image Inpainting Via Re-inpainting Self-consistency Evaluation, by Tianyi Chen et al.
-
Summary of Geneagent: Self-verification Language Agent For Gene Set Knowledge Discovery Using Domain Databases, by Zhizheng Wang et al.
-
Summary of Devil’s Advocate: Anticipatory Reflection For Llm Agents, by Haoyu Wang and Tao Li and Zhiwei Deng and Dan Roth and Yang Li
-
Summary of Learning to Reason Via Program Generation, Emulation, and Search, by Nathaniel Weir et al.
-
Summary of Disentangling Foreground and Background Motion For Enhanced Realism in Human Video Generation, by Jinlin Liu et al.
-
Summary of Assessing Empathy in Large Language Models with Real-world Physician-patient Interactions, by Man Luo et al.
-
Summary of Enhancing Feature Diversity Boosts Channel-adaptive Vision Transformers, by Chau Pham et al.
-
Summary of Cpsycoun: a Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework For Chinese Psychological Counseling, by Chenhao Zhang et al.
-
Summary of Vision-based Approach For Food Weight Estimation From 2d Images, by Chathura Wimalasiri et al.
-
Summary of The Importance Of Directional Feedback For Llm-based Optimizers, by Allen Nie et al.
-
Summary of Decomposing the Neurons: Activation Sparsity Via Mixture Of Experts For Continual Test Time Adaptation, by Rongyu Zhang et al.
-
Summary of Mamba4kt: an Efficient and Effective Mamba-based Knowledge Tracing Model, by Yang Cao et al.
-
Summary of Gamified Ai Approch For Early Detection Of Dementia, by Paramita Kundu Maji et al.
-
Summary of Sed: Self-evaluation Decoding Enhances Large Language Models For Better Generation, by Ziqin Luo et al.
-
Summary of Gecko: Generative Language Model For English, Code and Korean, by Sungwoo Oh and Donggyu Kim
-
Summary of Less Is More: Summarizing Patch Tokens For Efficient Multi-label Class-incremental Learning, by Thomas De Min et al.
-
Summary of Cohd: a Counting-aware Hierarchical Decoding Framework For Generalized Referring Expression Segmentation, by Zhuoyan Luo et al.
-
Summary of Exposing Image Classifier Shortcuts with Counterfactual Frequency (cof) Tables, by James Hinns et al.
-
Summary of Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in Lvlms, by Sreyan Ghosh and Chandra Kiran Reddy Evuru and Sonal Kumar and Utkarsh Tyagi and Oriol Nieto and Zeyu Jin and Dinesh Manocha
-
Summary of Instructavatar: Text-guided Emotion and Motion Control For Avatar Generation, by Yuchi Wang et al.
-
Summary of Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development, by Pranab Sahoo et al.
-
Summary of Finite Groundings For Asp with Functions: a Journey Through Consistency, by Lukas Gerlach (TU Dresden) et al.
-
Summary of Defeaters and Eliminative Argumentation in Assurance 2.0, by Robin Bloomfield et al.
-
Summary of Explainable Human-ai Interaction: a Planning Perspective, by Sarath Sreedharan et al.
-
Summary of Ensuring Ground Truth Accuracy in Healthcare with the Evince Framework, by Edward Y. Chang
-
Summary of Spotnet: An Image Centric, Lidar Anchored Approach to Long Range Perception, by Louis Foucard et al.
-
Summary of Free Performance Gain From Mixing Multiple Partially Labeled Samples in Multi-label Image Classification, by Chak Fong Chong et al.
-
Summary of Belief-state Query Policies For Planning with Preferences Under Partial Observability, by Daniel Bramblett et al.
-
Summary of Zero-shot Spam Email Classification Using Pre-trained Large Language Models, by Sergio Rojas-Galeano
-
Summary of Evaluating and Safeguarding the Adversarial Robustness Of Retrieval-based In-context Learning, by Simon Yu et al.
-
Summary of Synthai: a Multi Agent Generative Ai Framework For Automated Modular Hls Design Generation, by Seyed Arash Sheikholeslam et al.
-
Summary of Uncertainty Measurement Of Deep Learning System Based on the Convex Hull Of Training Sets, by Hyekyoung Hwang et al.
-
Summary of Mamballie: Implicit Retinex-aware Low Light Enhancement with Global-then-local State Space, by Jiangwei Weng et al.
-
Summary of Contrastive and Consistency Learning For Neural Noisy-channel Model in Spoken Language Understanding, by Suyoung Kim et al.
-
Summary of An Approximate Dynamic Programming Framework For Occlusion-robust Multi-object Tracking, by Pratyusha Musunuru et al.
-
Summary of Culturepark: Boosting Cross-cultural Understanding in Large Language Models, by Cheng Li et al.
-
Summary of Machine Unlearning in Large Language Models, by Saaketh Koundinya Gundavarapu et al.
-
Summary of A Solution-based Llm Api-using Methodology For Academic Information Seeking, by Yuanchun Wang et al.
-
Summary of An Evaluation Of Estimative Uncertainty in Large Language Models, by Zhisheng Tang et al.
-
Summary of Decoding at the Speed Of Thought: Harnessing Parallel Decoding Of Lexical Units For Llms, by Chenxi Sun et al.
-
Summary of Leveraging Unknown Objects to Construct Labeled-unlabeled Meta-relationships For Zero-shot Object Navigation, by Yanwei Zheng et al.
-
Summary of Self-contrastive Weakly Supervised Learning Framework For Prognostic Prediction Using Whole Slide Images, by Saul Fuster et al.
-
Summary of Retro: Reusing Teacher Projection Head For Efficient Embedding Distillation on Lightweight Models Via Self-supervised Learning, by Khanh-Binh Nguyen and Chae Jung Park
-
Summary of Are Long-llms a Necessity For Long-context Tasks?, by Hongjin Qian et al.
-
Summary of Stacking Your Transformers: a Closer Look at Model Growth For Efficient Llm Pre-training, by Wenyu Du et al.
-
Summary of Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search, by Nicola Dainese et al.
-
Summary of V-zen: Efficient Gui Understanding and Precise Grounding with a Novel Multimodal Llm, by Abdur Rahman et al.
-
Summary of Language-driven Interactive Traffic Trajectory Generation, by Junkai Xia et al.
-
Summary of Luban: Building Open-ended Creative Agents Via Autonomous Embodied Verification, by Yuxuan Guo et al.
-
Summary of Text-guided 3d Human Motion Generation with Keyframe-based Parallel Skip Transformer, by Zichen Geng et al.
-
Summary of Benchmarking the Performance Of Pre-trained Llms Across Urdu Nlp Tasks, by Munief Hassan Tahir et al.
-
Summary of Omni-epic: Open-endedness Via Models Of Human Notions Of Interestingness with Environments Programmed in Code, by Maxence Faldor et al.
-
Summary of Randomized Heuristic Repair For Large-scale Multidimensional Knapsack Problem, by Jean P. Martins
-
Summary of Artificial Intelligence (ai) in Legal Data Mining, by Aniket Deroy et al.
-
Summary of G3: An Effective and Adaptive Framework For Worldwide Geolocalization Using Large Multi-modality Models, by Pengyue Jia et al.
-
Summary of Towards Cross-modal Backward-compatible Representation Learning For Vision-language Models, by Young Kyun Jang et al.
-
Summary of Htn-based Tutors: a New Intelligent Tutoring Framework Based on Hierarchical Task Networks, by Momin N. Siddiqui et al.
-
Summary of Stylex: a Trainable Metric For X-ray Style Distances, by Dominik Eckert et al.
-
Summary of Topologic: An Interpretable Pipeline For Lane Topology Reasoning on Driving Scenes, by Yanping Fu et al.
-
Summary of Generative Plant Growth Simulation From Sequence-informed Environmental Conditions, by Mohamed Debbagh et al.
-
Summary of Synergistic Global-space Camera and Human Reconstruction From Videos, by Yizhou Zhao et al.
-
Summary of Puzzleavatar: Assembling 3d Avatars From Personal Albums, by Yuliang Xiu et al.
-
Summary of Precise and Robust Sidewalk Detection: Leveraging Ensemble Learning to Surpass Llm Limitations in Urban Environments, by Ibne Farabi Shihab et al.
-
Summary of Automatic Coral Detection with Yolo: a Deep Learning Approach For Efficient and Accurate Coral Reef Monitoring, by Ouassine Younes et al.
-
Summary of Dissecting Query-key Interaction in Vision Transformers, by Xu Pan et al.
-
Summary of Creativity and Markov Decision Processes, by Joonas Lahikainen et al.
-
Summary of Evggs: a Collaborative Learning Framework For Event-based Generalizable Gaussian Splatting, by Jiaxu Wang et al.
-
Summary of Lova3: Learning to Visual Question Answering, Asking and Assessment, by Henry Hengyuan Zhao et al.
-
Summary of Eliciting Informative Text Evaluations with Large Language Models, by Yuxuan Lu et al.
-
Summary of Generating Camera Failures As a Class Of Physics-based Adversarial Examples, by Manav Prabhakar and Jwalandhar Girnar and Arpan Kusari
-
Summary of Reframing Spatial Reasoning Evaluation in Language Models: a Real-world Simulation Benchmark For Qualitative Reasoning, by Fangjun Li et al.
-
Summary of Mudreamer: Learning Predictive World Models Without Reconstruction, by Maxime Burchi et al.
-
Summary of Dissociation Of Faithful and Unfaithful Reasoning in Llms, by Evelyn Yee and Alice Li and Chenyu Tang and Yeon Ho Jung and Ramamohan Paturi and Leon Bergen
-
Summary of Proving Theorems Recursively, by Haiming Wang et al.
-
Summary of A Motion-based Compression Algorithm For Resource-constrained Video Camera Traps, by Malika Nisal Ratnayake et al.
-
Summary of Pipefusion: Patch-level Pipeline Parallelism For Diffusion Transformers Inference, by Jiarui Fang and Jinzhe Pan and Jiannan Wang and Aoyu Li and Xibo Sun
-
Summary of Rafe: Ranking Feedback Improves Query Rewriting For Rag, by Shengyu Mao et al.
-
Summary of Jointrf: End-to-end Joint Optimization For Dynamic Neural Radiance Field Representation and Compression, by Zihan Zheng et al.
-
Summary of Exploring the Use Of a Large Language Model For Data Extraction in Systematic Reviews: a Rapid Feasibility Study, by Lena Schmidt et al.
-
Summary of Magicdrive3d: Controllable 3d Generation For Any-view Rendering in Street Scenes, by Ruiyuan Gao et al.
-
Summary of Enhanced Spatiotemporal Prediction Using Physical-guided and Frequency-enhanced Recurrent Neural Networks, by Xuanle Zhao et al.