Paper List
-
Summary of FocusLLM: Precise Understanding Of Long Context by Dynamic Condensing, by Zhenyu Li et al.
-
Summary of Open-ended 3D Point Cloud Instance Segmentation, by Phuc D.A. Nguyen et al.
-
Summary of Timeline and Boundary Guided Diffusion Network For Video Shadow Detection, by Haipeng Zhou et al.
-
Summary of Codi: Conversational Distillation For Grounded Question Answering, by Patrick Huber et al.
-
Summary of The Dilemma Of Uncertainty Estimation For General Purpose AI in the EU AI Act, by Matias Valdenegro-Toro and Radina Stoykova
-
Summary of Automatic Image Annotation (AIA) Of AlmondNet-20 Method For Almond Detection by Improved CNN-based Model, by Mohsen Asghari Ilani et al.
-
Summary of Towards Analyzing and Mitigating Sycophancy in Large Vision-language Models, by Yunpu Zhao et al.
-
Summary of Improving Speech Recognition Error Prediction For Modern and Off-the-shelf Speech Recognizers, by Prashant Serai et al.
-
Summary of Applying and Evaluating Large Language Models in Mental Health Care: a Scoping Review Of Human-assessed Generative Tasks, by Yining Hua et al.
-
Summary of Unifashion: a Unified Vision-language Model For Multimodal Fashion Retrieval and Generation, by Xiangyu Zhao et al.
-
Summary of EEG-Defender: Defending Against Jailbreak Through Early Exit Generation Of Large Language Models, by Chongwen Zhao et al.
-
Summary of Unlocking Adversarial Suffix Optimization Without Affirmative Phrases: Efficient Black-box Jailbreaking Via LLM As Optimizer, by Weipeng Jiang et al.
-
Summary of Swarm Intelligence in Geo-localization: a Multi-agent Large Vision-language Model Collaborative Framework, by Xiao Han et al.
-
Summary of Sarcasmbench: Towards Evaluating Large Language Models on Sarcasm Understanding, by Yazhou Zhang et al.
-
Summary of Probabilistic Medical Predictions Of Large Language Models, by Bowen Gu et al.
-
Summary of Automating Thought Of Search: a Journey Towards Soundness and Completeness, by Daniel Cao et al.
-
Summary of Multimodal Datasets and Benchmarks For Reasoning About Dynamic Spatio-temporality in Everyday Environments, by Takanori Ugai et al.
-
Summary of Plug, Play, and Fuse: Zero-shot Joint Decoding Via Word-level Re-ranking Across Diverse Vocabularies, by Sai Koneru et al.
-
Summary of Solving Decision Theory Problems with Probabilistic Answer Set Programming, by Damiano Azzolini et al.
-
Summary of Burextract-llama: An LLM For Clinical Concept Extraction in Breast Ultrasound Reports, by Yuxuan Chen et al.
-
Summary of Diagnosing and Remedying Knowledge Deficiencies in LLMs Via Label-free Curricular Meaningful Learning, by Kai Xiong et al.
-
Summary of Epistemic Injustice in Generative AI, by Jackie Kay et al.
-
Summary of Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities, by Hong Xie et al.
-
Summary of Dynamic Analysis and Adaptive Discriminator For Fake News Detection, by Xinqi Su et al.
-
Summary of Harmonizing Attention: Training-free Texture-aware Geometry Transfer, by Eito Ikuta et al.
-
Summary of V-roast: a New Dataset For Visual Road Assessment, by Natchapon Jongwiriyanurak et al.
-
Summary of On Learning Action Costs From Input Plans, by Marianela Morales et al.
-
Summary of Analytical and Empirical Study Of Herding Effects in Recommendation Systems, by Hong Xie et al.
-
Summary of Towards Efficient Formal Verification Of Spiking Neural Network, by Baekryun Seong et al.
-
Summary of MTFinEval: a Multi-domain Chinese Financial Benchmark with Eurypalynous Questions, by Xinyu Liu and Ke Jin
-
Summary of LBC: Language-based-classifier For Out-of-variable Generalization, by Kangjun Noh et al.
-
Summary of SDI-Net: Toward Sufficient Dual-view Interaction For Low-light Stereo Image Enhancement, by Linlin Hu et al.
-
Summary of Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models, by Kazi Hasan Ibn Arif et al.
-
Summary of Large Language Model Driven Recommendation, by Anton Korikov et al.
-
Summary of Dr.academy: a Benchmark For Evaluating Questioning Capability in Education For Large Language Models, by Yuyan Chen et al.
-
Summary of Hybrid Recurrent Models Support Emergent Descriptions For Hierarchical Planning and Control, by Poppy Collis et al.
-
Summary of Athena: Safe Autonomous Agents with Verbal Contrastive Learning, by Tanmana Sadhu et al.
-
Summary of Transfusion: Predict the Next Token and Diffuse Images with One Multi-modal Model, by Chunting Zhou and Lili Yu and Arun Babu and Kushal Tirumala and Michihiro Yasunaga and Leonid Shamis and Jacob Kahn and Xuezhe Ma and Luke Zettlemoyer and Omer Levy
-
Summary of Interactive-T2S: Multi-turn Interactions For Text-to-SQL with Large Language Models, by Guanming Xiong et al.
-
Summary of Flame: Learning to Navigate with Multimodal LLM in Urban Environments, by Yunzhe Xu et al.
-
Summary of Near, Far: Patch-ordering Enhances Vision Foundation Models’ Scene Understanding, by Valentinos Pariza et al.
-
Summary of Quantum Inverse Contextual Vision Transformers (Q-ICVT): a New Frontier in 3D Object Detection For AVs, by Sanjay Bhargav Dharavath et al.
-
Summary of Minor SFT Loss For LLM Fine-tune to Increase Performance and Reduce Model Deviation, by Shiming Xie et al.
-
Summary of Beneath the Surface Of Consistency: Exploring Cross-lingual Knowledge Representation Sharing in LLMs, by Maxim Ifergan et al.
-
Summary of Vocabulary-free 3D Instance Segmentation with Vision and Language Assistant, by Guofeng Mei and Luigi Riz and Yiming Wang and Fabio Poiesi
-
Summary of Rejection in Abstract Argumentation: Harder Than Acceptance?, by Johannes K. Fichte and Markus Hecher and Yasir Mahmood and Arne Meier
-
Summary of Genesis: Towards the Automation Of Systems Biology Research, by Ievgeniia A. Tiukova et al.
-
Summary of Fine-tuning and Deploying Large Language Models Over Edges: Issues and Approaches, by Yanjie Dong et al.
-
Summary of Coarse-to-fine Detection Of Multiple Seams For Robotic Welding, by Pengkun Wei et al.
-
Summary of Fine-tuning a Local Llama-3 Large Language Model For Automated Privacy-preserving Physician Letter Generation in Radiation Oncology, by Yihao Hou et al.
-
Summary of Investigating Context Effects in Similarity Judgements in Large Language Models, by Sagar Uprety et al.
-
Summary of Towards Efficient Large Language Models For Scientific Text: a Review, by Huy Quoc To et al.
-
Summary of Megen: Generative Backdoor in Large Language Models Via Model Editing, by Jiyang Qiu et al.
-
Summary of SAM-COD: SAM-guided Unified Framework For Weakly-supervised Camouflaged Object Detection, by Huafeng Chen et al.
-
Summary of Just a Hint: Point-supervised Camouflaged Object Detection, by Huafeng Chen et al.
-
Summary of Flexora: Flexible Low Rank Adaptation For Large Language Models, by Chenxing Wei et al.
-
Summary of Understanding the Skills Gap Between Higher Education and Industry in the UK in Artificial Intelligence Sector, by Khushi Jaiswal et al.
-
Summary of Beyond English-centric LLMs: What Language Do Multilingual Language Models Think In?, by Chengzhi Zhong et al.
-
Summary of GS-KGC: a Generative Subgraph-based Framework For Knowledge Graph Completion with Large Language Models, by Rui Yang and Jiahao Zhu and Jianping Man and Hongze Liu and Li Fang and Yi Zhou
-
Summary of Zebrapose: Zebra Detection and Pose Estimation Using Only Synthetic Data, by Elia Bonetto and Aamir Ahmad
-
Summary of Delia: Diversity-enhanced Learning For Instruction Adaptation in Large Language Models, by Yuanhao Zeng et al.
-
Summary of Detecting Wildfires on UAVs with Real-time Segmentation Trained by Larger Teacher Models, by Julius Pesonen et al.
-
Summary of QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning, by Yilun Kong et al.
-
Summary of Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-resource User Groups, by Zhiyang Qi and Michimasa Inaba
-
Summary of Approximate Estimation Of High-dimension Execution Skill For Dynamic Agents in Continuous Domains, by Delma Nieves-Rivera and Christopher Archibald
-
Summary of Edgenat: Transformer For Efficient Edge Detection, by Jinghuai Jie et al.
-
Summary of XCB: An Effective Contextual Biasing Approach to Bias Cross-lingual Phrases in Speech Recognition, by Xucheng Wan et al.
-
Summary of NutrifyAI: An AI-powered System For Real-time Food Detection, Nutritional Analysis, and Personalized Meal Recommendations, by Michelle Han et al.
-
Summary of AI-based IVR, by Gassyrbek Kosherbay et al.
-
Summary of Diff-PCC: Diffusion-based Neural Compression For 3D Point Clouds, by Kai Liu and Kang You and Pan Gao
-
Summary of Prompt-agnostic Adversarial Perturbation For Customized Diffusion Models, by Cong Wan et al.
-
Summary of Putting People in LLMs’ Shoes: Generating Better Answers Via Question Rewriter, by Junhao Chen and Bowen Wang and Zhouqiang Jiang and Yuta Nakashima
-
Summary of Hologram Reasoning For Solving Algebra Problems with Geometry Diagrams, by Litian Huang et al.
-
Summary of Breast Tumor Classification Based on Self-supervised Contrastive Learning From Ultrasound Videos, by Yunxin Tang et al.
-
Summary of MV-MOS: Multi-view Feature Fusion For 3D Moving Object Segmentation, by Jintao Cheng et al.
-
Summary of Muses: 3D-controllable Image Generation Via Multi-modal Agent Collaboration, by Yanbo Ding et al.
-
Summary of Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias Based on Bayesian Theory, by Yongxin Deng et al.
-
Summary of Novel Change Detection Framework in Remote Sensing Imagery Using Diffusion Models and Structural Similarity Index (SSIM), by Andrew Kiruluta et al.
-
Summary of WRIM-Net: Wide-ranging Information Mining Network For Visible-infrared Person Re-identification, by Yonggan Wu et al.
-
Summary of Generalizable Facial Expression Recognition, by Yuhang Zhang et al.
-
Summary of A Review Of Human-object Interaction Detection, by Yuxiao Wang et al.
-
Summary of Strategist: Learning Strategic Skills by LLMs Via Bi-level Tree Search, by Jonathan Light and Min Cai and Weiqin Chen and Guanzhi Wang and Xiusi Chen and Wei Cheng and Yisong Yue and Ziniu Hu
-
Summary of Fairness Under Cover: Evaluating the Impact Of Occlusions on Demographic Bias in Facial Recognition, by Rafael M. Mamede et al.
-
Summary of Target-dependent Multimodal Sentiment Analysis Via Employing Visual-to Emotional-caption Translation Network Using Visual-caption Pairs, by Ananya Pandey et al.
-
Summary of Vyang-net: a Novel Multi-modal Sarcasm Recognition Model by Uncovering Visual, Acoustic and Glossary Features, by Ananya Pandey et al.
-
Summary of Optical Music Recognition in Manuscripts From the Ricordi Archive, by Federico Simonetta et al.
-
Summary of A Disguised Wolf Is More Harmful Than a Toothless Tiger: Adaptive Malicious Code Injection Backdoor Attack Leveraging User Behavior As Triggers, by Shangxi Wu and Jitao Sang
-
Summary of LegalBench-RAG: a Benchmark For Retrieval-augmented Generation in the Legal Domain, by Nicholas Pipitone et al.
-
Summary of Hasper: An Image Repository For Hand Shadow Puppet Recognition, by Syed Rifat Raiyan et al.
-
Summary of Query Languages For Neural Networks, by Martin Grohe et al.
-
Summary of AI-driven Review Systems: Evaluating LLMs in Scalable and Bias-aware Academic Reviews, by Keith Tyser et al.
-
Summary of Evaluating Image-based Face and Eye Tracking with Event Cameras, by Khadija Iddrisu et al.
-
Summary of Development Of An AI Anti-bullying System Using Large Language Model Key Topic Detection, by Matthew Tassava et al.
-
Summary of Towards Automation Of Human Stage Of Decay Identification: An Artificial Intelligence Approach, by Anna-Maria Nau et al.
-
Summary of Webcam-based Pupil Diameter Prediction Benefits From Upscaling, by Vijul Shah et al.
-
Summary of Feasibility Of Assessing Cognitive Impairment Via Distributed Camera Network and Privacy-preserving Edge Computing, by Chaitra Hegde et al.
-
Summary of The Brittleness Of AI-generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks, by Niyar R Barman et al.
-
Summary of Idea: Enhancing the Rule Learning Ability Of Large Language Model Agent Through Induction, Deduction, and Abduction, by Kaiyu He et al.
-
Summary of MambaEVT: Event Stream Based Visual Object Tracking Using State Space Model, by Xiao Wang et al.