Paper List
-
Summary of Oneactor: Consistent Character Generation Via Cluster-conditioned Guidance, by Jiahao Wang et al.
-
Summary of Future Language Modeling From Temporal Document History, by Changmao Li and Jeffrey Flanigan
-
Summary of Learnable Prompt For Few-shot Semantic Segmentation in Remote Sensing Domain, by Steve Andreas Immanuel et al.
-
Summary of Towards Complex Ontology Alignment Using Large Language Models, by Reihaneh Amini et al.
-
Summary of Exploring the Role Of Token in Transformer-based Time Series Forecasting, by Jianqi Zhang et al.
-
Summary of Prescribing the Right Remedy: Mitigating Hallucinations in Large Vision-language Models Via Targeted Instruction Tuning, by Rui Hu et al.
-
Summary of Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model For Domain Question Answering, by Yuqi Wang et al.
-
Summary of Cnn-based Explanation Ensembling For Dataset, Representation and Explanations Evaluation, by Weronika Hryniewska-Guzik et al.
-
Summary of Disentangling Instructive Information From Ranked Multiple Candidates For Multi-document Scientific Summarization, by Pancheng Wang et al.
-
Summary of Meel: Multi-modal Event Evolution Learning, by Zhengwei Tao et al.
-
Summary of Explainable Generative Ai (genxai): a Survey, Conceptualization, and Research Agenda, by Johannes Schneider
-
Summary of Transformers, Contextualism, and Polysemy, by Jumbly Grindrod
-
Summary of Large Language Models and Linguistic Intentionality, by Jumbly Grindrod
-
Summary of Modelling Language, by Jumbly Grindrod
-
Summary of Uniaa: a Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark, by Zhaokun Zhou et al.
-
Summary of Action Model Learning with Guarantees, by Diego Aineto et al.
-
Summary of Multi-news+: Cost-efficient Dataset Cleansing Via Llm-based Data Annotation, by Juhwan Choi et al.
-
Summary of Are Large Language Models Reliable Argument Quality Annotators?, by Nailia Mirzakhmedova et al.
-
Summary of Kg-ctg: Citation Generation Through Knowledge Graph-guided Large Language Models, by Avinash Anand et al.
-
Summary of 3d Face Tracking From 2d Video Through Iterative Dense Uv to Image Flow, by Felix Taubner et al.
-
Summary of Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency, by Yuchen Shi et al.
-
Summary of Video2game: Real-time, Interactive, Realistic and Browser-compatible Environment From a Single Video, by Hongchi Xia et al.
-
Summary of Empowering Embodied Visual Tracking with Visual Foundation Models and Offline Rl, by Fangwei Zhong et al.
-
Summary of Synergising Human-like Responses and Machine Intelligence For Planning in Disaster Response, by Savvas Papaioannou et al.
-
Summary of Zero-shot Building Age Classification From Facade Image Using Gpt-4, by Zichao Zeng et al.
-
Summary of Zero-shot Detection Of Buildings in Mobile Lidar Using Language Vision Model, by June Moh Goo et al.
-
Summary of A Survey on Deep Learning For Theorem Proving, by Zhaoyu Li et al.
-
Summary of Evolving Interpretable Visual Classifiers with Large Language Models, by Mia Chiquier et al.
-
Summary of Hq-edit: a High-quality Dataset For Instruction-based Image Editing, by Mude Hui et al.
-
Summary of Mmina: Benchmarking Multihop Multimodal Internet Agents, by Ziniu Zhang et al.
-
Summary of Rethinking Iterative Stereo Matching From Diffusion Bridge Model Perspective, by Yuguang Shi
-
Summary of Toner: Type-oriented Named Entity Recognition with Generative Language Model, by Guochao Jiang et al.
-
Summary of Fusion-mamba For Cross-modality Object Detection, by Wenhao Dong et al.
-
Summary of Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning, by Amani Namboori et al.
-
Summary of Texthawk: Exploring Efficient Fine-grained Perception Of Multimodal Large Language Models, by Ya-qi Yu et al.
-
Summary of Loopanimate: Loopable Salient Object Animation, by Fanyi Wang et al.
-
Summary of Fedccl: Federated Dual-clustered Feature Contrast Under Domain Heterogeneity, by Yu Qiao et al.
-
Summary of Task-driven Exploration: Decoupling and Inter-task Feedback For Joint Moment Retrieval and Highlight Detection, by Jin Yang et al.
-
Summary of Bridging Data Islands: Geographic Heterogeneity-aware Federated Learning For Collaborative Remote Sensing Semantic Segmentation, by Jieyi Tan et al.
-
Summary of Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms, by Tristan Cazenave
-
Summary of Owloop: Interfaces For Mapping Owl Axioms Into Oop Hierarchies, by Luca Buoncompagni and Fulvio Mastrogiovanni
-
Summary of Self-selected Attention Span For Accelerating Large Language Model Inference, by Tian Jin et al.
-
Summary of Understanding the Role Of Temperature in Diverse Question Generation by Gpt-4, by Arav Agarwal et al.
-
Summary of Watermark-embedded Adversarial Examples For Copyright Protection Against Diffusion Models, by Peifei Zhu et al.
-
Summary of Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation, by Yichi Zhang et al.
-
Summary of Improving Weakly-supervised Object Localization Using Adversarial Erasing and Pseudo Label, by Byeongkeun Kang and Sinhae Cha and Yeejin Lee
-
Summary of Mitigating Hallucination in Abstractive Summarization with Domain-conditional Mutual Information, by Kyubyung Chae et al.
-
Summary of Ranlaynet: a Dataset For Document Layout Detection Used For Domain Adaptation and Generalization, by Avinash Anand et al.
-
Summary of Leveraging Multi-ai Agents For Cross-domain Knowledge Discovery, by Shiva Aryal et al.
-
Summary of Memory Traces: Are Transformers Tulving Machines?, by Jean-Marie Chauvet
-
Summary of Analyzing Decades-long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning, by Girmaw Abebe Tadesse et al.
-
Summary of Catp: Cross-attention Token Pruning For Accuracy Preserved Multimodal Model Inference, by Ruqi Liao et al.
-
Summary of Idd-x: a Multi-view Dataset For Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic, by Chirag Parikh et al.
-
Summary of Vision-aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding, by Hai Nguyen-Truong et al.
-
Summary of Fashionfail: Addressing Failure Cases in Fashion Object Detection and Segmentation, by Riza Velioglu et al.
-
Summary of Automatic Quantification Of Serial Pet/ct Images For Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-aware Segmentation Network, by Xin Tie et al.
-
Summary of Optimal Path For Biomedical Text Summarization Using Pointer Gpt, by Hyunkyung Han et al.
-
Summary of Effects Of Different Prompts on the Quality Of Gpt-4 Responses to Dementia Care Questions, by Zhuochun Li et al.
-
Summary of Linear Cross-document Event Coreference Resolution with X-amr, by Shafiuddin Rehan Ahmed et al.
-
Summary of Is English the New Programming Language? How About Pseudo-code Engineering?, by Gian Alexandre Michaelsen et al.
-
Summary of Enhancing Question Answering For Enterprise Knowledge Bases Using Large Language Models, by Feihu Jiang and Chuan Qin and Kaichun Yao and Chuyu Fang and Fuzhen Zhuang and Hengshu Zhu and Hui Xiong
-
Summary of Dyknow: Dynamically Verifying Time-sensitive Factual Knowledge in Llms, by Seyed Mahed Mousavi et al.
-
Summary of The Generation Gap: Exploring Age Bias in the Value Systems Of Large Language Models, by Siyang Liu et al.
-
Summary of Mm-phyqa: Multimodal Physics Question-answering with Multi-image Cot Prompting, by Avinash Anand et al.
-
Summary of Game Generation Via Large Language Models, by Chengpeng Hu et al.
-
Summary of A Lightweight Spatiotemporal Network For Online Eye Tracking with Event Camera, by Yan Ru Pei et al.
-
Summary of A Fourier-enhanced Multi-modal 3d Small Object Optical Mark Recognition and Positioning Method For Percutaneous Abdominal Puncture Surgical Navigation, by Zezhao Guo et al.
-
Summary of Exploring Explainability in Video Action Recognition, by Avinab Saha et al.
-
Summary of Guiding Large Language Models to Post-edit Machine Translation with Error Annotations, by Dayeon Ki et al.
-
Summary of Mindbridge: a Cross-subject Brain Decoding Framework, by Shizun Wang et al.
-
Summary of High-dimension Human Value Representation in Large Language Models, by Samuel Cahyawijaya et al.
-
Summary of Designqa: a Multimodal Benchmark For Evaluating Large Language Models’ Understanding Of Engineering Documentation, by Anna C. Doris et al.
-
Summary of Parameter Hierarchical Optimization For Visible-infrared Person Re-identification, by Zeng Yu and Yunxiao Shi
-
Summary of Content Knowledge Identification with Multi-agent Large Language Models (llms), by Kaiqi Yang et al.
-
Summary of Osworld: Benchmarking Multimodal Agents For Open-ended Tasks in Real Computer Environments, by Tianbao Xie et al.
-
Summary of Rho-1: Not All Tokens Are What You Need, by Zhenghao Lin et al.
-
Summary of Self-supervised Dataset Distillation: a Good Compression Is All You Need, by Muxin Zhou and Zeyuan Yin and Shitong Shao and Zhiqiang Shen
-
Summary of Rethinking Artistic Copyright Infringements in the Era Of Text-to-image Generative Models, by Mazda Moayeri et al.
-
Summary of Ai-guided Feature Segmentation Techniques to Model Features From Single Crystal Diamond Growth, by Rohan Reddy Mekala et al.
-
Summary of Data-augmentation-based Dialectal Adaptation For Llms, by Fahim Faisal and Antonios Anastasopoulos
-
Summary of S3editor: a Sparse Semantic-disentangled Self-training Framework For Face Video Editing, by Guangzhi Wang et al.
-
Summary of Pretraining and Updates Of Domain-specific Llm: a Case Study in the Japanese Business Domain, by Kosuke Takahashi et al.
-
Summary of Ifvit: Interpretable Fixed-length Representation For Fingerprint Matching Via Vision Transformer, by Yuhang Qiu et al.
-
Summary of A Survey Of Neural Network Robustness Assessment in Image Recognition, by Jie Wang et al.
-
Summary of Improving Health Question Answering with Reliable and Time-aware Evidence Retrieval, by Juraj Vladika et al.
-
Summary of The Integration Of Semantic and Structural Knowledge in Knowledge Graph Entity Typing, by Muzhi Li et al.
-
Summary of Look at the Text: Instruction-tuned Language Models Are More Robust Multiple Choice Selectors Than You Think, by Xinpeng Wang et al.
-
Summary of Mitigating Language-level Performance Disparity in Mplms Via Teacher Language Selection and Cross-lingual Self-distillation, by Haozhe Zhao et al.
-
Summary of Exploring the Frontier Of Vision-language Models: a Survey Of Current Methodologies and Future Directions, by Akash Ghosh et al.
-
Summary of Is Complexity An Illusion?, by Michael Timothy Bennett
-
Summary of Personality-affected Emotion Generation in Dialog Systems, by Zhiyuan Wen et al.
-
Summary of Ai-guided Defect Detection Techniques to Model Single Crystal Diamond Growth, by Rohan Reddy Mekala et al.
-
Summary of Learn From Failure: Fine-tuning Llms with Trial-and-error Data For Intuitionistic Propositional Logic Proving, by Chenyang An et al.
-
Summary of Jetmoe: Reaching Llama2 Performance with 0.1m Dollars, by Yikang Shen et al.
-
Summary of Behavior Trees Enable Structured Programming Of Language Model Agents, by Richard Kelley
-
Summary of Wese: Weak Exploration to Strong Exploitation For Llm Agents, by Xu Huang et al.
-
Summary of An Audit on the Perspectives and Challenges Of Hallucinations in Nlp, by Pranav Narayanan Venkit et al.
-
Summary of Contrastive-based Deep Embeddings For Label Noise-resilient Histopathology Image Classification, by Lucas Dedieu et al.