Paper List
We recommend you use the search box as this list is very long.
-
Summary of Data Collection Of Real-life Knowledge Work in Context: the Rlkwic Dataset, by Mahta Bakhshizadeh et al.
-
Summary of Videosage: Video Summarization with Graph Representation Learning, by Jose M. Rojas Chaves et al.
-
Summary of Into the Fog: Evaluating Robustness Of Multiple Object Tracking, by Nadezda Kirillova et al.
-
Summary of Are Large Language Models Reliable Argument Quality Annotators?, by Nailia Mirzakhmedova et al.
-
Summary of Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency, by Yuchen Shi et al.
-
Summary of Video2game: Real-time, Interactive, Realistic and Browser-compatible Environment From a Single Video, by Hongchi Xia et al.
-
Summary of Empowering Embodied Visual Tracking with Visual Foundation Models and Offline Rl, by Fangwei Zhong et al.
-
Summary of Synergising Human-like Responses and Machine Intelligence For Planning in Disaster Response, by Savvas Papaioannou et al.
-
Summary of Zero-shot Building Age Classification From Facade Image Using Gpt-4, by Zichao Zeng et al.
-
Summary of Zero-shot Detection Of Buildings in Mobile Lidar Using Language Vision Model, by June Moh Goo et al.
-
Summary of Evolving Interpretable Visual Classifiers with Large Language Models, by Mia Chiquier et al.
-
Summary of A Survey on Deep Learning For Theorem Proving, by Zhaoyu Li et al.
-
Summary of Hq-edit: a High-quality Dataset For Instruction-based Image Editing, by Mude Hui et al.
-
Summary of Mmina: Benchmarking Multihop Multimodal Internet Agents, by Ziniu Zhang et al.
-
Summary of Vision Augmentation Prediction Autoencoder with Attention Design (vapaad), by Yiqiao Yin
-
Summary of Aigen: An Adversarial Approach For Instruction Generation in Vln, by Niyati Rawal et al.
-
Summary of Chinchilla Scaling: a Replication Attempt, by Tamay Besiroglu et al.
-
Summary of Reinforcement Learning From Multi-role Debates As Feedback For Bias Mitigation in Llms, by Ruoxi Cheng et al.
-
Summary of High-resolution Detection Of Earth Structural Heterogeneities From Seismic Amplitudes Using Convolutional Neural Networks with Attention Layers, by Luiz Schirmer et al.
-
Summary of Clasheval: Quantifying the Tug-of-war Between An Llm’s Internal Prior and External Evidence, by Kevin Wu and Eric Wu and James Zou
-
Summary of Culture-gen: Revealing Global Cultural Perception in Language Models Through Natural Language Prompting, by Huihan Li et al.
-
Summary of Tel’m: Test and Evaluation Of Language Models, by George Cybenko et al.
-
Summary of Compressible and Searchable: Ai-native Multi-modal Retrieval System with Learned Image Compression, by Jixiang Luo
-
Summary of Task-driven Exploration: Decoupling and Inter-task Feedback For Joint Moment Retrieval and Highlight Detection, by Jin Yang et al.
-
Summary of Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms, by Tristan Cazenave
-
Summary of Owloop: Interfaces For Mapping Owl Axioms Into Oop Hierarchies, by Luca Buoncompagni and Fulvio Mastrogiovanni
-
Summary of Self-selected Attention Span For Accelerating Large Language Model Inference, by Tian Jin et al.
-
Summary of Understanding the Role Of Temperature in Diverse Question Generation by Gpt-4, By Arav Agarwal et al.
-
Summary of Watermark-embedded Adversarial Examples For Copyright Protection Against Diffusion Models, by Peifei Zhu et al.
-
Summary of Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation, by Yichi Zhang et al.
-
Summary of Improving Weakly-supervised Object Localization Using Adversarial Erasing and Pseudo Label, by Byeongkeun Kang and Sinhae Cha and Yeejin Lee
-
Summary of Mitigating Hallucination in Abstractive Summarization with Domain-conditional Mutual Information, by Kyubyung Chae et al.
-
Summary of Large Language Models and Linguistic Intentionality, by Jumbly Grindrod
-
Summary of Ranlaynet: a Dataset For Document Layout Detection Used For Domain Adaptation and Generalization, by Avinash Anand et al.
-
Summary of Transformers, Contextualism, and Polysemy, by Jumbly Grindrod
-
Summary of Modelling Language, by Jumbly Grindrod
-
Summary of Explainable Generative Ai (genxai): a Survey, Conceptualization, and Research Agenda, by Johannes Schneider
-
Summary of Action Model Learning with Guarantees, by Diego Aineto et al.
-
Summary of Uniaa: a Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark, by Zhaokun Zhou et al.
-
Summary of Multi-news+: Cost-efficient Dataset Cleansing Via Llm-based Data Annotation, by Juhwan Choi et al.
-
Summary of Kg-ctg: Citation Generation Through Knowledge Graph-guided Large Language Models, by Avinash Anand et al.
-
Summary of 3d Face Tracking From 2d Video Through Iterative Dense Uv to Image Flow, by Felix Taubner et al.
-
Summary of Bridging Data Islands: Geographic Heterogeneity-aware Federated Learning For Collaborative Remote Sensing Semantic Segmentation, by Jieyi Tan et al.
-
Summary of Linear Cross-document Event Coreference Resolution with X-amr, by Shafiuddin Rehan Ahmed et al.
-
Summary of Effects Of Different Prompts on the Quality Of Gpt-4 Responses to Dementia Care Questions, by Zhuochun Li et al.
-
Summary of Is English the New Programming Language? How About Pseudo-code Engineering?, by Gian Alexandre Michaelsen et al.
-
Summary of Enhancing Question Answering For Enterprise Knowledge Bases Using Large Language Models, by Feihu Jiang and Chuan Qin and Kaichun Yao and Chuyu Fang and Fuzhen Zhuang and Hengshu Zhu and Hui Xiong
-
Summary of Dyknow: Dynamically Verifying Time-sensitive Factual Knowledge in Llms, by Seyed Mahed Mousavi et al.
-
Summary of Mm-phyqa: Multimodal Physics Question-answering with Multi-image Cot Prompting, by Avinash Anand et al.
-
Summary of Game Generation Via Large Language Models, by Chengpeng Hu et al.
-
Summary of The Generation Gap: Exploring Age Bias in the Value Systems Of Large Language Models, by Siyang Liu et al.
-
Summary of A Lightweight Spatiotemporal Network For Online Eye Tracking with Event Camera, by Yan Ru Pei et al.
-
Summary of A Fourier-enhanced Multi-modal 3d Small Object Optical Mark Recognition and Positioning Method For Percutaneous Abdominal Puncture Surgical Navigation, by Zezhao Guo (1) et al.
-
Summary of Rethinking Iterative Stereo Matching From Diffusion Bridge Model Perspective, by Yuguang Shi
-
Summary of Exploring Explainability in Video Action Recognition, by Avinab Saha et al.
-
Summary of Toner: Type-oriented Named Entity Recognition with Generative Language Model, by Guochao Jiang et al.
-
Summary of Fusion-mamba For Cross-modality Object Detection, by Wenhao Dong et al.
-
Summary of Gemquad : Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning, by Amani Namboori et al.
-
Summary of Loopanimate: Loopable Salient Object Animation, by Fanyi Wang et al.
-
Summary of Fedccl: Federated Dual-clustered Feature Contrast Under Domain Heterogeneity, by Yu Qiao et al.
-
Summary of Texthawk: Exploring Efficient Fine-grained Perception Of Multimodal Large Language Models, by Ya-qi Yu et al.
-
Summary of Ai-guided Feature Segmentation Techniques to Model Features From Single Crystal Diamond Growth, by Rohan Reddy Mekala et al.
-
Summary of Data-augmentation-based Dialectal Adaptation For Llms, by Fahim Faisal and Antonios Anastasopoulos
-
Summary of Rethinking Artistic Copyright Infringements in the Era Of Text-to-image Generative Models, by Mazda Moayeri et al.
-
Summary of S3editor: a Sparse Semantic-disentangled Self-training Framework For Face Video Editing, by Guangzhi Wang et al.
-
Summary of Ifvit: Interpretable Fixed-length Representation For Fingerprint Matching Via Vision Transformer, by Yuhang Qiu et al.
-
Summary of Pretraining and Updates Of Domain-specific Llm: a Case Study in the Japanese Business Domain, by Kosuke Takahashi et al.
-
Summary of A Survey Of Neural Network Robustness Assessment in Image Recognition, by Jie Wang et al.
-
Summary of The Integration Of Semantic and Structural Knowledge in Knowledge Graph Entity Typing, by Muzhi Li et al.
-
Summary of Improving Health Question Answering with Reliable and Time-aware Evidence Retrieval, by Juraj Vladika et al.
-
Summary of Look at the Text: Instruction-tuned Language Models Are More Robust Multiple Choice Selectors Than You Think, by Xinpeng Wang et al.
-
Summary of Mitigating Language-level Performance Disparity in Mplms Via Teacher Language Selection and Cross-lingual Self-distillation, by Haozhe Zhao et al.
-
Summary of Leveraging Multi-ai Agents For Cross-domain Knowledge Discovery, by Shiva Aryal et al.
-
Summary of Memory Traces: Are Transformers Tulving Machines?, by Jean-marie Chauvet
-
Summary of Analyzing Decades-long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning, by Girmaw Abebe Tadesse et al.
-
Summary of Idd-x: a Multi-view Dataset For Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic, by Chirag Parikh et al.
-
Summary of Catp: Cross-attention Token Pruning For Accuracy Preserved Multimodal Model Inference, by Ruqi Liao et al.
-
Summary of Vision-aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding, by Hai Nguyen-truong et al.
-
Summary of Fashionfail: Addressing Failure Cases in Fashion Object Detection and Segmentation, by Riza Velioglu et al.
-
Summary of Automatic Quantification Of Serial Pet/ct Images For Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-aware Segmentation Network, by Xin Tie et al.
-
Summary of Optimal Path For Biomedical Text Summarization Using Pointer Gpt, by Hyunkyung Han et al.
-
Summary of Mitigating Object Dependencies: Improving Point Cloud Self-supervised Learning Through Object Exchange, by Yanhao Wu et al.
-
Summary of Cat: Contrastive Adapter Training For Personalized Image Generation, by Jae Wan Park et al.
-
Summary of From Words to Numbers: Your Large Language Model Is Secretly a Capable Regressor When Given In-context Examples, by Robert Vacareanu et al.
-
Summary of Contrastive-based Deep Embeddings For Label Noise-resilient Histopathology Image Classification, by Lucas Dedieu et al.
-
Summary of Finding Dino: a Plug-and-play Framework For Zero-shot Detection Of Out-of-distribution Objects Using Prototypes, by Poulami Sinhamahapatra et al.
-
Summary of Oda: Observation-driven Agent For Integrating Llms and Knowledge Graphs, by Lei Sun et al.
-
Summary of Model-based Cleaning Of the Quilt-1m Pathology Dataset For Text-conditional Image Synthesis, by Marc Aubreville et al.
-
Summary of Run-time Monitoring Of 3d Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns, by Hakan Yekta Yatbaz et al.
-
Summary of Depth Estimation Using Weighted-loss and Transfer Learning, by Muhammad Adeel Hafeez et al.
-
Summary of Reframing the Mind-body Picture: Applying Formal Systems to the Relationship Of Mind and Matter, by Ryan Williams
-
Summary of Aug: a New Dataset and An Efficient Model For Aerial Image Urban Scene Graph Generation, by Yansheng Li et al.
-
Summary of Mindbridge: a Cross-subject Brain Decoding Framework, by Shizun Wang et al.
-
Summary of Guiding Large Language Models to Post-edit Machine Translation with Error Annotations, by Dayeon Ki et al.
-
Summary of High-dimension Human Value Representation in Large Language Models, by Samuel Cahyawijaya et al.
-
Summary of Designqa: a Multimodal Benchmark For Evaluating Large Language Models’ Understanding Of Engineering Documentation, by Anna C. Doris et al.
-
Summary of Parameter Hierarchical Optimization For Visible-infrared Person Re-identification, by Zeng Yu and Yunxiao Shi
-
Summary of Content Knowledge Identification with Multi-agent Large Language Models (llms), by Kaiqi Yang et al.