Paper List

We recommend you use the search box as this list is very long.

Summary of Toward Optimal Llm Alignments Using Two-player Games, by Rui Zheng et al.
Summary of 3d Gaze Tracking For Studying Collaborative Interactions in Mixed-reality Environments, by Eduardo Davalos et al.
Summary of Balancing Rigor and Utility: Mitigating Cognitive Biases in Large Language Models For Multiple-choice Questions, by Liman Wang et al.
Summary of Consistency-diversity-realism Pareto Fronts Of Conditional Image Generative Models, by Pietro Astolfi et al.
Summary of From Words to Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories with Communicative Llm Agent, by Samuel S. Sohn and Danrui Li and Sen Zhang and Che-jui Chang and Mubbasir Kapadia
Summary of Reactor Mk.1 Performances: Mmlu, Humaneval and Bbh Test Results, by Tj Dunham et al.
Summary of Unlocking Large Language Model’s Planning Capabilities with Maximum Diversity Fine-tuning, by Wenjun Li et al.
Summary of Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders For 3d Medical Image Segmentation, by Pengfei Gu et al.
Summary of Generating and Evolving Reward Functions For Highway Driving with Large Language Models, by Xu Han et al.
Summary of Qda-sql: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql, by Yinggang Sun et al.
Summary of Explain the Black Box For the Sake Of Science: the Scientific Method in the Era Of Generative Artificial Intelligence, by Gianmarco Mengaldo
Summary of Public Computer Vision Datasets For Precision Livestock Farming: a Systematic Survey, by Anil Bhujel et al.
Summary of Nerfdeformer: Nerf Transformation From a Single View Via 3d Scene Flows, by Zhenggang Tang et al.
Summary of Structext-eval: Evaluating Large Language Model’s Reasoning Ability in Structure-rich Text, by Zhouhong Gu et al.
Summary of Emerging Safety Attack and Defense in Federated Instruction Tuning Of Large Language Models, by Rui Ye et al.
Summary of Automating Pharmacovigilance Evidence Generation: Using Large Language Models to Produce Context-aware Sql, by Jeffery L. Painter et al.
Summary of Applications Of Generative Ai in Healthcare: Algorithmic, Ethical, Legal and Societal Considerations, by Onyekachukwu R. Okonji et al.
Summary of Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models on the Text2cypher Task, by Ziije Zhong et al.
Summary of Object Detection Using Oriented Window Learning Vision Transformer: Roadway Assets Recognition, by Taqwa Alhadidi et al.
Summary of Quantifying Generative Media Bias with a Corpus Of Real-world and Generated News Articles, by Filip Trhlik and Pontus Stenetorp
Summary of Sharelora: Parameter Efficient and Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation, by Yurun Song et al.
Summary of Exploring the Zero-shot Capabilities Of Llms Handling Multiple Problems at Once, by Zhengxiang Wang et al.
Summary of Hiddentables & Pyqtax: a Cooperative Game and Dataset For Tableqa to Ensure Scale and Data Privacy Across a Myriad Of Taxonomies, by William Watson et al.
Summary of A Reality Check Of the Benefits Of Llm in Business, by Ming Cheung
Summary of On the Worst Prompt Performance Of Large Language Models, by Bowen Cao et al.
Summary of The Impact Of Quantization on Retrieval-augmented Generation: An Analysis Of Small Llms, by Mert Yazan et al.
Summary of Improving Language Models For Emotion Analysis: Insights From Cognitive Science, by Constant Bonard (unibe) et al.
Summary of Foodsky: a Food-oriented Large Language Model That Passes the Chef and Dietetic Examination, by Pengfei Zhou et al.
Summary of Unused Information in Token Probability Distribution Of Generative Llm: Improving Llm Reading Comprehension Through Calculation Of Expected Values, by Krystian Zawistowski
Summary of Autograding Mathematical Induction Proofs with Natural Language Processing, by Chenyan Zhao et al.
Summary of Beyond Words: on Large Language Models Actionability in Mission-critical Risk Analysis, by Matteo Esposito et al.
Summary of Prompt-based Length Controlled Generation with Multiple Control Types, by Renlong Jie et al.
Summary of Veract Scan: Retrieval-augmented Fake News Detection with Justifiable Reasoning, by Cheng Niu et al.
Summary of Clst: Cold-start Mitigation in Knowledge Tracing by Aligning a Generative Language Model As a Students’ Knowledge Tracer, by Heeseok Jung et al.
Summary of Researcharena: Benchmarking Large Language Models’ Ability to Collect and Organize Information As Research Agents, by Hao Kang et al.
Summary of Sememelm: a Sememe Knowledge Enhanced Method For Long-tail Relation Representation, by Shuyi Li and Shaojuan Wu and Xiaowang Zhang and Zhiyong Feng
Summary of What Is the Best Model? Application-driven Evaluation For Large Language Models, by Shiguo Lian et al.
Summary of A Survey on Large Language Models From General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations, by Jinqiang Wang et al.
Summary of Teg-db: a Comprehensive Dataset and Benchmark Of Textual-edge Graphs, by Zhuofeng Li et al.
Summary of Chisafetybench: a Chinese Hierarchical Safety Benchmark For Large Language Models, by Wenjing Zhang et al.
Summary of Creating a Lens Of Chinese Culture: a Multimodal Dataset For Chinese Pun Rebus Art Understanding, by Tuo Zhang et al.
Summary of Efficient Prompting For Llm-based Generative Internet Of Things, by Bin Xiao et al.
Summary of What Is the Visual Cognition Gap Between Humans and Multimodal Llms?, by Xu Cao et al.
Summary of Localizing Events in Videos with Multimodal Queries, by Gengyuan Zhang and Mang Ling Ada Fok and Jialu Ma and Yan Xia and Daniel Cremers and Philip Torr and Volker Tresp and Jindong Gu
Summary of Details Make a Difference: Object State-sensitive Neurorobotic Task Planning, by Xiaowen Sun et al.
Summary of Tilt and Average: Geometric Adjustment Of the Last Layer For Recalibration, by Gyusang Cho and Chan-hyun Youn
Summary of First Multi-dimensional Evaluation Of Flowchart Comprehension For Multimodal Large Language Models, by Enming Zhang et al.
Summary of Fzi-wim at Semeval-2024 Task 2: Self-consistent Cot For Complex Nli in Biomedical Domain, by Jin Liu and Steffen Thoma
Summary of Skysensegpt: a Fine-grained Instruction Tuning Dataset and Model For Remote Sensing Vision-language Understanding, by Junwei Luo et al.
Summary of Exploration by Learning Diverse Skills Through Successor State Measures, by Paul-antoine Le Tolguenec et al.
Summary of Improving Rule Mining Via Embedding-based Link Prediction, by N’dah Jean Kouagou et al.
Summary of Babilong: Testing the Limits Of Llms with Long Context Reasoning-in-a-haystack, by Yuri Kuratov et al.
Summary of Sycophancy to Subterfuge: Investigating Reward-tampering in Large Language Models, by Carson Denison et al.
Summary of Trip-pal: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners, by Tomas De La Rosa et al.
Summary of Meshanything: Artist-created Mesh Generation with Autoregressive Transformers, by Yiwen Chen et al.
Summary of Sstfb: Leveraging Self-supervised Pretext Learning and Temporal Self-attention with Feature Branching For Real-time Video Polyp Segmentation, by Ziang Xu et al.
Summary of Make It Count: Text-to-image Generation with An Accurate Number Of Objects, by Lital Binyamin et al.
Summary of Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms, by Rui Yang et al.
Summary of Long Story Short: Story-level Video Understanding From 20k Short Films, by Ridouane Ghermi et al.
Summary of Videogui: a Benchmark For Gui Automation From Instructional Videos, by Kevin Qinghong Lin et al.
Summary of Vega: Learning Interleaved Image-text Comprehension in Vision-language Large Models, by Chenyu Zhou et al.
Summary of Object Criticality For Safer Navigation, by Andrea Ceccarelli et al.
Summary of Qcqa: Quality and Capacity-aware Grouped Query Attention, by Vinay Joshi et al.
Summary of Analyzing Gender Polarity in Short Social Media Texts with Bert: the Role Of Emojis and Emoticons, by Saba Yousefian Jazi et al.
Summary of Speech Reallm — Real-time Streaming Speech Recognition with Multimodal Llms by Teaching the Flow Of Time, by Frank Seide et al.
Summary of Multi-modal Retrieval For Large Language Model Based Speech Recognition, by Jari Kolehmainen et al.
Summary of Dsl-fiqa: Assessing Facial Image Quality Via Dual-set Degradation Learning and Landmark-guided Transformer, by Wei-ting Chen and Gurunandan Krishnan and Qiang Gao and Sy-yen Kuo and Sizhuo Ma and Jian Wang
Summary of Robustsam: Segment Anything Robustly on Degraded Images, by Wei-ting Chen and Yu-jiet Vong and Sy-yen Kuo and Sizhuo Ma and Jian Wang
Summary of Learning Language Structures Through Grounding, by Freda Shi
Summary of A Survey Of Video Datasets For Grounded Event Understanding, by Kate Sanders and Benjamin Van Durme
Summary of Fine-grained Urban Flow Inference with Multi-scale Representation Learning, by Shilu Yuan et al.
Summary of Self-knowledge Distillation For Learning Ambiguity, by Hancheol Park et al.
Summary of Controlvar: Exploring Controllable Visual Autoregressive Modeling, by Xiang Li et al.
Summary of Mix Q-learning For Lane Changing: a Collaborative Decision-making Method in Multi-agent Deep Reinforcement Learning, by Xiaojun Bi et al.
Summary of Research on Edge Detection Of Lidar Images Based on Artificial Intelligence Technology, by Haowei Yang et al.
Summary of Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments, by Zhenrui Yue et al.
Summary of Ospc: Detecting Harmful Memes with Large Language Model As a Catalyst, by Jingtao Cao et al.
Summary of From Manifestations to Cognitive Architectures: a Scalable Framework, by Alfredo Ibias et al.
Summary of Vision-language Models Meet Meteorology: Developing Models For Extreme Weather Events Detection with Heatmaps, by Jian Chen et al.
Summary of Shmamba: Structured Hyperbolic State Space Model For Audio-visual Question Answering, by Zhe Yang et al.
Summary of Knowledge Editing in Language Models Via Adapted Direct Preference Optimization, by Amit Rozner et al.
Summary of Hiro: Hierarchical Information Retrieval Optimization, by Krish Goel et al.
Summary of Experiments in News Bias Detection with Pre-trained Neural Transformers, by Tim Menzner et al.
Summary of A Large-scale Universal Evaluation Benchmark For Face Forgery Detection, by Yijun Bei et al.
Summary of Applying Multi-agent Negotiation to Solve the Production Routing Problem with Privacy Preserving, by Luiza Pellin Biasoto et al.
Summary of Readctrl: Personalizing Text Generation with Readability-controlled Instruction Learning, by Hieu Tran et al.
Summary of Towards a Characterisation Of Monte-carlo Tree Search Performance in Different Games, by Dennis J.n.j. Soemers et al.
Summary of Deep Transformer Network For Monocular Pose Estimation Of Ship-based Uav, by Maneesha Wickramasuriya et al.
Summary of Action2sound: Ambient-aware Generation Of Action Sounds From Egocentric Videos, by Changan Chen et al.
Summary of Parameter-efficient Active Learning For Foundational Models, by Athmanarayanan Lakshmi Narayanan et al.
Summary of Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms, by Miaosen Zhang et al.
Summary of Star: a First-ever Dataset and a Large-scale Benchmark For Scene Graph Generation in Large-size Satellite Imagery, by Yansheng Li et al.
Summary of Mmscan: a Multi-modal 3d Scene Dataset with Hierarchical Grounded Language Annotations, by Ruiyuan Lyu et al.
Summary of Muirbench: a Comprehensive Benchmark For Robust Multi-image Understanding, by Fei Wang et al.
Summary of Pandora: Towards General World Model with Natural Language Actions and Video States, by Jiannan Xiang et al.
Summary of Advancing High Resolution Vision-language Models in Biomedicine, by Zekai Chen and Arda Pekis and Kevin Brown
Summary of Updating Clip to Prefer Descriptions Over Captions, by Amir Zur et al.
Summary of Svitt-ego: a Sparse Video-text Transformer For Egocentric Video, by Hector A. Valdez and Kyle Min and Subarna Tripathi
Summary of Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms in Cognitive Science?, by Desmond C. Ong
Summary of Talking Heads: Understanding Inter-layer Communication in Transformer Language Models, by Jack Merullo et al.