Paper List
We recommend you use the search box as this list is very long.
-
Summary of Exploring the Zero-shot Capabilities Of Llms Handling Multiple Problems at Once, by Zhengxiang Wang et al.
-
Summary of Kgpa: Robustness Evaluation For Large Language Models Via Cross-domain Knowledge Graphs, by Aihua Pei et al.
-
Summary of Hiddentables & Pyqtax: a Cooperative Game and Dataset For Tableqa to Ensure Scale and Data Privacy Across a Myriad Of Taxonomies, by William Watson et al.
-
Summary of Ptt5-v2: a Closer Look at Continued Pretraining Of T5 Models For the Portuguese Language, by Marcos Piau et al.
-
Summary of Post-hoc Utterance Refining Method by Entity Mining For Faithful Knowledge Grounded Conversations, by Yoonna Jang et al.
-
Summary of Llmfactor: Extracting Profitable Factors Through Prompts For Explainable Stock Movement Prediction, by Meiyun Wang et al.
-
Summary of Gui-world: a Dataset For Gui-oriented Multimodal Llm-based Agents, by Dongping Chen et al.
-
Summary of Algorithm Selection For Optimal Multi-agent Path Finding Via Graph Embedding, by Carmel Shabalin et al.
-
Summary of Large Language Models For Automatic Milestone Detection in Group Discussions, by Zhuoxu Duan et al.
-
Summary of Torchopera: a Compound Ai System For Llm Safety, by Shanshan Han et al.
-
Summary of Ig2: Integrated Gradient on Iterative Gradient Path For Feature Attribution, by Yue Zhuo et al.
-
Summary of Alps: An Auto-labeling and Pre-training Scheme For Remote Sensing Segmentation with Segment Anything Model, by Song Zhang et al.
-
Summary of Researcharena: Benchmarking Large Language Models’ Ability to Collect and Organize Information As Research Agents, by Hao Kang et al.
-
Summary of Clst: Cold-start Mitigation in Knowledge Tracing by Aligning a Generative Language Model As a Students’ Knowledge Tracer, by Heeseok Jung et al.
-
Summary of What Is the Best Model? Application-driven Evaluation For Large Language Models, by Shiguo Lian et al.
-
Summary of Sememelm: a Sememe Knowledge Enhanced Method For Long-tail Relation Representation, by Shuyi Li and Shaojuan Wu and Xiaowang Zhang and Zhiyong Feng
-
Summary of A Survey on Large Language Models From General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations, by Jinqiang Wang et al.
-
Summary of Teg-db: a Comprehensive Dataset and Benchmark Of Textual-edge Graphs, by Zhuofeng Li et al.
-
Summary of Creating a Lens Of Chinese Culture: a Multimodal Dataset For Chinese Pun Rebus Art Understanding, by Tuo Zhang et al.
-
Summary of Chisafetybench: a Chinese Hierarchical Safety Benchmark For Large Language Models, by Wenjing Zhang et al.
-
Summary of What Is the Visual Cognition Gap Between Humans and Multimodal Llms?, by Xu Cao et al.
-
Summary of Efficient Prompting For Llm-based Generative Internet Of Things, by Bin Xiao et al.
-
Summary of Consistency-diversity-realism Pareto Fronts Of Conditional Image Generative Models, by Pietro Astolfi et al.
-
Summary of From Words to Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories with Communicative Llm Agent, by Samuel S. Sohn and Danrui Li and Sen Zhang and Che-jui Chang and Mubbasir Kapadia
-
Summary of Unlocking Large Language Model’s Planning Capabilities with Maximum Diversity Fine-tuning, by Wenjun Li et al.
-
Summary of Reactor Mk.1 Performances: Mmlu, Humaneval and Bbh Test Results, by Tj Dunham et al.
-
Summary of Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders For 3d Medical Image Segmentation, by Pengfei Gu et al.
-
Summary of Generating and Evolving Reward Functions For Highway Driving with Large Language Models, by Xu Han et al.
-
Summary of Nerfdeformer: Nerf Transformation From a Single View Via 3d Scene Flows, by Zhenggang Tang et al.
-
Summary of Explain the Black Box For the Sake Of Science: the Scientific Method in the Era Of Generative Artificial Intelligence, by Gianmarco Mengaldo
-
Summary of Qda-sql: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql, by Yinggang Sun et al.
-
Summary of Public Computer Vision Datasets For Precision Livestock Farming: a Systematic Survey, by Anil Bhujel et al.
-
Summary of Meshanything: Artist-created Mesh Generation with Autoregressive Transformers, by Yiwen Chen et al.
-
Summary of Trip-pal: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners, by Tomas De La Rosa et al.
-
Summary of Make It Count: Text-to-image Generation with An Accurate Number Of Objects, by Lital Binyamin et al.
-
Summary of Sstfb: Leveraging Self-supervised Pretext Learning and Temporal Self-attention with Feature Branching For Real-time Video Polyp Segmentation, by Ziang Xu et al.
-
Summary of Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms, by Rui Yang et al.
-
Summary of Long Story Short: Story-level Video Understanding From 20k Short Films, by Ridouane Ghermi et al.
-
Summary of Videogui: a Benchmark For Gui Automation From Instructional Videos, by Kevin Qinghong Lin et al.
-
Summary of Vega: Learning Interleaved Image-text Comprehension in Vision-language Large Models, by Chenyu Zhou et al.
-
Summary of Object Criticality For Safer Navigation, by Andrea Ceccarelli et al.
-
Summary of Qcqa: Quality and Capacity-aware Grouped Query Attention, by Vinay Joshi et al.
-
Summary of On the Worst Prompt Performance Of Large Language Models, by Bowen Cao et al.
-
Summary of A Reality Check Of the Benefits Of Llm in Business, by Ming Cheung
-
Summary of The Impact Of Quantization on Retrieval-augmented Generation: An Analysis Of Small Llms, by Mert Yazan et al.
-
Summary of Foodsky: a Food-oriented Large Language Model That Passes the Chef and Dietetic Examination, by Pengfei Zhou et al.
-
Summary of Unused Information in Token Probability Distribution Of Generative Llm: Improving Llm Reading Comprehension Through Calculation Of Expected Values, by Krystian Zawistowski
-
Summary of Autograding Mathematical Induction Proofs with Natural Language Processing, by Chenyan Zhao et al.
-
Summary of Beyond Words: on Large Language Models Actionability in Mission-critical Risk Analysis, by Matteo Esposito et al.
-
Summary of Improving Language Models For Emotion Analysis: Insights From Cognitive Science, by Constant Bonard et al.
-
Summary of Prompt-based Length Controlled Generation with Multiple Control Types, by Renlong Jie et al.
-
Summary of Veract Scan: Retrieval-augmented Fake News Detection with Justifiable Reasoning, by Cheng Niu et al.
-
Summary of Mix Q-learning For Lane Changing: a Collaborative Decision-making Method in Multi-agent Deep Reinforcement Learning, by Xiaojun Bi et al.
-
Summary of Research on Edge Detection Of Lidar Images Based on Artificial Intelligence Technology, by Haowei Yang et al.
-
Summary of Ospc: Detecting Harmful Memes with Large Language Model As a Catalyst, by Jingtao Cao et al.
-
Summary of From Manifestations to Cognitive Architectures: a Scalable Framework, by Alfredo Ibias et al.
-
Summary of Shmamba: Structured Hyperbolic State Space Model For Audio-visual Question Answering, by Zhe Yang et al.
-
Summary of Vision-language Models Meet Meteorology: Developing Models For Extreme Weather Events Detection with Heatmaps, by Jian Chen et al.
-
Summary of Knowledge Editing in Language Models Via Adapted Direct Preference Optimization, by Amit Rozner et al.
-
Summary of Experiments in News Bias Detection with Pre-trained Neural Transformers, by Tim Menzner et al.
-
Summary of Hiro: Hierarchical Information Retrieval Optimization, by Krish Goel et al.
-
Summary of Details Make a Difference: Object State-sensitive Neurorobotic Task Planning, by Xiaowen Sun et al.
-
Summary of Tilt and Average: Geometric Adjustment Of the Last Layer For Recalibration, by Gyusang Cho and Chan-hyun Youn
-
Summary of Fzi-wim at Semeval-2024 Task 2: Self-consistent Cot For Complex Nli in Biomedical Domain, by Jin Liu and Steffen Thoma
-
Summary of First Multi-dimensional Evaluation Of Flowchart Comprehension For Multimodal Large Language Models, by Enming Zhang et al.
-
Summary of Localizing Events in Videos with Multimodal Queries, by Gengyuan Zhang and Mang Ling Ada Fok and Jialu Ma and Yan Xia and Daniel Cremers and Philip Torr and Volker Tresp and Jindong Gu
-
Summary of Skysensegpt: a Fine-grained Instruction Tuning Dataset and Model For Remote Sensing Vision-language Understanding, by Junwei Luo et al.
-
Summary of Exploration by Learning Diverse Skills Through Successor State Measures, by Paul-antoine Le Tolguenec et al.
-
Summary of Babilong: Testing the Limits Of Llms with Long Context Reasoning-in-a-haystack, by Yuri Kuratov et al.
-
Summary of Sycophancy to Subterfuge: Investigating Reward-tampering in Large Language Models, by Carson Denison et al.
-
Summary of Muirbench: a Comprehensive Benchmark For Robust Multi-image Understanding, by Fei Wang et al.
-
Summary of Advancing High Resolution Vision-language Models in Biomedicine, by Zekai Chen and Arda Pekis and Kevin Brown
-
Summary of Pandora: Towards General World Model with Natural Language Actions and Video States, by Jiannan Xiang et al.
-
Summary of Updating Clip to Prefer Descriptions Over Captions, by Amir Zur et al.
-
Summary of Svitt-ego: a Sparse Video-text Transformer For Egocentric Video, by Hector A. Valdez and Kyle Min and Subarna Tripathi
-
Summary of Semopo: Learning High-quality Model and Policy From Low-quality Offline Visual Datasets, by Shenghua Wan et al.
-
Summary of Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms in Cognitive Science?, by Desmond C. Ong
-
Summary of Talking Heads: Understanding Inter-layer Communication in Transformer Language Models, by Jack Merullo et al.
-
Summary of My Body My Choice: Human-centric Full-body Anonymization, by Umur Aybars Ciftci et al.
-
Summary of Speech Reallm: Real-time Streaming Speech Recognition with Multimodal Llms by Teaching the Flow Of Time, by Frank Seide et al.
-
Summary of Analyzing Gender Polarity in Short Social Media Texts with Bert: the Role Of Emojis and Emoticons, by Saba Yousefian Jazi et al.
-
Summary of Introducing Hot3d: An Egocentric Dataset For 3d Hand and Object Tracking, by Prithviraj Banerjee et al.
-
Summary of Multi-modal Retrieval For Large Language Model Based Speech Recognition, by Jari Kolehmainen et al.
-
Summary of Dsl-fiqa: Assessing Facial Image Quality Via Dual-set Degradation Learning and Landmark-guided Transformer, by Wei-ting Chen and Gurunandan Krishnan and Qiang Gao and Sy-yen Kuo and Sizhuo Ma and Jian Wang
-
Summary of Robustsam: Segment Anything Robustly on Degraded Images, by Wei-ting Chen and Yu-jiet Vong and Sy-yen Kuo and Sizhuo Ma and Jian Wang
-
Summary of A Survey Of Video Datasets For Grounded Event Understanding, by Kate Sanders and Benjamin Van Durme
-
Summary of Fine-grained Urban Flow Inference with Multi-scale Representation Learning, by Shilu Yuan et al.
-
Summary of Learning Language Structures Through Grounding, by Freda Shi
-
Summary of Self-knowledge Distillation For Learning Ambiguity, by Hancheol Park et al.
-
Summary of Controlvar: Exploring Controllable Visual Autoregressive Modeling, by Xiang Li et al.
-
Summary of Egoexo-fitness: Towards Egocentric and Exocentric Full-body Action Understanding, by Yuan-ming Li et al.
-
Summary of Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-attention Cues in Multitask Learning, by Arnav Goel et al.
-
Summary of Introducing Brain-like Concepts to Embodied Hand-crafted Dialog Management System, by Frank Joublin et al.
-
Summary of Multi-agent Software Development Through Cross-team Collaboration, by Zhuoyun Du et al.
-
Summary of Language Models Are Crossword Solvers, by Soumadeep Saha and Sutanoya Chakraborty and Saptarshi Saha and Utpal Garain
-
Summary of Suitability Of Kans For Computer Vision: a Preliminary Investigation, by Basim Azam and Naveed Akhtar
-
Summary of Towards Reliable Detection Of Llm-generated Texts: a Comprehensive Evaluation Framework with Cudrt, by Zhen Tao et al.
-
Summary of Pc-lora: Low-rank Adaptation For Progressive Model Compression with Knowledge Distillation, by Injoon Hwang et al.