Paper List
-
Summary of The Cognitive Capabilities Of Generative AI: a Comparative Analysis with Human Benchmarks, by Isaac R. Galatzer-Levy et al.
-
Summary of WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents, by Siyu Zhou et al.
-
Summary of Using LLMs to Discover Legal Factors, by Morgan Gray and Jaromir Savelka and Wesley Oliver and Kevin Ashley
-
Summary of Onenet: a Fine-tuning Free Framework For Few-shot Entity Linking Via Large Language Model Prompting, by Xukai Liu et al.
-
Summary of Comma: a Communicative Multimodal Multi-agent Benchmark, by Timothy Ossowski et al.
-
Summary of KRAG Framework For Enhancing LLMs in the Legal Domain, by Nguyen Ha Thanh et al.
-
Summary of When and Where Did It Happen? An Encoder-decoder Model to Identify Scenario Context, by Enrique Noriega-Atala et al.
-
Summary of Moyun: a Diffusion-based Model For Style-specific Chinese Calligraphy Generation, by Kaiyuan Liu et al.
-
Summary of A Unified Debiasing Approach For Vision-language Models Across Modalities and Tasks, by Hoin Jung et al.
-
Summary of Harivo: Harnessing Text-to-image Models For Video Generation, by Mingi Kwon et al.
-
Summary of Macpo: Weak-to-strong Alignment Via Multi-agent Contrastive Preference Optimization, by Yougang Lyu et al.
-
Summary of Mentalarena: Self-play Training Of Language Models For Diagnosis and Treatment Of Mental Health Disorders, by Cheng Li et al.
-
Summary of A Trilogy Of AI Safety Frameworks: Paths From Facts and Knowledge Gaps to Reliable Predictions and New Knowledge, by Simon Kasif
-
Summary of Self-boosting Large Language Models with Synthetic Preference Data, by Qingxiu Dong et al.
-
Summary of Uncovering Factor Level Preferences to Improve Human-model Alignment, by Juhyun Oh et al.
-
Summary of Personal Intelligence System Unilm: Hybrid On-device Small Language Model and Server-based Large Language Model For Malay Nusantara, by Azree Nazri et al.
-
Summary of Adaptive High-frequency Transformer For Diverse Wildlife Re-identification, by Chenyue Li et al.
-
Summary of Cursorcore: Assist Programming Through Aligning Anything, by Hao Jiang et al.
-
Summary of Pap2pat: Benchmarking Outline-guided Long-text Patent Generation with Patent-paper Pairs, by Valentin Knappich et al.
-
Summary of PositionID: LLMs Can Control Lengths, Copy and Paste with Explicit Positional Awareness, by Zekun Wang et al.
-
Summary of Rejecting Hallucinated State Targets During Planning, by Mingde Zhao et al.
-
Summary of I Want to Break Free! Persuasion and Anti-social Behavior Of LLMs in Multi-agent Settings with Social Hierarchy, by Gian Maria Campedelli et al.
-
Summary of Vhelm: a Holistic Evaluation Of Vision Language Models, by Tony Lee et al.
-
Summary of Cross-task Pretraining For Cross-organ Cross-scanner Adenocarcinoma Segmentation, by Adrian Galdran
-
Summary of Mental Disorders Detection in the Era Of Large Language Models, by Gleb Kuzmin et al.
-
Summary of Better Language Models Exhibit Higher Visual Alignment, by Jona Ruthardt et al.
-
Summary of Taking a Turn For the Better: Conversation Redirection Throughout the Course Of Mental-health Therapy, by Vivian Nguyen et al.
-
Summary of Technical Report: Competition Solution For Modelscope-sora, by Shengfu Chen and Hailong Liu and Wenzhao Wei
-
Summary of AAAI Workshop on AI Planning For Cyber-physical Systems — CAIPI24, by Oliver Niggemann et al.
-
Summary of Learning Content-aware Multi-modal Joint Input Pruning Via Bird’s-eye-view Representation, by Yuxin Li et al.
-
Summary of Par: Prompt-aware Token Reduction Method For Efficient Large Multimodal Models, by Yingen Liu et al.
-
Summary of Integrating Planning Into Single-turn Long-form Text Generation, by Yi Liang et al.
-
Summary of Egosocialarena: Benchmarking the Social Intelligence Of Large Language Models From a First-person Perspective, by Guiyang Hou et al.
-
Summary of Probing the Robustness Of Theory Of Mind in Large Language Models, by Christian Nickel et al.
-
Summary of Predict: Preference Reasoning by Evaluating Decomposed Preferences Inferred From Candidate Trajectories, by Stephane Aroca-Ouellette et al.
-
Summary of A Taxonomy Of Collectible Card Games From a Game-playing AI Perspective, by Ronaldo E Silva Vieira et al.
-
Summary of Validation Of the Scientific Literature Via Chemputation Augmented by Large Language Models, by Sebastian Pagel et al.
-
Summary of Tackling the Abstraction and Reasoning Corpus with Vision Transformers: the Importance Of 2D Representation, Positions, and Objects, by Wenhao Li et al.
-
Summary of The Sampling-gaussian For Stereo Matching, by Baiyu Pan and Jichao Jiao and Bowen Yao and Jianxin Pang and Jun Cheng
-
Summary of Chip-tuning: Classify Before Language Models Say, by Fangwei Zhu et al.
-
Summary of Investigating Cost-efficiency Of Llm-generated Training Data For Conversational Semantic Frame Analysis, by Shiho Matta et al.
-
Summary of The Accuracy Paradox in RLHF: When Better Reward Models Don’t Yield Better Language Models, by Yanjun Chen et al.
-
Summary of Learning Evolving Tools For Large Language Models, by Guoxin Chen et al.
-
Summary of Subtle Errors Matter: Preference Learning Via Error-injected Self-editing, by Kaishuai Xu et al.
-
Summary of Decouple-then-merge: Finetune Diffusion Models As Multi-task Learning, by Qianli Ma et al.
-
Summary of Large Language Models As Code Executors: An Exploratory Study, by Chenyang Lyu et al.
-
Summary of ST-WebAgentBench: a Benchmark For Evaluating Safety and Trustworthiness in Web Agents, by Ido Levy et al.
-
Summary of Calibrating Verbalized Probabilities For Large Language Models, by Cheng Wang et al.
-
Summary of Suppress Content Shift: Better Diffusion Features Via Off-the-shelf Generation Techniques, by Benyuan Meng et al.
-
Summary of Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?, by Fumiya Uchiyama et al.
-
Summary of Weak-eval-strong: Evaluating and Eliciting Lateral Thinking Of LLMs with Situation Puzzles, by Qi Chen et al.
-
Summary of Postcast: Generalizable Postprocessing For Precipitation Nowcasting Via Unsupervised Blurriness Modeling, by Junchao Gong et al.
-
Summary of Communicating with Speakers and Listeners Of Different Pragmatic Levels, by Kata Naszadi et al.
-
Summary of Bottom-up Anytime Discovery Of Generalised Multimodal Graph Patterns For Knowledge Graphs, by Xander Wilcke et al.
-
Summary of From Tokens to Words: on the Inner Lexicon Of LLMs, by Guy Kaplan et al.
-
Summary of Believing Is Seeing: Unobserved Object Detection Using Generative Models, by Subhransu S. Bhattacharjee and Dylan Campbell and Rahul Shome
-
Summary of Heuristics For Partially Observable Stochastic Contingent Planning, by Guy Shani
-
Summary of Automatic Summarization Of Long Documents, by Naman Chhibbar et al.
-
Summary of Mexa: Multilingual Evaluation Of English-centric LLMs Via Cross-lingual Alignment, by Amir Hossein Kargaran et al.
-
Summary of Give Me a Hint: Can LLMs Take a Hint to Solve Math Problems?, by Vansh Agrawal et al.
-
Summary of Beyond Captioning: Task-specific Prompting For Improved VLM Performance in Mathematical Reasoning, by Ayush Singh et al.
-
Summary of Emma: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment, by Yifei Xing et al.
-
Summary of Athanor: Local Search Over Abstract Constraint Specifications, by Saad Attieh et al.
-
Summary of Stnet: Deep Audio-visual Fusion Network For Robust Speaker Tracking, by Yidi Li and Hong Liu and Bing Yang
-
Summary of PDF-WuKong: a Large Multimodal Model For Efficient Long PDF Reading with End-to-end Sparse Sampling, by Xudong Xie et al.
-
Summary of Vector Grimoire: Codebook-based Shape Generation Under Raster Image Supervision, by Moritz Feuerpfeil et al.
-
Summary of Block Induced Signature Generative Adversarial Network (BISGAN): Signature Spoofing Using GANs and Their Evaluation, by Haadia Amjad et al.
-
Summary of Tower: Tree Organized Weighting For Evaluating Complex Instructions, by Noah Ziems et al.
-
Summary of Coevolving with the Other You: Fine-tuning LLM with Sequential Cooperative Multi-agent Reinforcement Learning, by Hao Ma et al.
-
Summary of Multimodal Situational Safety, by Kaiwen Zhou et al.
-
Summary of ConceptAgent: LLM-driven Precondition Grounding and Tree Search For Robust Task Planning and Execution, by Corban Rivera et al.
-
Summary of Toward General Object-level Mapping From Sparse Views with 3D Diffusion Priors, by Ziwei Liao et al.
-
Summary of On Instruction-finetuning Neural Machine Translation Models, by Vikas Raunak et al.
-
Summary of Accelerating Flood Warnings by 10 Hours: the Power Of River Network Topology in AI-enhanced Flood Forecasting, by Hongjun Wang et al.
-
Summary of Narrative-of-thought: Improving Temporal Reasoning Of Large Language Models Via Recounted Narratives, by Xinliang Frederick Zhang et al.
-
Summary of Claimbrush: a Novel Framework For Automated Patent Claim Refinement Based on Large Language Models, by Seiya Kawano et al.
-
Summary of Teasergen: Generating Teasers For Long Documentaries, by Weihan Xu et al.
-
Summary of Closer: Towards Better Representation Learning For Few-shot Class-incremental Learning, by Junghun Oh et al.
-
Summary of A Unified Framework For Motion Reasoning and Generation in Human Interaction, by Jeongeun Park et al.
-
Summary of On the Modeling Capabilities Of Large Language Models For Sequential Decision Making, by Martin Klissarov et al.
-
Summary of ACPBench: Reasoning About Action, Change, and Planning, by Harsha Kokel et al.
-
Summary of T2V-Turbo-v2: Enhancing Video Generation Model Post-training Through Data, Reward, and Conditional Guidance Design, by Jiachen Li et al.
-
Summary of Pixlens: a Novel Framework For Disentangled Evaluation in Diffusion-based Image Editing with Object Detection + SAM, by Stefan Stefanache et al.
-
Summary of A Two-step Approach For Data-efficient French Pronunciation Learning, by Hoyeon Lee et al.
-
Summary of Reducing Fuzzy Relation Equations Via Concept Lattices, by David Lobo et al.
-
Summary of Mero Nagarikta: Advanced Nepali Citizenship Data Extractor with Deep Learning-powered Text Detection and OCR, by Sisir Dhakal et al.
-
Summary of Equi-GSPR: Equivariant SE(3) Graph Network Model For Sparse Point Cloud Registration, by Xueyang Kang and Zhaoliang Luan and Kourosh Khoshelham and Bing Wang
-
Summary of Grounding Is All You Need? Dual Temporal Grounding For Video Dialog, by You Qin et al.
-
Summary of Core Tokensets For Data-efficient Sequential Training Of Transformers, by Subarnaduti Paul et al.
-
Summary of Retrieving, Rethinking and Revising: the Chain-of-verification Can Improve Retrieval Augmented Generation, by Bolei He et al.
-
Summary of Scalable and Accurate Graph Reasoning with LLM-based Multi-agents, by Yuwei Hu et al.
-
Summary of Synthetic Generation Of Dermatoscopic Images with GAN and Closed-form Factorization, by Rohan Reddy Mekala et al.
-
Summary of CTC-GMM: CTC Guided Modality Matching For Fast and Accurate Streaming Speech Translation, by Rui Zhao et al.
-
Summary of Preserving Multi-modal Capabilities Of Pre-trained VLMs For Improving Vision-linguistic Compositionality, by Youngtaek Oh et al.
-
Summary of Beyond Correlation: Interpretable Evaluation Of Machine Translation Metrics, by Stefano Perrella et al.