Paper List
We recommend you use the search box as this list is very long.
-
Summary of Fine-grained Domain Generalization with Feature Structuralization, by Wenlong Yu et al.
-
Summary of Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn’t, by Chihiro Taguchi and David Chiang
-
Summary of A Large-scale Universal Evaluation Benchmark For Face Forgery Detection, by Yijun Bei et al.
-
Summary of Readctrl: Personalizing Text Generation with Readability-controlled Instruction Learning, by Hieu Tran et al.
-
Summary of Applying Multi-agent Negotiation to Solve the Production Routing Problem with Privacy Preserving, by Luiza Pellin Biasoto et al.
-
Summary of Towards a Characterisation Of Monte-carlo Tree Search Performance in Different Games, by Dennis J.n.j. Soemers et al.
-
Summary of Deep Transformer Network For Monocular Pose Estimation Of Ship-based Uav, by Maneesha Wickramasuriya et al.
-
Summary of Action2sound: Ambient-aware Generation Of Action Sounds From Egocentric Videos, by Changan Chen et al.
-
Summary of Parameter-efficient Active Learning For Foundational Models, by Athmanarayanan Lakshmi Narayanan et al.
-
Summary of Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms, by Miaosen Zhang et al.
-
Summary of Mmscan: a Multi-modal 3d Scene Dataset with Hierarchical Grounded Language Annotations, by Ruiyuan Lyu et al.
-
Summary of Star: a First-ever Dataset and a Large-scale Benchmark For Scene Graph Generation in Large-size Satellite Imagery, by Yansheng Li et al.
-
Summary of Taste: Teaching Large Language Models to Translate Through Self-reflection, by Yutong Wang et al.
-
Summary of Olmes: a Standard For Language Model Evaluations, by Yuling Gu et al.
-
Summary of Magpie: Alignment Data Synthesis From Scratch by Prompting Aligned Llms with Nothing, By Zhangchen Xu et al.
-
Summary of Asi As the New God: Technocratic Theocracy, by Tevfik Uyar
-
Summary of Surprise! Using Physiological Stress For Allostatic Regulation Under the Active Inference Framework [pre-print], by Imran Khan and Robert Lowe
-
Summary of Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks, by Justin Zhao et al.
-
Summary of A Generative Marker Enhanced End-to-end Framework For Argument Mining, by Nilmadhab Das et al.
-
Summary of Reversing the Forget-retain Objectives: An Efficient Llm Unlearning Framework From Logit Difference, by Jiabao Ji et al.
-
Summary of Tc-bench: Benchmarking Temporal Compositionality in Text-to-video and Image-to-video Generation, by Weixi Feng et al.
-
Summary of Alphazeroes: Direct Score Maximization Outperforms Planning Loss Minimization, by Carlos Martin et al.
-
Summary of Vlind-bench: Measuring Language Priors in Large Vision-language Models, by Kang-il Lee et al.
-
Summary of Batch-instructed Gradient For Prompt Evolution:systematic Prompt Optimization For Enhanced Text-to-image Synthesis, by Xinrui Yang et al.
-
Summary of 3d Building Generation in Minecraft Via Large Language Models, by Shiying Hu et al.
-
Summary of Srfund: a Multi-granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding, by Jiefeng Ma et al.
-
Summary of Injecting Combinatorial Optimization Into Mcts: Application to the Board Game Boop, by Florian Richoux
-
Summary of A Survey on Compositional Learning Of Ai Models: Theoretical and Experimental Practices, by Sania Sinha et al.
-
Summary of Computer Vision-based Model For Detecting Turning Lane Features on Florida’s Public Roadways, by Richard Boadu Antwi et al.
-
Summary of An Approach to Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants, by G P Shrivatsa Bhargav et al.
-
Summary of Zoom and Shift Are All You Need, by Jiahao Qin
-
Summary of Openobj: Open-vocabulary Object-level Neural Radiance Fields with Fine-grained Understanding, by Yinan Deng et al.
-
Summary of Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-language Models, by Shimin Chen et al.
-
Summary of Lvbench: An Extreme Long Video Understanding Benchmark, by Weihan Wang et al.
-
Summary of Multimodal Table Understanding, by Mingyu Zheng et al.
-
Summary of Austrotox: a Dataset For Target-based Austrian German Offensive Language Detection, by Pia Pachinger et al.
-
Summary of Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling, by Zile Qiao et al.
-
Summary of Legend: Leveraging Representation Engineering to Annotate Safety Margin For Preference Datasets, by Duanyu Feng et al.
-
Summary of Making Ai Intelligible: Philosophical Foundations, by Herman Cappelen and Josh Dever
-
Summary of Continuous Fake Media Detection: Adapting Deepfake Detectors to New Generative Techniques, by Francesco Tassone et al.
-
Summary of A Sociotechnical Lens For Evaluating Computer Vision Models: a Case Study on Detecting and Reasoning About Gender and Emotion, by Sha Luo et al.
-
Summary of Mobileagentbench: An Efficient and User-friendly Benchmark For Mobile Llm Agents, by Luyuan Wang et al.
-
Summary of Research Trends For the Interplay Between Large Language Models and Knowledge Graphs, by Hanieh Khorashadizadeh et al.
-
Summary of Using Deep Convolutional Neural Networks to Detect Rendered Glitches in Video Games, by Carlos Garcia Ling et al.
-
Summary of From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition, by Shiwei Wu et al.
-
Summary of 2.5d Multi-view Averaging Diffusion Model For 3d Medical Image Translation: Application to Low-count Pet Reconstruction with Ct-less Attenuation Correction, by Tianqi Chen et al.
-
Summary of Mmworld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos, by Xuehai He et al.
-
Summary of Tailoring Generative Ai Chatbots For Multiethnic Communities in Disaster Preparedness Communication: Extending the Casa Paradigm, by Xinyan Zhao et al.
-
Summary of Omnicorpus: a Unified Multimodal Corpus Of 10 Billion-level Images Interleaved with Text, by Qingyun Li et al.
-
Summary of Awgunet: Attention-aided Wavelet Guided U-net For Nuclei Segmentation in Histopathology Images, by Ayush Roy et al.
-
Summary of Next-generation Database Interfaces: a Survey Of Llm-based Text-to-sql, by Zijin Hong et al.
-
Summary of Test-time Fairness and Robustness in Large Language Models, by Leonardo Cotta and Chris J. Maddison
-
Summary of Mllmguard: a Multi-dimensional Safety Evaluation Suite For Multimodal Large Language Models, by Tianle Gu et al.
-
Summary of Cupid: Contextual Understanding Of Prompt-conditioned Image Distributions, by Yayan Zhao et al.
-
Summary of The Muse 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition, by Shahin Amiriparian et al.
-
Summary of Making Task-oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests, By Amogh Mannekote et al.
-
Summary of Judging the Judges: a Systematic Study Of Position Bias in Llm-as-a-judge, by Lin Shi et al.
-
Summary of Collective Constitutional Ai: Aligning a Language Model with Public Input, by Saffron Huang et al.
-
Summary of Are Large Language Models Good Statisticians?, by Yizhang Zhu et al.
-
Summary of Sense Less, Generate More: Pre-training Lidar Perception with Masked Autoencoders For Ultra-efficient 3d Sensing, by Sina Tayebati et al.
-
Summary of Multi-agent Reinforcement Learning with Deep Networks For Diverse Q-vectors, by Zhenglong Luo et al.
-
Summary of Sciriff: a Resource to Enhance Language Model Instruction-following Over Scientific Literature, by David Wadden et al.
-
Summary of Dynamic Stochastic Decoding Strategy For Open-domain Dialogue Generation, by Yiwei Li et al.
-
Summary of Let’s Go Real Talk: Spoken Dialogue Model For Face-to-face Conversation, by Se Jin Park et al.
-
Summary of Unveiling the Power Of Wavelets: a Wavelet-based Kolmogorov-arnold Network For Hyperspectral Image Classification, by Seyd Teymoor Seydi and Zavareh Bozorgasl and Hao Chen
-
Summary of Exploring Self-supervised Multi-view Contrastive Learning For Speech Emotion Recognition with Limited Annotations, by Bulat Khaertdinov et al.
-
Summary of Designing a Dashboard For Transparency and Control Of Conversational Ai, by Yida Chen et al.
-
Summary of Toward a Method to Generate Capability Ontologies From Natural Language Descriptions, by Luis Miguel Vieira Da Silva et al.
-
Summary of Efficient Adaptation in Mixed-motive Environments Via Hierarchical Opponent Modeling and Planning, by Yizhe Huang et al.
-
Summary of Shacl2fol: An Fol Toolkit For Shacl Decision Problems, by Paolo Pareti
-
Summary of Is One Gpu Enough? Pushing Image Generation at Higher-resolutions with Foundation Models, by Athanasios Tragakis et al.
-
Summary of Scholarly Question Answering Using Large Language Models in the Nfdi4datascience Gateway, by Hamed Babaei Giglou et al.
-
Summary of Dca-bench: a Benchmark For Dataset Curation Agents, by Benhao Huang et al.
-
Summary of Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication, by Olaf Lipinski et al.
-
Summary of Can We Achieve High-quality Direct Speech-to-speech Translation Without Parallel Speech Data?, by Qingkai Fang et al.
-
Summary of Unsupervised Object Detection with Theoretical Guarantees, by Marian Longa et al.
-
Summary of Ctc-based Non-autoregressive Textless Speech-to-speech Translation, by Qingkai Fang et al.
-
Summary of Toxic Memes: a Survey Of Computational Perspectives on the Detection and Explanation Of Meme Toxicities, by Delfina Sol Martinez Pandiani et al.
-
Summary of Bvsp: Broad-view Soft Prompting For Few-shot Aspect Sentiment Quad Prediction, by Yinhao Bai et al.
-
Summary of Large Language Models For Constrained-based Causal Discovery, by Kai-hendrik Cohrs et al.
-
Summary of Accessing Gpt-4 Level Mathematical Olympiad Solutions Via Monte Carlo Tree Self-refine with Llama-3 8b, by Di Zhang et al.
-
Summary of Textual Similarity As a Key Metric in Machine Translation Quality Estimation, by Kun Sun et al.
-
Summary of Cads: a Systematic Literature Review on the Challenges Of Abstractive Dialogue Summarization, by Frederic Kirstein et al.
-
Summary of Neural Gaffer: Relighting Any Object Via Diffusion, by Haian Jin et al.
-
Summary of Commonsense-t2i Challenge: Can Text-to-image Generation Models Understand Commonsense?, by Xingyu Fu et al.
-
Summary of Structured Active Inference (extended Abstract), by Toby St Clere Smithe
-
Summary of Open-llm-leaderboard: From Multi-choice to Open-style Questions For Llms Evaluation, Benchmark, and Arena, by Aidar Myrzakhan and Sondos Mahmoud Bsharat and Zhiqiang Shen
-
Summary of Situated Ground Truths: Enhancing Bias-aware Ai by Situating Data Labels with Situannotate, By Delfina Sol Martinez Pandiani and Valentina Presutti
-
Summary of Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models, by Wanaiu Huang
-
Summary of Modeling Sustainable Resource Management Using Active Inference, by Mahault Albarracin et al.
-
Summary of Caap: Context-aware Action Planning Prompting to Solve Computer Tasks with Front-end Ui Only, by Junhee Cho et al.
-
Summary of Evolving Subnetwork Training For Large Language Models, by Hanqi Li et al.
-
Summary of Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?, by Kailas Dayanandan et al.
-
Summary of Mitigating Boundary Ambiguity and Inherent Bias For Text Classification in the Era Of Large Language Models, by Zhenyi Lu et al.
-
Summary of Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models, by Sooyeon Go and Kyungmook Choi and Minjung Shin and Youngjung Uh
-
Summary of Delving Into Chatgpt Usage in Academic Writing Through Excess Vocabulary, by Dmitry Kobak et al.
-
Summary of Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-aware Query Representation Learning, by Jeonghoon Kim et al.
-
Summary of Paying More Attention to Source Context: Mitigating Unfaithful Translations From Large Language Model, by Hongbin Zhang et al.