Paper List
We recommend you use the search box as this list is very long.
-
Summary of Location-guided Head Pose Estimation For Fisheye Image, by Bing Li et al.
-
Summary of Focus on Your Question! Interpreting and Mitigating Toxic Cot Problems in Commonsense Reasoning, by Jiachun Li et al.
-
Summary of Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model, by Sangjoon Park et al.
-
Summary of Pandas: Prototype-based Novel Class Discovery and Detection, by Tyler L. Hayes et al.
-
Summary of Demonstrating and Reducing Shortcuts in Vision-language Representation Learning, by Maurits Bleeker et al.
-
Summary of Deep Learning Based Named Entity Recognition Models For Recipes, by Mansi Goel et al.
-
Summary of Predict the Next Word: Humans Exhibit Uncertainty in This Task and Language Models _____, by Evgenia Ilia and Wilker Aziz
-
Summary of Cocoa: Cbt-based Conversational Counseling Agent Using Memory Specialized in Cognitive Distortions and Dynamic Prompt, by Suyeon Lee et al.
-
Summary of Agent-pro: Learning to Evolve Via Policy-level Reflection and Optimization, by Wenqi Zhang et al.
-
Summary of Are Llms Capable Of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data, by Xiao Liu et al.
-
Summary of Omniact: a Dataset and Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop and Web, by Raghav Kapoor et al.
-
Summary of Reprune: Channel Pruning Via Kernel Representative Selection, by Mincheol Park et al.
-
Summary of Case-based or Rule-based: How Do Transformers Do the Math?, by Yi Hu et al.
-
Summary of Researchy Questions: a Dataset Of Multi-perspective, Decompositional Questions For Llm Web Agents, by Corby Rosset et al.
-
Summary of Extracting Lexical Features From Dialects Via Interpretable Dialect Classifiers, by Roy Xie et al.
-
Summary of Inducing Generalization Across Languages and Tasks Using Featurized Low-rank Mixtures, by Chu-cheng Lin and Xinyi Wang and Jonathan H. Clark and Han Lu and Yun Zhu and Chenxi Whitehouse and Hongkun Yu
-
Summary of Adversarial Math Word Problem Generation, by Roy Xie et al.
-
Summary of Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction, by Koki Maeda et al.
-
Summary of All in An Aggregated Image For In-image Learning, by Lei Wang et al.
-
Summary of A Sentiment Consolidation Framework For Meta-review Generation, by Miao Li and Jey Han Lau and Eduard Hovy
-
Summary of A Survey on Recent Advances in Llm-based Multi-turn Dialogue Systems, by Zihao Yi et al.
-
Summary of Do Large Language Models Mirror Cognitive Language Processing?, by Yuqi Ren et al.
-
Summary of Reslora: Identity Residual Mapping in Low-rank Adaption, by Shuhua Shi et al.
-
Summary of Theoretical Unification Of the Fractured Aspects Of Information, by Marcin J. Schroeder
-
Summary of Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections, By Lingjun Zhao et al.
-
Summary of Generative Retrieval with Large Language Models, by Ye Wang et al.
-
Summary of T-hitl Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality, by Susan Epstein et al.
-
Summary of Re-ex: Revising After Explanation Reduces the Factual Errors in Llm Responses, by Juyeon Kim et al.
-
Summary of Video As the New Language For Real-world Decision Making, by Sherry Yang et al.
-
Summary of Large Language Model For Participatory Urban Planning, by Zhilun Zhou et al.
-
Summary of Benchmarking Data Science Agents, by Yuge Zhang et al.
-
Summary of An Effective Mixture-of-experts Approach For Code-switching Speech Recognition Leveraging Encoder Disentanglement, by Tzu-ting Yang et al.
-
Summary of Playground V2.5: Three Insights Towards Enhancing Aesthetic Quality in Text-to-image Generation, by Daiqing Li et al.
-
Summary of Speak Out Of Turn: Safety Vulnerability Of Large Language Models in Multi-turn Dialogue, by Zhenhong Zhou et al.
-
Summary of Vcd: Knowledge Base Guided Visual Commonsense Discovery in Images, by Xiangqing Shen et al.
-
Summary of Enhancing Hyperspectral Images Via Diffusion Model and Group-autoencoder Super-resolution Network, by Zhaoyang Wang et al.
-
Summary of Probing Multimodal Large Language Models For Global and Local Semantic Representations, by Mingxu Tao et al.
-
Summary of Capt: Category-level Articulation Estimation From a Single Point Cloud Using Transformer, by Lian Fu et al.
-
Summary of Socialcvae: Predicting Pedestrian Trajectory Via Interaction Conditioned Latents, by Wei Xiang et al.
-
Summary of Fairbelief — Assessing Harmful Beliefs in Language Models, by Mattia Setzu et al.
-
Summary of Benchmarking Gpt-4 on Algorithmic Problems: a Systematic Evaluation Of Prompting Strategies, by Flavio Petruzzellis et al.
-
Summary of Exploiting Emotion-semantic Correlations For Empathetic Response Generation, by Zhou Yang et al.
-
Summary of Medit: Multilingual Text Editing Via Instruction Tuning, by Vipul Raheja and Dimitris Alikaniotis and Vivek Kulkarni and Bashar Alhafni and Dhruv Kumar
-
Summary of On Languaging a Simulation Engine, by Han Liu et al.
-
Summary of Intelligent Known and Novel Aircraft Recognition — a Shift From Classification to Similarity Learning For Combat Identification, by Ahmad Saeed et al.
-
Summary of Memory Gaps: Would Llms Pass the Tulving Test?, by Jean-marie Chauvet
-
Summary of Aligning Large Language Models to a Domain-specific Graph Database For Nl2gql, by Yuanyuan Liang et al.
-
Summary of Understanding the Dataset Practitioners Behind Large Language Model Development, by Crystal Qian et al.
-
Summary of A Comprehensive Survey Of Belief Rule Base (brb) Hybrid Expert System: Bridging Decision Science and Professional Services, by Karim Derrick
-
Summary of Genainet: Enabling Wireless Collective Intelligence Via Knowledge Transfer and Reasoning, by Hang Zou et al.
-
Summary of Gigapevt: Multimodal Medical Assistant, by Pavel Blinov et al.
-
Summary of Repoagent: An Llm-powered Open-source Framework For Repository-level Code Documentation Generation, by Qinyu Luo et al.
-
Summary of Automated Floodwater Depth Estimation Using Large Multimodal Model For Rapid Flood Mapping, by Temitope Akinboyewa et al.
-
Summary of Adaptation Of Biomedical and Clinical Pretrained Models to French Long Documents: a Comparative Study, by Adrien Bazoge et al.
-
Summary of Codechameleon: Personalized Encryption Framework For Jailbreaking Large Language Models, by Huijie Lv et al.
-
Summary of Generating Effective Ensembles For Sentiment Analysis, by Itay Etelis et al.
-
Summary of Dress: Dataset For Rubric-based Essay Scoring on Efl Writing, by Haneul Yoo et al.
-
Summary of Misc: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model, By Chunyi Li et al.
-
Summary of Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems, by Enrico Liscio et al.
-
Summary of A Comprehensive Evaluation Of Quantization Strategies For Large Language Models, by Renren Jin et al.
-
Summary of Cross-modal Projection in Multimodal Llms Doesn’t Really Project Visual Attributes to Textual Space, by Gaurav Verma et al.
-
Summary of Groundhog: Grounding Large Language Models to Holistic Segmentation, by Yichi Zhang et al.
-
Summary of Deep Homography Estimation For Visual Place Recognition, by Feng Lu et al.
-
Summary of Lstprompt: Large Language Models As Zero-shot Time Series Forecasters by Long-short-term Prompting, By Haoxin Liu et al.
-
Summary of Towards Accurate Post-training Quantization For Reparameterized Models, by Luoming Zhang et al.
-
Summary of From Text to Transformation: a Comprehensive Review Of Large Language Models’ Versatility, by Pravneet Kaur et al.
-
Summary of Hitting “probe”rty with Non-linearity, and More, by Avik Pal et al.
-
Summary of Gennbv: Generalizable Next-best-view Policy For Active 3d Reconstruction, by Xiao Chen and Quanyi Li and Tai Wang and Tianfan Xue and Jiangmiao Pang
-
Summary of One-stage Prompt-based Continual Learning, by Youngeun Kim et al.
-
Summary of Hsonet:a Siamese Foreground Association-driven Hard Case Sample Optimization Network For High-resolution Remote Sensing Image Change Detection, by Chao Tao et al.
-
Summary of Topic-to-essay Generation with Knowledge-based Content Selection, by Jieyong Wang et al.
-
Summary of Perltqa: a Personal Long-term Memory Dataset For Memory Classification, Retrieval, and Synthesis in Question Answering, by Yiming Du et al.
-
Summary of Mv-swin-t: Mammogram Classification with Multi-view Swin Transformer, by Sushmita Sarker et al.
-
Summary of Cross-domain Chinese Sentence Pattern Parsing, by Jingsi Yu et al.
-
Summary of Contingency Planning Using Bi-level Markov Decision Processes For Space Missions, by Somrita Banerjee and Edward Balaban and Mark Shirley and Kevin Bradner and Marco Pavone
-
Summary of Chain-of-discussion: a Multi-model Framework For Complex Evidence-based Question Answering, by Mingxu Tao and Dongyan Zhao and Yansong Feng
-
Summary of Mathgenie: Generating Synthetic Data with Question Back-translation For Enhancing Mathematical Reasoning Of Llms, by Zimu Lu et al.
-
Summary of Layer-wise Regularized Dropout For Neural Language Models, by Shiwen Ni et al.
-
Summary of Llm Inference Unveiled: Survey and Roofline Model Insights, by Zhihang Yuan et al.
-
Summary of Tear: Improving Llm-based Machine Translation with Systematic Self-refinement, by Zhaopeng Feng et al.
-
Summary of Mozip: a Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property, by Shiwen Ni et al.
-
Summary of Defending Llms Against Jailbreaking Attacks Via Backtranslation, by Yihan Wang et al.
-
Summary of How Do Humans Write Code? Large Models Do It the Same Way Too, by Long Li et al.
-
Summary of Gaokao-mm: a Chinese Human-level Benchmark For Multimodal Models Evaluation, by Yi Zong et al.
-
Summary of Intelligent Director: An Automatic Framework For Dynamic Visual Composition Using Chatgpt, by Sixiao Zheng et al.
-
Summary of Chimera: a Lossless Decoding Method For Accelerating Large Language Models Inference by Fusing All Tokens, By Ziqian Zeng et al.
-
Summary of Tv-sam: Increasing Zero-shot Segmentation Performance on Multimodal Medical Images Using Gpt-4 Generated Descriptive Prompts Without Human Annotation, by Zekun Jiang et al.
-
Summary of Res-vmamba: Fine-grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning, by Chi-sheng Chen et al.
-
Summary of Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models, by Haoran Liao et al.
-
Summary of Construction and Application Of Artificial Intelligence Crowdsourcing Map Based on Multi-track Gps Data, by Yong Wang et al.
-
Summary of Dart: Depth-enhanced Accurate and Real-time Background Matting, by Hanxi Li et al.
-
Summary of Empowering Large Language Model Agents Through Action Learning, by Haiteng Zhao et al.
-
Summary of Multiple Instance Learning For Glioma Diagnosis Using Hematoxylin and Eosin Whole Slide Images: An Indian Cohort Study, by Ekansh Chauhan et al.
-
Summary of Multicontrievers: Analysis Of Dense Retrieval Representations, by Seraphina Goldfarb-tarrant et al.
-
Summary of Budget-constrained Tool Learning with Planning, by Yuanhang Zheng et al.
-
Summary of Likelihood-based Mitigation Of Evaluation Bias in Large Language Models, by Masanari Ohi et al.
-
Summary of Pidformer: Transformer Meets Control Theory, by Tam Nguyen et al.
-
Summary of Tmt: Tri-modal Translation Between Speech, Image, and Text by Processing Different Modalities As Different Languages, By Minsu Kim et al.
-
Summary of Don’t Forget Your Reward Values: Language Model Alignment Via Value-based Calibration, by Xin Mao et al.