Paper List
We recommend you use the search box as this list is very long.
-
Summary of Rethinking Kullback-leibler Divergence in Knowledge Distillation For Large Language Models, by Taiqiang Wu et al.
-
Summary of Non-negative Subspace Feature Representation For Few-shot Learning in Medical Imaging, by Keqiang Fan et al.
-
Summary of Pejorativity: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets, by Arianna Muti et al.
-
Summary of Imitation Game: a Model-based and Imitation Learning Deep Reinforcement Learning Hybrid, by Eric MSP Veith et al.
-
Summary of Beyond Accuracy: Evaluating the Reasoning Behavior Of Large Language Models — a Survey, by Philipp Mondorf and Barbara Plank
-
Summary of Rave: Residual Vector Embedding For Clip-guided Backlit Image Enhancement, by Tatiana Gaintseva et al.
-
Summary of Real, Fake and Synthetic Faces — Does the Coin Have Three Sides?, by Shahzeb Naeem et al.
-
Summary of Scanner: Knowledge-enhanced Approach For Robust Multi-modal Named Entity Recognition Of Unseen Entities, by Hyunjong Ok et al.
-
Summary of Sgsh: Stimulate Large Language Models with Skeleton Heuristics For Knowledge Base Question Generation, by Shasha Guo et al.
-
Summary of Improving Bird’s Eye View Semantic Segmentation by Task Decomposition, by Tianhao Zhao et al.
-
Summary of Multiparadetox: Extending Text Detoxification with Parallel Data to New Languages, by Daryna Dementieva et al.
-
Summary of A Survey on Large Language Model-based Game Agents, by Sihao Hu et al.
-
Summary of Cross-lingual Text Classification Transfer: the Case Of Ukrainian, by Daryna Dementieva et al.
-
Summary of Long-context Llms Struggle with Long In-context Learning, by Tianle Li et al.
-
Summary of Segment Any 3d Object with Language, by Seungjun Lee et al.
-
Summary of Chosen: Contrastive Hypothesis Selection For Multi-view Depth Refinement, by Di Qiu et al.
-
Summary of Extracting Norms From Contracts Via Chatgpt: Opportunities and Challenges, by Amanul Haque and Munindar P. Singh
-
Summary of Ofmpnet: Deep End-to-end Model For Occupancy and Flow Prediction in Urban Environment, by Youshaa Murhij and Dmitry Yudin
-
Summary of One Noise to Rule Them All: Multi-view Adversarial Attacks with Universal Perturbation, by Mehmet Ergezer and Phat Duong and Christian Green and Tommy Nguyen and Abdurrahman Zeybey
-
Summary of Collapse Of Self-trained Language Models, by David Herel and Tomas Mikolov
-
Summary of Comparative Study Of Domain Driven Terms Extraction Using Large Language Models, by Sandeep Chataut et al.
-
Summary of Multi-bert: Leveraging Adapters and Prompt Tuning For Low-resource Multi-domain Adaptation, by Parham Abed Azad and Hamid Beigy
-
Summary of Ovfoodseg: Elevating Open-vocabulary Food Image Segmentation Via Image-informed Textual Representation, by Xiongwei Wu et al.
-
Summary of Neural Implicit Representation For Building Digital Twins Of Unknown Articulated Objects, by Yijia Weng et al.
-
Summary of Generation and Detection Of Sign Language Deepfakes – a Linguistic and Visual Analysis, by Shahzeb Naeem et al.
-
Summary of Finding Regions Of Interest in Whole Slide Images Using Multiple Instance Learning, by Martim Afonso et al.
-
Summary of Unveiling Divergent Inductive Biases Of Llms on Temporal Data, by Sindhu Kishore et al.
-
Summary of Modality Translation For Object Detection Adaptation Without Forgetting Prior Knowledge, by Heitor Rapela Medeiros et al.
-
Summary of Some Orders Are Important: Partially Preserving Orders in Top-quality Planning, by Michael Katz et al.
-
Summary of On Train-test Class Overlap and Detection For Image Retrieval, by Chull Hwan Song et al.
-
Summary of Categorical Semiotics: Foundations For Knowledge Integration, by Carlos Leandro
-
Summary of Mchartqa: a Universal Benchmark For Multimodal Chart Question Answer Based on Vision-language Alignment and Reasoning, by Jingxuan Wei et al.
-
Summary of Bert-enhanced Retrieval Tool For Homework Plagiarism Detection System, by Jiarong Xian et al.
-
Summary of Helmsman Of the Masses? Evaluate the Opinion Leadership Of Large Language Models in the Werewolf Game, by Silin Du et al.
-
Summary of Classifying Cancer Stage with Open-source Clinical Large Language Models, by Chia-hsuan Chang et al.
-
Summary of Towards Better Generalization in Open-domain Question Answering by Mitigating Context Memorization, by Zixuan Zhang et al.
-
Summary of Ai Walkup: a Computer-vision Approach to Quantifying Mds-updrs in Parkinson’s Disease, by Xiang Xiang et al.
-
Summary of Upsample Guidance: Scale Up Diffusion Models Without Training, by Juno Hwang et al.
-
Summary of Towards Generalizable and Faithful Logic Reasoning Over Natural Language Via Resolution Refutation, by Zhouhao Sun et al.
-
Summary of Generative Ai For Immersive Communication: the Next Frontier in Internet-of-senses Through 6g, by Nassim Sehad et al.
-
Summary of Unleash the Potential Of Clip For Video Highlight Detection, by Donghoon Han et al.
-
Summary of Stereotype Detection in Llms: a Multiclass, Explainable, and Benchmark-driven Approach, by Zekun Wu et al.
-
Summary of Regularized Best-of-n Sampling with Minimum Bayes Risk Objective For Language Model Alignment, by Yuu Jinnai et al.
-
Summary of Advancing Ai with Integrity: Ethical Challenges and Solutions in Neural Machine Translation, by Richard Kimera et al.
-
Summary of Ails-ntua at Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles, by Ioannis Panagiotopoulos et al.
-
Summary of Texture-preserving Diffusion Models For High-fidelity Virtual Try-on, by Xu Yang et al.
-
Summary of Condition-aware Neural Network For Controlled Image Generation, by Han Cai et al.
-
Summary of Syncmask: Synchronized Attentional Masking For Fashion-centric Vision-language Pretraining, by Chull Hwan Song et al.
-
Summary of Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward, by Ruohong Zhang et al.
-
Summary of Fables: Evaluating Faithfulness and Content Selection in Book-length Summarization, by Yekyung Kim et al.
-
Summary of Isobench: Benchmarking Multimodal Foundation Models on Isomorphic Representations, by Deqing Fu et al.
-
Summary of A Review Of Multi-modal Large Language and Vision Models, by Kilian Carolan and Laura Fennelly and Alan F. Smeaton
-
Summary of Towards Safety and Helpfulness Balanced Responses Via Controllable Large Language Models, by Yi-lin Tuan et al.
-
Summary of Llava-gemma: Accelerating Multimodal Foundation Models with a Compact Language Model, by Musashi Hinck et al.
-
Summary of Finefake: a Knowledge-enriched Dataset For Fine-grained Multi-domain Fake News Detection, by Ziyi Zhou et al.
-
Summary of Humane Speech Synthesis Through Zero-shot Emotion and Disfluency Generation, by Rohan Chaudhury et al.
-
Summary of Diffagent: Fast and Accurate Text-to-image Api Selection with Large Language Model, by Lirui Zhao et al.
-
Summary of Chops: Chat with Customer Profile Systems For Customer Service with Llms, by Jingzhe Shi et al.
-
Summary of Object-conditioned Bag Of Instances For Few-shot Personalized Instance Recognition, by Umberto Michieli et al.
-
Summary of Fairness in Large Language Models: a Taxonomic Survey, by Zhibo Chu et al.
-
Summary of Learning to Generate Conditional Tri-plane For 3d-aware Expression Controllable Portrait Animation, by Taekyung Ki et al.
-
Summary of Drct: Saving Image Super-resolution Away From Information Bottleneck, by Chih-chung Hsu et al.
-
Summary of Wavllm: Towards Robust and Adaptive Speech Large Language Model, by Shujie Hu et al.
-
Summary of Benchmark Transparency: Measuring the Impact Of Data on Evaluation, by Venelin Kovatchev and Matthew Lease
-
Summary of Llm Meets Vision-language Models For Zero-shot One-class Classification, by Yassir Bendou et al.
-
Summary of On the True Distribution Approximation Of Minimum Bayes-risk Decoding, by Atsumoto Ohashi et al.
-
Summary of Towards Realistic Scene Generation with Lidar Diffusion Models, by Haoxi Ran et al.
-
Summary of Tsom: Small Object Motion Detection Neural Network Inspired by Avian Visual Circuit, by Pignge Hu et al.
-
Summary of Self-demos: Eliciting Out-of-demonstration Generalizability in Large Language Models, by Wei He et al.
-
Summary of Mtlight: Efficient Multi-task Reinforcement Learning For Traffic Signal Control, by Liwen Zhu et al.
-
Summary of Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction, by Bo Zou et al.
-
Summary of Mm3dgs Slam: Multi-modal 3d Gaussian Splatting For Slam Using Vision, Depth, and Inertial Measurements, by Lisong C. Sun et al.
-
Summary of A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias, by Yuemei Xu et al.
-
Summary of Evalverse: Unified and Accessible Library For Large Language Model Evaluation, by Jihoo Kim et al.
-
Summary of 360+x: a Panoptic Multi-modal Scene Understanding Dataset, by Hao Chen et al.
-
Summary of Llm-radjudge: Achieving Radiologist-level Evaluation For X-ray Report Generation, by Zilong Wang et al.
-
Summary of Teeth-seg: An Efficient Instance Segmentation Framework For Orthodontic Treatment Based on Anthropic Prior Knowledge, by Bo Zou et al.
-
Summary of Source-aware Training Enables Knowledge Attribution in Language Models, by Muhammad Khalifa et al.
-
Summary of Survey Of Bias in Text-to-image Generation: Definition, Evaluation, and Mitigation, by Yixin Wan et al.
-
Summary of Is Factuality Enhancement a Free Lunch For Llms? Better Factuality Can Lead to Worse Context-faithfulness, by Baolong Bi et al.
-
Summary of Deft: Decoding with Flash Tree-attention For Efficient Tree-structured Llm Inference, by Jinwei Yao et al.
-
Summary of Your Co-workers Matter: Evaluating Collaborative Capabilities Of Language Models in Blocks World, by Guande Wu et al.
-
Summary of Instruction-driven Game Engines on Large Language Models, by Hongqiu Wu et al.
-
Summary of Long-tailed Recognition on Binary Networks by Calibrating a Pre-trained Model, by Jihun Kim et al.
-
Summary of Bayesian Exploration Of Pre-trained Models For Low-shot Image Classification, by Yibo Miao et al.
-
Summary of Advancing Multimodal Data Fusion in Pain Recognition: a Strategy Leveraging Statistical Correlation and Human-centered Perspectives, by Xingrui Gu et al.
-
Summary of Memory-scalable and Simplified Functional Map Learning, by Robin Magnet et al.
-
Summary of Ontology in Holonic Cooperative Manufacturing: a Solution to Share and Exchange the Knowledge, by Ahmed R. Sadik et al.
-
Summary of Taco — Twitter Arguments From Conversations, by Marc Feger and Stefan Dietze
-
Summary of Can Llms Master Math? Investigating Large Language Models on Math Stack Exchange, by Ankit Satpute and Noah Giessing and Andre Greiner-Petter and Moritz Schubotz and Olaf Teschke and Akiko Aizawa and Bela Gipp
-
Summary of Dialectical Alignment: Resolving the Tension Of 3h and Security Threats Of Llms, by Shu Yang et al.
-
Summary of Configurable Safety Tuning Of Language Models with Synthetic Preference Data, by Victor Gallego
-
Summary of Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches, by Lingxuan Wu et al.
-
Summary of A Theory For Length Generalization in Learning to Reason, by Changnan Xiao and Bing Liu
-
Summary of Rlgnet: Repeating-local-global History Network For Temporal Knowledge Graph Reasoning, by Ao Lv et al.