Paper List
We recommend you use the search box as this list is very long.
-
Summary of Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model, by Xia Hou et al.
-
Summary of Enhancements For Real-time Monte-carlo Tree Search in General Video Game Playing, by Dennis J.n.j. Soemers and Chiara F. Sironi and Torsten Schuster and Mark H.m. Winands
-
Summary of Learning Disentangled Representation in Object-centric Models For Visual Dynamics Prediction Via Transformers, by Sanket Gandhi et al.
-
Summary of Imc 2024 Methods & Solutions Review, by Shyam Gupta et al.
-
Summary of Improving Retrieval-augmented Text-to-sql with Ast-based Ranking and Schema Pruning, by Zhili Shen and Pavlos Vougiouklis and Chenxin Diao and Kaustubh Vyas and Yuanyi Ji and Jeff Z. Pan
-
Summary of Vchar:variance-driven Complex Human Activity Recognition Framework with Generative Representation, by Yuan Sun et al.
-
Summary of Improved Noise Schedule For Diffusion Training, by Tiankai Hang et al.
-
Summary of Anole: Adapting Diverse Compressed Models For Cross-scene Prediction on Mobile Devices, by Yunzhe Li et al.
-
Summary of Ddpm-moco: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning, by Yangfan He et al.
-
Summary of An Outline Of Prognostics and Health Management Large Model: Concepts, Paradigms, and Challenges, by Laifa Tao et al.
-
Summary of Collaborative Quest Completion with Llm-driven Non-player Characters in Minecraft, by Sudha Rao et al.
-
Summary of Improving Llm Abilities in Idiomatic Translation, by Sundesh Donthi et al.
-
Summary of Belief Sharing: a Blessing or a Curse, by Ozan Catal et al.
-
Summary of Free Energy in a Circumplex Model Of Emotion, by Candice Pattisapu et al.
-
Summary of Mmedagent: Learning to Use Medical Tools with Multi-modal Agent, by Binxu Li et al.
-
Summary of Minds, Brains, Ai, by Jay Seitz
-
Summary of Autosplat: Constrained Gaussian Splatting For Autonomous Driving Scene Reconstruction, by Mustafa Khan and Hamidreza Fazlali and Dhruv Sharma and Tongtong Cao and Dongfeng Bai and Yuan Ren and Bingbing Liu
-
Summary of Wildfire Autonomous Response and Prediction Using Cellular Automata (warp-ca), by Abdelrahman Ramadan
-
Summary of Nollywood: Let’s Go to the Movies!, by John E. Ortega and Ibrahim Said Ahmad and William Chen
-
Summary of A Practical Review Of Mechanistic Interpretability For Transformer-based Language Models, by Daking Rai et al.
-
Summary of Adversarial Magnification to Deceive Deepfake Detection Through Super Resolution, by Davide Alessandro Coccomini et al.
-
Summary of Reasoning in Large Language Models: a Geometric Perspective, by Romain Cosentino et al.
-
Summary of Medvh: Towards Systematic Evaluation Of Hallucination For Large Vision Language Models in the Medical Context, by Zishan Gu et al.
-
Summary of Artificial Intelligence and Machine Learning Generated Conjectures with Txgraffiti, by Randy Davila
-
Summary of Emotion and Intent Joint Understanding in Multimodal Conversation: a Benchmarking Dataset, by Rui Liu et al.
-
Summary of Images Speak Louder Than Words: Understanding and Mitigating Bias in Vision-language Model From a Causal Mediation Perspective, by Zhaotian Weng et al.
-
Summary of 52b to 1t: Lessons Learned Via Tele-flm Series, by Xiang Li et al.
-
Summary of Mindbench: a Comprehensive Benchmark For Mind Map Structure Recognition and Analysis, by Lei Chen et al.
-
Summary of Fast Maneuver Recovery From Aerial Observation: Trajectory Clustering and Outliers Rejection, by Nelson De Moura (astra) et al.
-
Summary of Translatotron-v(ison): An End-to-end Model For In-image Machine Translation, by Zhibin Lan et al.
-
Summary of Gracore: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models, by Zike Yuan et al.
-
Summary of Towards Negotiative Dialogue For the Talkamatic Dialogue Manager, by Staffan Larsson et al.
-
Summary of Integrate the Essence and Eliminate the Dross: Fine-grained Self-consistency For Free-form Language Generation, by Xinglin Wang et al.
-
Summary of Hrsam: Efficient Interactive Segmentation in High-resolution Images, by You Huang et al.
-
Summary of Gemmar: Enhancing Llms Through Arabic Instruction-tuning, by Hasna Chouikhi et al.
-
Summary of Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots, by Jiaqi Luo
-
Summary of Automatic Adaptation Rule Optimization Via Large Language Models, by Yusei Ishimizu et al.
-
Summary of How to Learn in a Noisy World? Self-correcting the Real-world Data Noise in Machine Translation, by Yan Meng et al.
-
Summary of Generative Monoculture in Large Language Models, by Fan Wu et al.
-
Summary of Fedia: Federated Medical Image Segmentation with Heterogeneous Annotation Completeness, by Yangyang Xiang et al.
-
Summary of Mtmamba: Enhancing Multi-task Dense Scene Understanding by Mamba-based Decoders, By Baijiong Lin et al.
-
Summary of A Refreshed Similarity-based Upsampler For Direct High-ratio Feature Upsampling, by Minghao Zhou et al.
-
Summary of Vfimamba: Video Frame Interpolation with State Space Models, by Guozhen Zhang and Chunxu Liu and Yutao Cui and Xiaotong Zhao and Kai Ma and Limin Wang
-
Summary of Rethinking Data Augmentation For Robust Lidar Semantic Segmentation in Adverse Weather, by Junsung Park et al.
-
Summary of Rvisa: Reasoning and Verification For Implicit Sentiment Analysis, by Wenna Lai et al.
-
Summary of Exploring the Role Of Transliteration in In-context Learning For Low-resource Languages Written in Non-latin Scripts, by Chunlan Ma et al.
-
Summary of Talking to Machines: Do You Read Me?, by Lina M. Rojas-barahona
-
Summary of Face Reconstruction Transfer Attack As Out-of-distribution Generalization, by Yoon Gyo Jung et al.
-
Summary of Reinforcement Learning and Machine Ethics:a Systematic Review, by Ajay Vishwanath and Louise A. Dennis and Marija Slavkovik
-
Summary of Meta 3d Assetgen: Text-to-mesh Generation with High-quality Geometry, Texture, and Pbr Materials, by Yawar Siddiqui et al.
-
Summary of Predicting Vs. Acting: a Trade-off Between World Modeling & Agent Modeling, by Margaret Li et al.
-
Summary of Ensemble Of Pre-trained Language Models and Data Augmentation For Hate Speech Detection From Arabic Tweets, by Kheir Eddine Daouadi et al.
-
Summary of Crab: Cross-environment Agent Benchmark For Multimodal Language Model Agents, by Tianqi Xu et al.
-
Summary of Scanreason: Empowering 3d Visual Grounding with Reasoning Capabilities, by Chenming Zhu et al.
-
Summary of Fish-bone Diagram Of Research Issue: Gain a Bird’s-eye View on a Specific Research Topic, by Jinghong Li et al.
-
Summary of Sparkle: Enhancing Sparql Generation with Direct Kg Integration in Decoding, by Jaebok Lee and Hyeonjeong Shin
-
Summary of Nlpguard: a Framework For Mitigating the Use Of Protected Attributes by Nlp Classifiers, By Salvatore Greco et al.
-
Summary of Optimized Learning For X-ray Image Classification For Multi-class Disease Diagnoses with Accelerated Computing Strategies, by Sebastian A. Cruz Romero et al.
-
Summary of Deciphering the Factors Influencing the Efficacy Of Chain-of-thought: Probability, Memorization, and Noisy Reasoning, by Akshara Prabhakar et al.
-
Summary of Addressing a Fundamental Limitation in Deep Vision Models: Lack Of Spatial Attention, by Ali Borji
-
Summary of Survey on Knowledge Distillation For Large Language Models: Methods, Evaluation, and Application, by Chuanpeng Yang et al.
-
Summary of Spatio-temporal Graphical Counterfactuals: An Overview, by Mingyu Kang and Duxin Chen and Ziyuan Pu and Jianxi Gao and Wenwu Yu
-
Summary of Grasp: a Grid-based Benchmark For Evaluating Commonsense Spatial Reasoning, by Zhisheng Tang et al.
-
Summary of Sequential Manipulation Against Rank Aggregation: Theory and Algorithm, by Ke Ma et al.
-
Summary of What We Talk About When We Talk About Lms: Implicit Paradigm Shifts and the Ship Of Language Models, by Shengqi Zhu and Jeffrey M. Rzeszotarski
-
Summary of Certainly Uncertain: a Benchmark and Metric For Multimodal Epistemic and Aleatoric Awareness, by Khyathi Raghavi Chandu et al.
-
Summary of Simple Augmentations Of Logical Rules For Neuro-symbolic Knowledge Graph Completion, by Ananjan Nandi et al.
-
Summary of Save: Segment Audio-visual Easy Way Using Segment Anything Model, by Khanh-binh Nguyen and Chae Jung Park
-
Summary of Scaledreamer: Scalable Text-to-3d Synthesis with Asynchronous Score Distillation, by Zhiyuan Ma et al.
-
Summary of Fake News Detection and Manipulation Reasoning Via Large Vision-language Models, by Ruihan Jin et al.
-
Summary of Abstract Dialectical Frameworks Are Boolean Networks (full Version), by Jesse Heyninck et al.
-
Summary of Augmenting Document-level Relation Extraction with Efficient Multi-supervision, by Xiangyu Lin et al.
-
Summary of Face4rag: Factual Consistency Evaluation For Retrieval Augmented Generation in Chinese, by Yunqi Xu et al.
-
Summary of Pron Vs Prompt: Can Large Language Models Already Challenge a World-class Fiction Author at Creative Text Writing?, by Guillermo Marco et al.
-
Summary of Ibsen: Director-actor Agent Collaboration For Controllable and Interactive Drama Script Generation, by Senyu Han et al.
-
Summary of Investigating the Potential Of Sparse Mixtures-of-experts For Multi-domain Neural Machine Translation, by Nadezhda Chirkova et al.
-
Summary of An Empirical Comparison Of Generative Approaches For Product Attribute-value Identification, by Kassem Sabeh et al.
-
Summary of Integrated Feature Analysis For Deep Learning Interpretation and Class Activation Maps, by Yanli Li et al.
-
Summary of Multi-view Black-box Physical Attacks on Infrared Pedestrian Detectors Using Adversarial Infrared Grid, by Kalibinuer Tiliwalidi et al.
-
Summary of Mirai: Evaluating Llm Agents For Event Forecasting, by Chenchen Ye et al.
-
Summary of Sgccnet: Single-stage 3d Object Detector with Saliency-guided Data Augmentation and Confidence Correction Mechanism, by Ao Liang et al.
-
Summary of Large Language Models Are Zero-shot Recognizers For Activities Of Daily Living, by Gabriele Civitarese et al.
-
Summary of Sinkt: a Structure-aware Inductive Knowledge Tracing Model with Large Language Model, by Lingyue Fu et al.
-
Summary of Robot Instance Segmentation with Few Annotations For Grasping, by Moshe Kimhi et al.
-
Summary of Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning, by Matteo Mosconi et al.
-
Summary of Adapting Multilingual Llms to Low-resource Languages with Knowledge Graphs Via Adapters, by Daniil Gurgurov et al.
-
Summary of Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives, by Matteo Ciotola et al.
-
Summary of Dynamic Few-shot Learning For Knowledge Graph Question Answering, by Jacopo D’abramo et al.
-
Summary of Retrieval-augmented Generation in Multilingual Settings, by Nadezhda Chirkova et al.
-
Summary of Self-cognition in Large Language Models: An Exploratory Study, by Dongping Chen et al.
-
Summary of Regmix: Data Mixture As Regression For Language Model Pre-training, by Qian Liu et al.
-
Summary of Scmil: Sparse Context-aware Multiple Instance Learning For Predicting Cancer Survival Probability Distribution in Whole Slide Images, by Zekang Yang and Hong Liu and Xiangdong Wang
-
Summary of Learning Formal Mathematics From Intrinsic Motivation, by Gabriel Poesia et al.
-
Summary of Cafnet: a Confidence-driven Framework For Radar Camera Depth Estimation, by Huawei Sun et al.
-
Summary of Chest-diffusion: a Light-weight Text-to-image Model For Report-to-cxr Generation, by Peng Huang et al.
-
Summary of A Comparative Study Of Quality Evaluation Methods For Text Summarization, by Huyen Nguyen et al.
-
Summary of Diffusion Models and Representation Learning: a Survey, by Michael Fuest et al.
-
Summary of Towards Shutdownable Agents Via Stochastic Choice, by Elliott Thornley et al.
-
Summary of Towards Robust Speech Representation Learning For Thousands Of Languages, by William Chen et al.