Paper List

Summary of Chatsop: An Sop-guided Mcts Planning Framework For Controllable Llm Dialogue Agents, by Zhigen Li et al.
Summary of Do Generalised Classifiers Really Work on Human Drawn Sketches?, by Hmrishav Bandyopadhyay et al.
Summary of Mobileexperts: a Dynamic Tool-enabled Agent Team in Mobile Devices, by Jiayi Zhang et al.
Summary of Emotion and Intent Joint Understanding in Multimodal Conversation: a Benchmarking Dataset, by Rui Liu et al.
Summary of 52b to 1t: Lessons Learned Via Tele-flm Series, by Xiang Li et al.
Summary of Images Speak Louder Than Words: Understanding and Mitigating Bias in Vision-language Model From a Causal Mediation Perspective, by Zhaotian Weng et al.
Summary of Mindbench: a Comprehensive Benchmark For Mind Map Structure Recognition and Analysis, by Lei Chen et al.
Summary of Fast Maneuver Recovery From Aerial Observation: Trajectory Clustering and Outliers Rejection, by Nelson De Moura (astra) et al.
Summary of Translatotron-v(ison): An End-to-end Model For In-image Machine Translation, by Zhibin Lan et al.
Summary of Gracore: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models, by Zike Yuan et al.
Summary of Towards Negotiative Dialogue For the Talkamatic Dialogue Manager, by Staffan Larsson et al.
Summary of Unified Anomaly Detection Methods on Edge Device Using Knowledge Distillation and Quantization, by Sushovan Jena et al.
Summary of Large Language Models As Evaluators For Scientific Synthesis, by Julia Evans et al.
Summary of Mast Kalandar at Semeval-2024 Task 8: on the Trail Of Textual Origins: Roberta-bilstm Approach to Detect Ai-generated Text, by Jainit Sushil Bafna et al.
Summary of Are Large Language Models Consistent Over Value-laden Questions?, by Jared Moore et al.
Summary of Semiollm: Assessing Large Language Models For Semiological Analysis in Epilepsy Research, by Meghal Dani et al.
Summary of Human-like Linguistic Biases in Neural Speech Models: Phonetic Categorization and Phonotactic Constraints in Wav2vec2.0, by Marianne De Heer Kloots et al.
Summary of What Affects the Stability Of Tool Learning? An Empirical Study on the Robustness Of Tool Learning Frameworks, by Chengrui Huang et al.
Summary of Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model, by Xia Hou et al.
Summary of An Organism Starts with a Single Pix-cell: a Neural Cellular Diffusion For High-resolution Image Synthesis, by Marawan Elbatel et al.
Summary of Enhancements For Real-time Monte-carlo Tree Search in General Video Game Playing, by Dennis J.n.j. Soemers and Chiara F. Sironi and Torsten Schuster and Mark H.m. Winands
Summary of Imc 2024 Methods & Solutions Review, by Shyam Gupta et al.
Summary of Learning Disentangled Representation in Object-centric Models For Visual Dynamics Prediction Via Transformers, by Sanket Gandhi et al.
Summary of Rvisa: Reasoning and Verification For Implicit Sentiment Analysis, by Wenna Lai et al.
Summary of Exploring the Role Of Transliteration in In-context Learning For Low-resource Languages Written in Non-latin Scripts, by Chunlan Ma et al.
Summary of Talking to Machines: Do You Read Me?, by Lina M. Rojas-barahona
Summary of Face Reconstruction Transfer Attack As Out-of-distribution Generalization, by Yoon Gyo Jung et al.
Summary of Reinforcement Learning and Machine Ethics: a Systematic Review, by Ajay Vishwanath and Louise A. Dennis and Marija Slavkovik
Summary of Meta 3d Assetgen: Text-to-mesh Generation with High-quality Geometry, Texture, and Pbr Materials, by Yawar Siddiqui et al.
Summary of Ensemble Of Pre-trained Language Models and Data Augmentation For Hate Speech Detection From Arabic Tweets, by Kheir Eddine Daouadi et al.
Summary of Predicting Vs. Acting: a Trade-off Between World Modeling & Agent Modeling, by Margaret Li et al.
Summary of Belief Sharing: a Blessing or a Curse, by Ozan Catal et al.
Summary of Free Energy in a Circumplex Model Of Emotion, by Candice Pattisapu et al.
Summary of Mmedagent: Learning to Use Medical Tools with Multi-modal Agent, by Binxu Li et al.
Summary of Minds, Brains, Ai, by Jay Seitz
Summary of Wildfire Autonomous Response and Prediction Using Cellular Automata (warp-ca), by Abdelrahman Ramadan
Summary of Autosplat: Constrained Gaussian Splatting For Autonomous Driving Scene Reconstruction, by Mustafa Khan and Hamidreza Fazlali and Dhruv Sharma and Tongtong Cao and Dongfeng Bai and Yuan Ren and Bingbing Liu
Summary of Nollywood: Let’s Go to the Movies!, by John E. Ortega and Ibrahim Said Ahmad and William Chen
Summary of A Practical Review Of Mechanistic Interpretability For Transformer-based Language Models, by Daking Rai et al.
Summary of Adversarial Magnification to Deceive Deepfake Detection Through Super Resolution, by Davide Alessandro Coccomini et al.
Summary of Reasoning in Large Language Models: a Geometric Perspective, by Romain Cosentino et al.
Summary of Medvh: Towards Systematic Evaluation Of Hallucination For Large Vision Language Models in the Medical Context, by Zishan Gu et al.
Summary of Artificial Intelligence and Machine Learning Generated Conjectures with Txgraffiti, by Randy Davila
Summary of Sequential Manipulation Against Rank Aggregation: Theory and Algorithm, by Ke Ma et al.
Summary of A Bounding Box Is Worth One Token: Interleaving Layout and Text in a Large Language Model For Document Understanding, by Jinghui Lu et al.
Summary of Simple Augmentations Of Logical Rules For Neuro-symbolic Knowledge Graph Completion, by Ananjan Nandi et al.
Summary of Save: Segment Audio-visual Easy Way Using Segment Anything Model, by Khanh-binh Nguyen and Chae Jung Park
Summary of Certainly Uncertain: a Benchmark and Metric For Multimodal Epistemic and Aleatoric Awareness, by Khyathi Raghavi Chandu et al.
Summary of Scaledreamer: Scalable Text-to-3d Synthesis with Asynchronous Score Distillation, by Zhiyuan Ma et al.
Summary of Fake News Detection and Manipulation Reasoning Via Large Vision-language Models, by Ruihan Jin et al.
Summary of Abstract Dialectical Frameworks Are Boolean Networks (full Version), by Jesse Heyninck et al.
Summary of Integrate the Essence and Eliminate the Dross: Fine-grained Self-consistency For Free-form Language Generation, by Xinglin Wang et al.
Summary of Hrsam: Efficient Interactive Segmentation in High-resolution Images, by You Huang et al.
Summary of Gemmar: Enhancing Llms Through Arabic Instruction-tuning, by Hasna Chouikhi et al.
Summary of Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots, by Jiaqi Luo
Summary of Automatic Adaptation Rule Optimization Via Large Language Models, by Yusei Ishimizu et al.
Summary of Generative Monoculture in Large Language Models, by Fan Wu et al.
Summary of How to Learn in a Noisy World? Self-correcting the Real-world Data Noise in Machine Translation, by Yan Meng et al.
Summary of Mtmamba: Enhancing Multi-task Dense Scene Understanding by Mamba-based Decoders, by Baijiong Lin et al.
Summary of A Refreshed Similarity-based Upsampler For Direct High-ratio Feature Upsampling, by Minghao Zhou et al.
Summary of Fedia: Federated Medical Image Segmentation with Heterogeneous Annotation Completeness, by Yangyang Xiang et al.
Summary of Rethinking Data Augmentation For Robust Lidar Semantic Segmentation in Adverse Weather, by Junsung Park et al.
Summary of Vfimamba: Video Frame Interpolation with State Space Models, by Guozhen Zhang and Chunxu Liu and Yutao Cui and Xiaotong Zhao and Kai Ma and Limin Wang
Summary of Robot Instance Segmentation with Few Annotations For Grasping, by Moshe Kimhi et al.
Summary of Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning, by Matteo Mosconi et al.
Summary of Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives, by Matteo Ciotola et al.
Summary of Adapting Multilingual Llms to Low-resource Languages with Knowledge Graphs Via Adapters, by Daniil Gurgurov et al.
Summary of Dynamic Few-shot Learning For Knowledge Graph Question Answering, by Jacopo D’abramo et al.
Summary of Retrieval-augmented Generation in Multilingual Settings, by Nadezhda Chirkova et al.
Summary of Regmix: Data Mixture As Regression For Language Model Pre-training, by Qian Liu et al.
Summary of Self-cognition in Large Language Models: An Exploratory Study, by Dongping Chen et al.
Summary of Crab: Cross-environment Agent Benchmark For Multimodal Language Model Agents, by Tianqi Xu et al.
Summary of Fish-bone Diagram Of Research Issue: Gain a Bird’s-eye View on a Specific Research Topic, by Jinghong Li et al.
Summary of Scanreason: Empowering 3d Visual Grounding with Reasoning Capabilities, by Chenming Zhu et al.
Summary of Deciphering the Factors Influencing the Efficacy Of Chain-of-thought: Probability, Memorization, and Noisy Reasoning, by Akshara Prabhakar et al.
Summary of Sparkle: Enhancing Sparql Generation with Direct Kg Integration in Decoding, by Jaebok Lee and Hyeonjeong Shin
Summary of Optimized Learning For X-ray Image Classification For Multi-class Disease Diagnoses with Accelerated Computing Strategies, by Sebastian A. Cruz Romero et al.
Summary of Nlpguard: a Framework For Mitigating the Use Of Protected Attributes by Nlp Classifiers, by Salvatore Greco et al.
Summary of Addressing a Fundamental Limitation in Deep Vision Models: Lack Of Spatial Attention, by Ali Borji
Summary of Spatio-temporal Graphical Counterfactuals: An Overview, by Mingyu Kang and Duxin Chen and Ziyuan Pu and Jianxi Gao and Wenwu Yu
Summary of Survey on Knowledge Distillation For Large Language Models: Methods, Evaluation, and Application, by Chuanpeng Yang et al.
Summary of Grasp: a Grid-based Benchmark For Evaluating Commonsense Spatial Reasoning, by Zhisheng Tang et al.
Summary of What We Talk About When We Talk About Lms: Implicit Paradigm Shifts and the Ship Of Language Models, by Shengqi Zhu and Jeffrey M. Rzeszotarski
Summary of Large Language Model Enhanced Knowledge Representation Learning: a Survey, by Xin Wang et al.
Summary of Finesure: Fine-grained Summarization Evaluation Using Llms, by Hwanjun Song et al.
Summary of Deep Learning For Automated Detection Of Breast Cancer in Deep Ultraviolet Fluorescence Images with Diffusion Probabilistic Model, by Sepehr Salem Ghahfarokhi et al.
Summary of Tokenize the World Into Object-level Knowledge to Address Long-tail Events in Autonomous Driving, by Ran Tian et al.
Summary of Acceleration Method For Generating Perception Failure Scenarios Based on Editing Markov Process, by Canjie Cai
Summary of Mobile-bench: An Evaluation Benchmark For Llm-based Mobile Agents, by Shihan Deng et al.
Summary of Frog: Evaluating Fuzzy Reasoning Of Generalized Quantifiers in Large Language Models, by Yiyuan Li et al.
Summary of Embedded Prompt Tuning: Towards Enhanced Calibration Of Pretrained Models For Medical Images, by Wenqiang Zu and Shenghao Xie and Qing Zhao and Guoqi Li and Lei Ma
Summary of Augmenting Document-level Relation Extraction with Efficient Multi-supervision, by Xiangyu Lin et al.
Summary of Face4rag: Factual Consistency Evaluation For Retrieval Augmented Generation in Chinese, by Yunqi Xu et al.
Summary of Ibsen: Director-actor Agent Collaboration For Controllable and Interactive Drama Script Generation, by Senyu Han et al.
Summary of Pron Vs Prompt: Can Large Language Models Already Challenge a World-class Fiction Author at Creative Text Writing?, by Guillermo Marco et al.
Summary of Investigating the Potential Of Sparse Mixtures-of-experts For Multi-domain Neural Machine Translation, by Nadezhda Chirkova et al.
Summary of An Empirical Comparison Of Generative Approaches For Product Attribute-value Identification, by Kassem Sabeh et al.
Summary of Multi-view Black-box Physical Attacks on Infrared Pedestrian Detectors Using Adversarial Infrared Grid, by Kalibinuer Tiliwalidi et al.
Summary of Integrated Feature Analysis For Deep Learning Interpretation and Class Activation Maps, by Yanli Li et al.
Summary of Mirai: Evaluating Llm Agents For Event Forecasting, by Chenchen Ye et al.