Paper List
-
Summary of Lacoste: Exploiting Stereo and Temporal Contexts For Surgical Instrument Segmentation, by Qiyuan Wang and Shang Zhao and Zikang Xu and S Kevin Zhou
-
Summary of Ai-driven Virtual Teacher For Enhanced Educational Efficiency: Leveraging Large Pretrain Models For Autonomous Error Analysis and Correction, by Tianlong Xu et al.
-
Summary of Amber — Advanced Segformer For Multi-band Image Segmentation: An Application to Hyperspectral Imaging, by Andrea Dosi et al.
-
Summary of Lagrange Duality and Compound Multi-attention Transformer For Semi-supervised Medical Image Segmentation, by Fuchen Zheng et al.
-
Summary of Ugad: Universal Generative Ai Detector Utilizing Frequency Fingerprints, by Inzamamul Alam et al.
-
Summary of Autonomous Vehicle Controllers From End-to-end Differentiable Simulation, by Asen Nachkov et al.
-
Summary of Probtalk3d: Non-deterministic Emotion Controllable Speech-driven 3d Facial Animation Synthesis Using Vq-vae, by Sichun Wu and Kazi Injamamul Haque and Zerrin Yumak
-
Summary of Travelagent: An Ai Assistant For Personalized Travel Planning, by Aili Chen et al.
-
Summary of The Clc-uket Dataset: Benchmarking Case Outcome Prediction For the Uk Employment Tribunal, by Huiyuan Xie et al.
-
Summary of Audiobert: Audio Knowledge Augmented Language Model, by Hyunjong Ok et al.
-
Summary of Source2synth: Synthetic Data Generation and Curation Grounded in Real Data Sources, by Alisia Lupidi et al.
-
Summary of Lt3sd: Latent Trees For 3d Scene Diffusion, by Quan Meng et al.
-
Summary of Windows Agent Arena: Evaluating Multi-modal Os Agents at Scale, by Rogerio Bonatti et al.
-
Summary of Flashsplat: 2d to 3d Gaussian Splatting Segmentation Solved Optimally, by Qiuhong Shen et al.
-
Summary of Ifadapter: Instance Feature Control For Grounded Text-to-image Generation, by Yinwei Wu et al.
-
Summary of 360pant: Training-free Text-driven 360-degree Panorama-to-panorama Translation, by Hai Wang et al.
-
Summary of Bayesian Inverse Graphics For Few-shot Concept Learning, by Octavio Arriaga et al.
-
Summary of When Context Leads but Parametric Memory Follows in Large Language Models, by Yufei Tao et al.
-
Summary of Knowledge Tagging with Large Language Model Based Multi-agent System, by Hang Li et al.
-
Summary of Inter Observer Variability Assessment Through Ordered Weighted Belief Divergence Measure in Magdm Application to the Ensemble Classifier Feature Fusion, by Pragya Gupta et al.
-
Summary of A Bert-based Summarization Approach For Depression Detection, by Hossein Salahshoor Gavalan et al.
-
Summary of Expediting and Elevating Large Language Model Reasoning Via Hidden Chain-of-thought Decoding, by Tianqiao Liu et al.
-
Summary of Medic: Towards a Comprehensive Framework For Evaluating Llms in Clinical Applications, by Praveen K Kanithi et al.
-
Summary of Explanation, Debate, Align: a Weak-to-strong Framework For Language Model Generalization, by Mehrdad Zakershahrak et al.
-
Summary of Securing Vision-language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks, by Md Zarif Hossain et al.
-
Summary of Awaking the Slides: a Tuning-free and Knowledge-regulated Ai Tutoring System Via Language Model Coordination, by Daniel Zhang-li et al.
-
Summary of “my Grade Is Wrong!”: a Contestable Ai Framework For Interactive Feedback in Evaluating Student Essays, by Shengxin Hong et al.
-
Summary of Super: Evaluating Agents on Setting Up and Executing Tasks From Research Repositories, by Ben Bogin et al.
-
Summary of Reflective Human-machine Co-adaptation For Enhanced Text-to-image Generation Dialogue System, by Yuheng Feng et al.
-
Summary of Machine Learning and Constraint Programming For Efficient Healthcare Scheduling, by Aymen Ben Said and Malek Mouhoub
-
Summary of Small Object Detection For Indoor Assistance to the Blind Using Yolo Nas Small and Super Gradients, by Rashmi Bn et al.
-
Summary of A Novel Mathematical Framework For Objective Characterization Of Ideas Through Vector Embeddings in Llm, by B. Sankar and Dibakar Sen
-
Summary of An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting, by Xia Hou et al.
-
Summary of Open-vocabulary Remote Sensing Image Semantic Segmentation, by Qinglong Cao et al.
-
Summary of Dsbench: How Far Are Data Science Agents to Becoming Data Science Experts?, by Liqiang Jing et al.
-
Summary of Advancing Depth Anything Model For Unsupervised Monocular Depth Estimation in Endoscopy, by Bojian Li et al.
-
Summary of Transfer Learning Applied to Computer Vision Problems: Survey on Current Progress, Limitations, and Opportunities, by Aaryan Panda et al.
-
Summary of Firestereo: Forest Infrared Stereo Dataset For Uas Depth Perception in Visually Degraded Environments, by Devansh Dhrafani et al.
-
Summary of Multi-object Event Graph Representation Learning For Video Question Answering, by Yanan Wang and Shuichiro Haruta and Donghuo Zeng and Julio Vizcarra and Mori Kurokawa
-
Summary of Top-down Activity Representation Learning For Video Question Answering, by Yanan Wang and Shuichiro Haruta and Donghuo Zeng and Julio Vizcarra and Mori Kurokawa
-
Summary of Affsegnet: Adaptive Feature Fusion Segmentation Network For Microtumors and Multi-organ Segmentation, by Fuchen Zheng et al.
-
Summary of Modeling Image Tone Dichotomy with the Power Function, by Axel Martinez et al.
-
Summary of Hint-ad: Holistically Aligned Interpretability in End-to-end Autonomous Driving, by Kairui Ding et al.
-
Summary of Lime: Less Is More For Mllm Evaluation, by King Zhu et al.
-
Summary of Nsp: a Neuro-symbolic Natural Language Navigational Planner, by William English et al.
-
Summary of Intrapartum Ultrasound Image Segmentation Of Pubic Symphysis and Fetal Head Using Dual Student-teacher Framework with Cnn-vit Collaborative Learning, by Jianmei Jiang et al.
-
Summary of Fsmdet: Vision-guided Feature Diffusion For Fully Sparse 3d Detector, by Tianran Liu et al.
-
Summary of You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing Ai Game Masters with Function Calling, by Jaewoo Song et al.
-
Summary of Beyond Iid: Optimizing Instruction Learning From the Perspective Of Instruction Interaction and Dependency, by Hanyu Zhao et al.
-
Summary of Native Vs Non-native Language Prompting: a Comparative Analysis, by Mohamed Bayan Kmainasi et al.
-
Summary of Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout, by Anbin Qi et al.
-
Summary of Legal Fact Prediction: the Missing Piece in Legal Judgment Prediction, by Junkai Liu et al.
-
Summary of Ontology-free General-domain Knowledge Graph-to-text Generation Dataset Synthesis Using Large Language Model, by Daehee Kim et al.
-
Summary of Redundancy-aware Camera Selection For Indoor Scene Neural Rendering, by Zehao Wang et al.
-
Summary of Credibility-limited Revision For Epistemic Spaces, by Kai Sauerwald
-
Summary of Dcmac: Demand-aware Customized Multi-agent Communication Via Upper Bound Training, by Dongkun Huo et al.
-
Summary of Leveraging Unstructured Text Data For Federated Instruction Tuning Of Large Language Models, by Rui Ye et al.
-
Summary of Enhancing Angular Resolution Via Directionality Encoding and Geometric Constraints in Brain Diffusion Tensor Imaging, by Sheng Chen et al.
-
Summary of Propaganda to Hate: a Multimodal Analysis Of Arabic Memes with Multi-agent Llms, by Firoj Alam et al.
-
Summary of Thermalgaussian: Thermal 3d Gaussian Splatting, by Rongfeng Lu et al.
-
Summary of Module-wise Adaptive Adversarial Training For End-to-end Autonomous Driving, by Tianyuan Zhang et al.
-
Summary of Applying Attribution Explanations in Truth-discovery Quantitative Bipolar Argumentation Frameworks, by Xiang Yin et al.
-
Summary of Messirve: a Large-scale Spanish Information Retrieval Dataset, by Francisco Valentini et al.
-
Summary of Case Study: Leveraging Genai to Build Ai-based Surrogates and Regressors For Modeling Radio Frequency Heating in Fusion Energy Science, by E. Wes Bethel et al.
-
Summary of Accelerating Large Language Model Pretraining Via Lfr Pedagogy: Learn, Focus, and Review, by Neha Prakriya et al.
-
Summary of Larger Language Models Don’t Care How You Think: Why Chain-of-thought Prompting Fails in Subjective Tasks, by Georgios Chochlakis et al.
-
Summary of Novi: Chatbot System For University Novice with Bert and Llms, by Yoonji Nam et al.
-
Summary of Towards Generalizable Scene Change Detection, by Jaewoo Kim et al.
-
Summary of Keyword-aware Asr Error Augmentation For Robust Dialogue State Tracking, by Jihyun Lee et al.
-
Summary of Enhancing Long Video Understanding Via Hierarchical Event-based Memory, by Dingxin Cheng et al.
-
Summary of Texture-ad: An Anomaly Detection Dataset and Benchmark For Real Algorithm Development, by Tianwu Lei and Bohan Wang and Silin Chen and Shurong Cao and Ningmu Zou
-
Summary of Magda: Multi-agent Guideline-driven Diagnostic Assistance, by David Bani-harouni et al.
-
Summary of Learning Generative Interactive Environments by Trained Agent Exploration, by Naser Kazemi et al.
-
Summary of Distilling Generative-discriminative Representations For Very Low-resolution Face Recognition, by Junzheng Zhang et al.
-
Summary of Elucidating Optimal Reward-diversity Tradeoffs in Text-to-image Diffusion Models, by Rohit Jena et al.
-
Summary of An Effective Context-balanced Adaptation Approach For Long-tailed Speech Recognition, by Yi-cheng Wang et al.
-
Summary of Questioning Internal Knowledge Structure Of Large Language Models Through the Lens Of the Olympic Games, by Juhwan Choi et al.
-
Summary of World-grounded Human Motion Recovery Via Gravity-view Coordinates, by Zehong Shen et al.
-
Summary of Eyeclip: a Visual-language Foundation Model For Multi-modal Ophthalmic Image Analysis, by Danli Shi et al.
-
Summary of Quantifying and Enabling the Interpretability Of Clip-like Models, by Avinash Madasu et al.
-
Summary of Llama-omni: Seamless Speech Interaction with Large Language Models, by Qingkai Fang et al.
-
Summary of Semifactual Explanations For Reinforcement Learning, by Jasmina Gajcin et al.
-
Summary of Ad-net: Attention-based Dilated Convolutional Residual Network with Guided Decoder For Robust Skin Lesion Segmentation, by Asim Naveed et al.
-
Summary of Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models, by Camilo Thorne et al.
-
Summary of Harmonic Reasoning in Large Language Models, by Anna Kruspe
-
Summary of Seeing Through the Mask: Rethinking Adversarial Examples For Captchas, by Yahya Jabary et al.
-
Summary of Hmaflow: Learning More Accurate Optical Flow Via Hierarchical Motion Field Alignment, by Dianbo Ma et al.
-
Summary of Causejudger: Identifying the Cause with Llms For Abductive Logical Reasoning, by Jinwei He and Feng Lu
-
Summary of Lerojd: Lidar Extended Radar-only Object Detection, by Patrick Palmer et al.
-
Summary of Memorag: Moving Towards Next-gen Rag Via Memory-inspired Knowledge Discovery, by Hongjin Qian et al.
-
Summary of Exddi: Explaining Drug-drug Interaction Predictions with Natural Language, by Zhaoyue Sun et al.
-
Summary of Adapted-moe: Mixture Of Experts with Test-time Adaption For Anomaly Detection, by Tianwu Lei et al.
-
Summary of 3d-sar Tomography and Machine Learning For High-resolution Tree Height Estimation, by Grace Colverd et al.
-
Summary of Replay Consolidation with Label Propagation For Continual Object Detection, by Riccardo De Monte et al.
-
Summary of Rirag: Regulatory Information Retrieval and Answer Generation, by Tuba Gokhan and Kexin Wang and Iryna Gurevych and Ted Briscoe
-
Summary of Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding, by Bram Willemsen et al.