Summary of TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance, by Haorui Wang (1) et al.
TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance, by Haorui Wang, Rongzhi Zhang,…
DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection, by Ziqi Yuan, Liyuan Wang,…
On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process, by Gereziher Adhane, Mohammad Mahdi…
A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks, by Saptarshi Mandal, Xiaojun Lin,…
Reverse Thinking Makes LLMs Stronger Reasoners, by Justin Chih-Yao Chen, Zifeng Wang, Hamid Palangi, Rujun Han,…
Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG, by Xinxu Wei, Kanhao Zhao, Yong…
Adaptive Group Robust Ensemble Knowledge Distillation, by Patrik Kenfack, Ulrich Aïvodji, Samira Ebrahimi Kahou. First submitted to…
Quantifying Knowledge Distillation Using Partial Information Decomposition, by Pasan Dissanayake, Faisal Hamman, Barproda Halder, Ilia Sucholutsky,…
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets, by Adrian Iordache, Bogdan Alexe,…
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data, by Anup Shirgaonkar,…