Summary of A Post-training Enhanced Optimization Approach For Small Language Models, by Keke Zhai
A Post-Training Enhanced Optimization Approach for Small Language Models by Keke Zhai. First submitted to arxiv on:…
Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status by Samuel Lee,…
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network by Nouf Alabbasi, Omar Erak, Omar Alhussein,…
Graph-based Confidence Calibration for Large Language Models by Yukun Li, Sijia Wang, Lifu Huang, Li-Ping Liu. First…
Regress, Don’t Guess – A Regression-like Loss on Number Tokens for Language Models by Jonas Zausinger,…
Interacting Large Language Model Agents. Interpretable Models and Social Learning by Adit Jain, Vikram Krishnamurthy. First submitted…
AttackQA: Development and Adoption of a Dataset for Assisting Cybersecurity Operations using Fine-tuned and Open-Source…
Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers by Gavia Gray,…
LLaMo: Large Language Model-based Molecular Graph Assistant by Jinyoung Park, Minseong Bae, Dohwan Ko, Hyunwoo J.…
MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees by Ryan Zhang, Herbert Woisetschläger,…