Summary of Prompt-based Length Controlled Generation with Reinforcement Learning, by Renlong Jie et al.
Prompt-Based Length Controlled Generation with Reinforcement Learningby Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang,…
Prompt-Based Length Controlled Generation with Reinforcement Learningby Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang,…
Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deploymentby Jie…
Adversarial Transformer Language Models for Contextual Commonsense Inferenceby Pedro Colon-Hernandez, Henry Lieberman, Yida Xin, Claire…
Iterative Geometry Encoding Volume for Stereo Matchingby Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin YangFirst…
Aviary: training language agents on challenging scientific tasksby Siddharth Narayanan, James D. Braza, Ryan-Rhys Griffiths,…
A Unified Framework for Entropy Search and Expected Improvement in Bayesian Optimizationby Nuojin Cheng, Leonard…
GRIT: Faster and Better Image captioning Transformer Using Dual Visual Featuresby Van-Quang Nguyen, Masanori Suganuma,…
Efficiently Serving LLM Reasoning Programs with Certaindexby Yichao Fu, Junda Chen, Siqi Zhu, Zheyu Fu,…
Weber-Fechner Law in Temporal Difference learning derived from Control as Inferenceby Keiichiro Takahashi, Taisuke Kobayashi,…
Frequency-Masked Embedding Inference: A Non-Contrastive Approach for Time Series Representation Learningby En Fu, Yanyan HuFirst…