Summary of Testing the Feasibility Of Linear Programs with Bandit Feedback, by Aditya Gangrade et al.
Testing the Feasibility of Linear Programs with Bandit Feedbackby Aditya Gangrade, Aditya Gopalan, Venkatesh Saligrama,…
Testing the Feasibility of Linear Programs with Bandit Feedbackby Aditya Gangrade, Aditya Gopalan, Venkatesh Saligrama,…
On Giant’s Shoulders: Effortless Weak to Strong by Dynamic Logits Fusionby Chenghao Fan, Zhenyi Lu,…
GOAL: A Generalist Combinatorial Optimization Agent Learnerby Darko Drakulic, Sofia Michel, Jean-Marc AndreoliFirst submitted to…
HLQ: Fast and Efficient Backpropagation via Hadamard Low-rank Quantizationby Seonggon Kim, Eunhyeok ParkFirst submitted to…
Pessimistic asynchronous sampling in high-cost Bayesian optimizationby Amanda A. Volk, Kristofer G. Reyes, Jeffrey G.…
A review of feature selection strategies utilizing graph data structures and knowledge graphsby Sisi Shao,…
Training Greedy Policy for Proposal Batch Selection in Expensive Multi-Objective Combinatorial Optimizationby Deokjae Lee, Hyun…
Direct Multi-Turn Preference Optimization for Language Agentsby Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang,…
Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGAby Hao Mark Chen, Liam Castelli, Martin…
A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphsby Zhicheng Liang, Yu…