Summary of Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis, by Ho Kei Cheng et al.
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis, by Ho Kei Cheng, Masato Ishii, Akio Hayakawa,…
Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs, by Liam Seymour, Basar…
GeoPro-Net: Learning Interpretable Spatiotemporal Prediction Models through Statistically-Guided Geo-Prototyping, by Bang An, Xun Zhou, Zirui Zhou,…
LISA: Learning-Integrated Space Partitioning Framework for Traffic Accident Forecasting on Heterogeneous Spatiotemporal Data, by Bang An,…
A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulation, by Jonathan Schmidt, Luca Schmidt,…
A Multi-Fidelity Graph U-Net Model for Accelerated Physics Simulations, by Rini Jasmine Gladstone, Hadi Meidani. First submitted…
Granger Causality Detection with Kolmogorov-Arnold Networks, by Hongyu Lin, Mohan Ren, Paolo Barucca, Tomaso Aste. First submitted…
Jet: A Modern Transformer-Based Normalizing Flow, by Alexander Kolesnikov, André Susano Pinto, Michael Tschannen. First submitted to…
Rethinking Uncertainty Estimation in Natural Language Generation, by Lukas Aichberger, Kajetan Schweighofer, Sepp Hochreiter. First submitted to…
Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning, by Simon Frieder, Jonas…