Summary of Re-evaluating the Memory-balanced Pipeline Parallelism: Bpipe, by Mincong Huang et al.
Re-evaluating the Memory-balanced Pipeline Parallelism: BPipeby Mincong Huang, Chao Wang, Chi Ma, Yineng Zhang, Peng…
Re-evaluating the Memory-balanced Pipeline Parallelism: BPipeby Mincong Huang, Chao Wang, Chi Ma, Yineng Zhang, Peng…
Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modelingby Himmet Toprak Kesgin,…
Transformer Neural Autoregressive Flowsby Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. TurnerFirst submitted to…
Kernel-U-Net: Multivariate Time Series Forecasting using Custom Kernelsby Jiang You, Arben Cela, René Natowicz, Jacob…
An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extractionby Urchade Zaratiana, Nadi Tomeh, Pierre…
Boosting Transformer’s Robustness and Efficacy in PPG Signal Artifact Detection with Self-Supervised Learningby Thanh-Dung LeFirst…
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPCby Jinglong Luo, Yehong Zhang,…
GraphGPT: Generative Pre-trained Graph Eulerian Transformerby Qifang Zhao, Weidong Ren, Tianyu Li, Hong Liu, Xingsheng…
L3Cube-MahaSocialNER: A Social Media based Marathi NER Dataset and BERT modelsby Harsh Chaudhari, Anuja Patil,…
Transformer Multivariate Forecasting: Less is More?by Jingjing Xu, Caesar Wu, Yuan-Fang Li, Pascal BouvryFirst submitted…