Transformer – Page 222 – GrooveSquid.com

Loading Now

July 13, 2025

Summary of Re-evaluating the Memory-balanced Pipeline Parallelism: Bpipe, by Mincong Huang et al.

Re-evaluating the Memory-balanced Pipeline Parallelism: BPipeby Mincong Huang, Chao Wang, Chi Ma, Yineng Zhang, Peng…

July 13, 2025

Summary of Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling, by Himmet Toprak Kesgin et al.

Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modelingby Himmet Toprak Kesgin,…

July 13, 2025

Summary of Transformer Neural Autoregressive Flows, by Massimiliano Patacchiola et al.

Transformer Neural Autoregressive Flowsby Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. TurnerFirst submitted to…

July 13, 2025

Summary of Kernel-u-net: Multivariate Time Series Forecasting Using Custom Kernels, by Jiang You et al.

Kernel-U-Net: Multivariate Time Series Forecasting using Custom Kernelsby Jiang You, Arben Cela, René Natowicz, Jacob…

July 13, 2025

Summary of An Autoregressive Text-to-graph Framework For Joint Entity and Relation Extraction, by Urchade Zaratiana et al.

An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extractionby Urchade Zaratiana, Nadi Tomeh, Pierre…

July 13, 2025

Summary of Boosting Transformer’s Robustness and Efficacy in Ppg Signal Artifact Detection with Self-supervised Learning, by Thanh-dung Le

Boosting Transformer’s Robustness and Efficacy in PPG Signal Artifact Detection with Self-Supervised Learningby Thanh-Dung LeFirst…

July 13, 2025

Summary of Secformer: Fast and Accurate Privacy-preserving Inference For Transformer Models Via Smpc, by Jinglong Luo et al.

SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPCby Jinglong Luo, Yehong Zhang,…

July 13, 2025

Summary of Graphgpt: Generative Pre-trained Graph Eulerian Transformer, by Qifang Zhao et al.

GraphGPT: Generative Pre-trained Graph Eulerian Transformerby Qifang Zhao, Weidong Ren, Tianyu Li, Hong Liu, Xingsheng…

July 13, 2025

Summary of L3cube-mahasocialner: a Social Media Based Marathi Ner Dataset and Bert Models, by Harsh Chaudhari et al.

L3Cube-MahaSocialNER: A Social Media based Marathi NER Dataset and BERT modelsby Harsh Chaudhari, Anuja Patil,…

July 13, 2025

Summary of Transformer Multivariate Forecasting: Less Is More?, by Jingjing Xu et al.

Transformer Multivariate Forecasting: Less is More?by Jingjing Xu, Caesar Wu, Yuan-Fang Li, Pascal BouvryFirst submitted…