Summary of High-fidelity and Lip-synced Talking Face Synthesis Via Landmark-based Diffusion Model, by Weizhi Zhong et al.
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Modelby Weizhi Zhong, Junfan Lin, Peixin…
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Modelby Weizhi Zhong, Junfan Lin, Peixin…
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognitionby Ahmed Abdelkawy, Asem Ali,…
Investigating Instruction Tuning Large Language Models on Graphsby Kerui Zhu, Bo-Wei Huang, Bowen Jin, Yizhu…
Multi-Agent Planning Using Visual Language Modelsby Michele Brienza, Francesco Argenziano, Vincenzo Suriani, Domenico D. Bloisi,…
Structure and Reduction of MCTS for Explainable-AIby Ronit Bustin, Claudia V. GoldmanFirst submitted to arxiv…
Disentangled Noisy Correspondence Learningby Zhuohang Dang, Minnan Luo, Jihong Wang, Chengyou Jia, Haochen Han, Herun…
Multi-layer Sequence Labeling-based Joint Biomedical Event Extractionby Gongchi Chen, Pengchao Wu, Jinghang Gu, Longhua Qian,…
Document-Level Event Extraction with Definition-Driven ICLby Zhuoyuan Liu, Yilin LuoFirst submitted to arxiv on: 10…
In-Context Exploiter for Extensive-Form Gamesby Shuxin Li, Chang Yang, Youzhi Zhang, Pengdeng Li, Xinrun Wang,…
Metacognitive Myopia in Large Language Modelsby Florian Scholten, Tobias R. Rebholz, Mandy HütterFirst submitted to…