Summary of Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequency, by Hyeongjin Kim et al.
Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequencyby Hyeongjin Kim, Sangwon Kim,…
Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequencyby Hyeongjin Kim, Sangwon Kim,…
Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilitiesby Junqi Wang,…
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictionsby Junzhang Liu, Zhecan Wang,…
Revisiting the Robust Generalization of Adversarial Prompt Tuningby Fan Yang, Mingxuan Xia, Sangzhou Xia, Chicheng…
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Expertsby Yunxin Li, Shenyuan Jiang, Baotian Hu,…
MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chainsby Zhaohuan Zhan, Lisha Yu, Sijie…
Large Language Model Bias Mitigation from the Perspective of Knowledge Editingby Ruizhe Chen, Yichen Li,…
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognitionby Honghui Chen, Yuhang…
Adaptation of Distinct Semantics for Uncertain Areas in Polyp Segmentationby Quang Vinh Nguyen, Van Thong…
SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Spaceby Zeren Zhang, Haibo Qin,…