Summary of Teasergen: Generating Teasers For Long Documentaries, by Weihan Xu et al.
TeaserGen: Generating Teasers for Long Documentariesby Weihan Xu, Paul Pu Liang, Haven Kim, Julian McAuley,…
TeaserGen: Generating Teasers for Long Documentariesby Weihan Xu, Paul Pu Liang, Haven Kim, Julian McAuley,…
Herd Mentality in Augmentation – Not a Good Idea! A Robust Multi-stage Approach towards Deepfake…
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoningby Huimu Yu, Xing Wu, Weidong…
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisitionby Jiyeon Kim, Hyunji Lee,…
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone’s Potential with Masked Autoregressive Pretrainingby Yunze Liu, Li YiFirst…
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentationby Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas…
UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perceptionby Chuang Chen, Xiao Sun, Zhi…
EfficientCrackNet: A Lightweight Model for Crack Segmentationby Abid Hasan Zim, Aquib Iqbal, Zaid Al-Huda, Asad…
Enhancing elusive clues in knowledge learning by contrasting attention of language modelsby Jian Gao, Xiao…
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyondby Hong Chen, Xin Wang, Yuwei Zhou, Bin…