Summary of Planllm: Video Procedure Planning with Refinable Large Language Models, by Dejie Yang and Zijing Zhao and Yang Liu
PlanLLM: Video Procedure Planning with Refinable Large Language Modelsby Dejie Yang, Zijing Zhao, Yang LiuFirst…
PlanLLM: Video Procedure Planning with Refinable Large Language Modelsby Dejie Yang, Zijing Zhao, Yang LiuFirst…
To Predict or Not To Predict? Proportionally Masked Autoencoders for Tabular Data Imputationby Jungkyu Kim,…
Cross-Spectral Vision Transformer for Biometric Authentication using Forehead Subcutaneous Vein Pattern and Periocular Patternby Arun…
Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrievalby Yang Du, Yuqi Liu,…
Mask Approximation Net: A Novel Diffusion Model Approach for Remote Sensing Change Captioningby Dongwei Sun,…
An End-to-End Depth-Based Pipeline for Selfie Image Rectificationby Ahmed Alhawwary, Phong Nguyen-Ha, Janne Mustaniemi, Janne…
Provably Efficient Exploration in Reward Machines with Low Regretby Hippolyte Bourel, Anders Jonsson, Odalric-Ambrym Maillard,…
GAIS: A Novel Approach to Instance Selection with Graph Attention Networksby Zahiriddin Rustamov, Ayham Zaitouny,…
Developing Explainable Machine Learning Model using Augmented Concept Activation Vectorby Reza Hassanpour, Kasim Oztoprak, Niels…
Context-Aware Deep Learning for Multi Modal Depression Detectionby Genevieve Lam, Huang Dongyan, Weisi LinFirst submitted…