Summary of Language Model Can Listen While Speaking, by Ziyang Ma et al.
Language Model Can Listen While Speakingby Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo…
Language Model Can Listen While Speakingby Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo…
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planningby Changze Li, Ziheng Ji, Zhe Chen,…
WAS: Dataset and Methods for Artistic Text Segmentationby Xudong Xie, Yuzhe Li, Yang Liu, Zhifei…
AI Safety in Practice: Enhancing Adversarial Robustness in Multimodal Image Captioningby Maisha Binte Rashid, Pablo…
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasetsby Muhammad Abdullah Jamal, Omid MohareriFirst submitted…
Improving Domain-Specific ASR with LLM-Generated Contextual Descriptionsby Jiwon Suh, Injae Na, Woohwan JungFirst submitted to…
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolutionby Jintong Hu, Bin Xia, Bin…
ALLaM: Large Language Models for Arabic and Englishby M Saiful Bari, Yazeed Alnumay, Norah A.…
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognitionby Gagan Bhatia, El…
Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and…