Summary of From Simple to Professional: a Combinatorial Controllable Image Captioning Agent, by Xinran Wang et al.
From Simple to Professional: A Combinatorial Controllable Image Captioning Agentby Xinran Wang, Muxi Diao, Baoteng…
From Simple to Professional: A Combinatorial Controllable Image Captioning Agentby Xinran Wang, Muxi Diao, Baoteng…
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generationby Hang Zhang, Zhuoling Li,…
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Modelsby Yujin Wang, Quanfeng Liu,…
NITRO: LLM Inference on Intel Laptop NPUsby Anthony Fei, Mohamed S. AbdelfattahFirst submitted to arxiv…
Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems…
LAW: Legal Agentic Workflows for Custody and Fund Services Contractsby William Watson, Nicole Cho, Nishan…
AD-LLM: Benchmarking Large Language Models for Anomaly Detectionby Tiankai Yang, Yi Nian, Shawn Li, Ruiyao…
Leveraging Large Language Models for Active Merchant Non-player Charactersby Byungjun Kim, Minju Kim, Dayeon Seo,…
Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deploymentby Haisheng Lu,…
Task-Oriented Dialog Systems for the Senegalese Wolof Languageby Derguene Mbaye, Moussa DialloFirst submitted to arxiv…