FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | ComputerVisionFoundation Videos | Podwise