ClipSitu: Effectively Leveraging CLIP for Conditional Predictions in Situation Recognition | ComputerVisionFoundation Videos | Podwise